共 14 条
- [1] [Anonymous], 1968, Information Theory and Statistics
- [2] Arevian G, 2007, LECT NOTES COMPUT SC, V4669, P425
- [3] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
- [4] Bengio Y, 2013, INT CONF ACOUST SPEE, P8624, DOI 10.1109/ICASSP.2013.6639349
- [5] Glorot X., 2010, P 13 INT C ART INT S, P249, DOI DOI 10.1109/LGRS.2016.2565705
- [6] Graves A, 2013, ARXIV13080850
- [7] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
- [9] Jozefowicz R, 2015, PR MACH LEARN RES, V37, P2342
- [10] Koutník J, 2014, PR MACH LEARN RES, V32, P1863