共 16 条
- [1] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
- [2] Bengio Y, 2001, ADV NEUR IN, V13, P932
- [3] Collobert R., 2008, P 25 ICML, P160, DOI [DOI 10.1145/1390156.1390177, 10.1145/1390156.1390177]
- [5] Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
- [6] Hochreiter S, 2001, Gradient flow in recurrent nets: the difficulty of learning long-term dependencies, P237, DOI [10.1109/9780470544037.ch14, DOI 10.1109/9780470544037.CH14]
- [7] Le Q., 2014, DISTRIBUTED REPRESEN, DOI DOI 10.1145/2740908.2742760
- [8] Liu Y, 2015, AAAI CONF ARTIF INTE, P2418
- [9] Mikolov T., 2013, EFFICIENT ESTIMATION
- [10] Mikolov T., 2013, Adv Neural Inf Process Syst, P26, DOI DOI 10.48550/ARXIV.1310.4546