共 29 条
[1]
LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
[J].
IEEE TRANSACTIONS ON NEURAL NETWORKS,
1994, 5 (02)
:157-166
[3]
Cao D., 2020, Proceedings of the 34th International Conference on Neural Information Processing Systems, P1491
[5]
Cho K., 2014, COMPUT SCI
[8]
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[10]
Kingma D. P., 2015, INT C LEARN REPR