28 entries in total
[1]
Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, DOI 10.48550/ARXIV.1409.0473]
[2]
Learning Long-Term Dependencies with Gradient Descent Is Difficult [J]. IEEE Transactions on Neural Networks, 1994, 5(2): 157-166
[3]
Chorowski J, 2015, ADV NEUR IN, V28
[4]
Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition [J]. IEEE Transactions on Audio, Speech, and Language Processing, 2012, 20(1): 30-42
[6]
Ferguson J. D., 1980, Application of hidden Markov models to text and speech
[7]
Graves A.B., 2016, U.S. Patent, Patent No. [9,263,036, 9263036]
[8]
Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
[10]
Hochreiter S., 1997, Neural Computation, V9, P1735