共 26 条
[1]
Alphonso Issac, 2018, RANKING APPROACH COM, V12, P664
[2]
Battenberg E, 2017, 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), P206, DOI 10.1109/ASRU.2017.8268937
[3]
DEVELOPING REAL-TIME STREAMING TRANSFORMER TRANSDUCER FOR SPEECH RECOGNITION ON LARGE-SCALE DATASET
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:5904-5908
[4]
Fan Angela, 2020, arXiv
[5]
Graves A., 2012, arXiv
[6]
Han S., 2016, PROC INT C LEARN REP
[7]
Ju YC, 2008, INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, P2179
[8]
Kudo T, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P66
[9]
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition
[J].
INTERSPEECH 2020,
2020,
:1-5
[10]
Li Jinyu, 2014, INTERSPEECH, P2, DOI 10.21437/INTERSPEECH.2014-432