36 entries in total
[21]
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition [J]. INTERSPEECH 2019, 2019: 2613-2617
[22]
Peddinti V, 2015, 2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P539, DOI 10.1109/ASRU.2015.7404842
[23]
Ramachandran P., 2017, Searching for activation functions
[24]
Rao K, 2017, 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), P193, DOI 10.1109/ASRU.2017.8268935
[25]
Sailor HB, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P980, DOI 10.1109/ASRU46091.2019.9003755
[26]
Sainath TN, 2020, INT CONF ACOUST SPEE, P6059, DOI 10.1109/ICASSP40776.2020.9054188
[27]
MobileNetV2: Inverted Residuals and Linear Bottlenecks [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 4510-4520
[28]
Saon G, 2013, 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P55, DOI 10.1109/ASRU.2013.6707705
[29]
Shen Jonathan, 2019, Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
[30]
Synnaeve G., 2019, END END ASR SUPERVIS