24 entries in total
[1]
Ardila R, 2020, arXiv, arXiv:1912.06670, DOI 10.48550/arXiv.1912.06670
[2]
Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition [J]. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021: 8-15
[3]
Dat V. T., 2022, VNU J SCI COMPUTER S, V38
[4]
An End-to-End Speech Accent Recognition Method Based on Hybrid CTC/Attention Transformer ASR [J]. 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021), 2021: 7253-7257
[5]
Conformer: Convolution-augmented Transformer for Speech Recognition [J]. INTERSPEECH 2020, 2020: 5036-5040
[7]
Deep Residual Learning for Image Recognition [J]. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016: 770-778
[8]
Hung P.N., 2016, J. Comput. Sci. Cybern., V32, P19
[9]
Hung Pham Ngoc, 2019, UTEHY Journal of Science and Technology, V24, P82
[10]
Accent Identification by Combining Deep Neural Networks and Recurrent Neural Networks Trained on Long and Short Term Features [J]. 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), Vols 1-5: Understanding Speech Processing in Humans and Machines, 2016: 2388-2392