共 29 条
- [1] Bacvski Alexei, 2020, Advances in neural information processing systems, V33, P12449, DOI DOI 10.48550/ARXIV.2006.11477
- [2] Bu H, 2017, 2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), P58, DOI 10.1109/ICSDA.2017.8384449
- [3] Chen G., 2021, P INT 2021
- [4] Duquenne PA, 2021, ADV NEUR IN, V34
- [5] Conformer: Convolution-augmented Transformer for Speech Recognition [J]. INTERSPEECH 2020, 2020, : 5036 - 5040
- [6] Inaguma H, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020): SYSTEM DEMONSTRATIONS, P302
- [7] Jia Y., 2021, ARXIV210708661
- [8] Direct speech-to-speech translation with a sequence-to-sequence model [J]. INTERSPEECH 2019, 2019, : 1123 - 1127
- [9] Jia Y, 2019, INT CONF ACOUST SPEE, P7180, DOI 10.1109/ICASSP.2019.8683343
- [10] Kahn J, 2020, INT CONF ACOUST SPEE, P7669, DOI [10.1109/icassp40776.2020.9052942, 10.1109/ICASSP40776.2020.9052942]