114 items in total
[1]
Baevski A, 2020, ADV NEUR IN, V33
[2]
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 1561-1565
[3]
Carletta J, 2005, LECT NOTES COMPUT SC, V3869, P28
[4]
Chan W, 2016, INT CONF ACOUST SPEE, P4960, DOI 10.1109/ICASSP.2016.7472621
[5]
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation [J]. INTERSPEECH 2022, 2022: 3819-3823
[6]
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition [J]. 2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021: 228-235
[7]
Chang XK, 2020, INT CONF ACOUST SPEE, P6134, DOI 10.1109/ICASSP40776.2020.9054029
[8]
Chang XK, 2019, 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), P237, DOI 10.1109/ASRU46091.2019.9003986
[9]
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio [J]. INTERSPEECH 2021, 2021: 3670-3674