34 entries in total
- [1] Battenberg E, 2017, 2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), P206, DOI 10.1109/ASRU.2017.8268937
- [2] Improving Speech Recognition using GAN-based Speech Synthesis and Contrastive Unspoken Text Selection [J]. INTERSPEECH 2020, 2020: 556-560
- [3] Chorowski J., 2014, NIPS 2014 WORKSH DEE
- [4] SynthASR: Unlocking Synthetic Data for Speech Recognition [J]. INTERSPEECH 2021, 2021: 896-900
- [5] Graves A., 2006, P 23 INT C MACH LEAR, P369, DOI 10.1145/1143844.1143891
- [6] Conformer: Convolution-augmented Transformer for Speech Recognition [J]. INTERSPEECH 2020, 2020: 5036-5040
- [7] Hayashi T, 2018, IEEE W SP LANG TECH, P426, DOI 10.1109/SLT.2018.8639619
- [8] SYNT++: UTILIZING IMPERFECT SYNTHETIC DATA TO IMPROVE SPEECH RECOGNITION [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022: 7682-7686
- [9] Kannan A, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5824, DOI 10.1109/ICASSP.2018.8462682
- [10] Kingma D.P., 2014, arXiv, DOI 10.48550/arXiv.1412.6980