共 30 条
- [21] How Useful is Self-Supervised Pretraining for Visual Tasks? [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7343 - 7352
- [22] Panayotov V, 2015, INT CONF ACOUST SPEE, P5206, DOI 10.1109/ICASSP.2015.7178964
- [23] Prahallad K, 2012, 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, P2545
- [24] Exploring the use of Common Label Set to Improve Speech Recognition of Low Resource Indian Languages [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7223 - 7227
- [25] Snyder D, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P5329
- [26] Srivastava B.M.L., 2018, SLTU, P11
- [27] Tsai HS, 2022, Arxiv, DOI arXiv:2203.06849
- [28] Wang A., 2018, P 2018 EMNLP WORKSH, P353, DOI [DOI 10.18653/V1/W18-5446, DOI 10.18653/V1/W18-5446,URL]
- [29] Yang SW, 2021, Arxiv, DOI arXiv:2105.01051
- [30] SUPERB: Speech processing Universal PERformance Benchmark [J]. INTERSPEECH 2021, 2021, : 1194 - 1198