A Survey of Multilingual Models for Automatic Speech Recognition

被引:0
作者
Yadav, Hemant [1 ]
Sitaram, Sunayana [1 ]
机构
[1] Microsoft Res India, Bangalore, India
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
speech recognition; multilingual; low-resource languages; FEATURES;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Although Automatic Speech Recognition (ASR) systems have achieved human-like performance for a few languages, the majority of the world's languages do not have usable systems due to the lack of large speech datasets to train these models. Cross-lingual transfer is an attractive solution to this problem, because low-resource languages can potentially benefit from higher-resource languages either through transfer learning, or being jointly trained in the same multilingual model. The problem of cross-lingual transfer has been well studied in ASR, however, recent advances in Self Supervised Learning are opening up avenues for unlabeled speech data to be used in multilingual ASR models, which can pave the way for improved performance on low-resource languages. In this paper, we survey the state of the art in multilingual ASR models that are built with cross-lingual transfer in mind. We present best practices for building multilingual models from research across diverse languages and techniques, discuss open questions and provide recommendations for future work.
引用
收藏
页码:5071 / 5079
页数:9
相关论文
共 48 条
[1]  
Amodei D, 2016, PR MACH LEARN RES, V48
[2]  
[Anonymous], 11 ANN C INT SPEECH
[3]  
[Anonymous], ARXIV171104564
[4]  
Baevski A., 2020, Advances in neural information processing systems
[5]  
Billa J., 2021, ARXIV21061227
[6]   ISI ASR System for the Low Resource Speech Recognition Challenge for Indian Languages [J].
Billa, Jayadev .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :3207-3211
[7]  
Chan William, 2015, CoRR, V2, P5
[8]  
Chung Y.-A., 2021, ARXIV210806209
[9]  
Conneau A., 2020, ARXIV200613979
[10]  
Conneau Alexis, 2020, P 58 ANN M ASS COMPU, P8440, DOI [10.18653/v1/2020.acl-main.747, DOI 10.18653/V1/2020.ACL-MAIN.747]