Semi-supervised learning for acoustic model retraining: Handling speech data with noisy transcript

被引:0
作者
Madan, Abhijith [1 ]
Khopkar, Ayush [1 ]
Nadig, Shreekantha [1 ]
Raghavan, K. M. Srinivasa [1 ]
Eledath, Dhanya [1 ]
Ramasubramanian, V [1 ]
机构
[1] Int Inst Informat Technol Bangalore IIIT B, Bangalore, Karnataka, India
来源
2020 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM | 2020年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We address the problem of retraining a seed acoustic model from a large corpus which is associated with noisy labeling. We propose a forced-alignment likelihood and fuzzy string matching score based iterative selection of the corpus data to retrain the acoustic model in an order of increasing degree of noise in the transcript, yielding a succession of enhanced acoustic-models, offering progressively lower error rates on an held-out test data. We show results in terms of PER (phoneme-errorrate) on a large broadcast news data from a national broadcast network containing multiple languages of transcribed-speech, demonstrating the strong utility of such an approach for training of acoustic models from noisy-transcript.
引用
收藏
页数:5
相关论文
共 18 条
[1]  
[Anonymous], 2011, Proceeding of workshop on new tools and methods for very-large scale phonetics research
[2]  
Beaufays Francoise, P INT 2010
[3]  
Boulianne D., 2011, IEEE 2011 WORKSH AUT, P1, DOI DOI 10.1017/CBO9781107415324.004
[4]  
Chellapriyadharshini M, 2018, INTERSPEECH, P1041
[5]  
Davel MH, 2011, 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, P3160
[6]   Machine Learning Paradigms for Speech Recognition: An Overview [J].
Deng, Li ;
Li, Xiao .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05) :1060-1089
[7]  
Hazen TJ, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P1606
[8]  
Kapralova Olga, P INT 2014
[9]  
Kleynhans N, 2015, PROCEEDINGS OF THE 2015 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), P136, DOI 10.1109/RoboMech.2015.7359512
[10]   A normalized Levenshtein distance metric [J].
Li Yujian ;
Liu Bo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (06) :1091-1095