SYNTHESIZED STEREO-BASED STOCHASTIC MAPPING WITH DATA SELECTION FOR ROBUST SPEECH RECOGNITION

被引:0
作者
Du, Jun [1 ]
Huo, Qiang [1 ]
机构
[1] Microsoft Res Asia, Beijing, Peoples R China
来源
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING | 2012年
关键词
stereo-based stochastic mapping; HMM-based speech synthesis; data selection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a synthesized stereo-based stochastic mapping approach for robust speech recognition. We extend the traditional stereo-based stochastic mapping (SSM) in two main aspects. First, the constraint of stereo-data, which is not practical in real applications, is relaxed by using HMM-based speech synthesis. Then we make feature mapping more focused on those incorrectly recognized samples via a data selection strategy. Experimental results on Aurora3 databases show that our approach can achieve consistently significant improvements of recognition performance in the well-matched (WM) condition among four different European languages.
引用
收藏
页码:122 / 125
页数:4
相关论文
共 17 条
[1]  
*AALB U, 2001, AU37801
[2]  
ACERO A, 1993, ACOUSTIC ENV ROBUSTN
[3]  
Afify M, 2007, INT CONF ACOUST SPEE, P377
[4]  
[Anonymous], INTERSPEECH
[5]  
[Anonymous], 1996, THESIS CARNEGIE MELL
[6]   Analysis and comparison of two speech feature extraction/compensation algorithms [J].
Deng, L ;
Wu, J ;
Droppo, J ;
Acero, A .
IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (06) :477-480
[7]  
DROPPO J, 2001, P EUR, P217
[8]   HMM-BASED PSEUDO-CLEAN SPEECH SYNTHESIS FOR SPLICE ALGORITHM [J].
Du, Jun ;
Hu, Yu ;
Dai, Li-Rong ;
Wang, Ren-Hua .
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, :4570-4573
[9]   SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY [J].
GONG, YF .
SPEECH COMMUNICATION, 1995, 16 (03) :261-291
[10]  
*NOK, 1999, AU21799 NOK