A NOISE-ROBUST SPEECH RECOGNITION METHOD COMPOSED OF WEAK NOISE SUPPRESSION AND WEAK VECTOR TAYLOR SERIES ADAPTATION

被引:0
作者
Komeiji, Shuji [1 ]
Arakawa, Takayuki [1 ]
Koshinaka, Takafumi [1 ]
机构
[1] NEC Corp Ltd, Informat & Media Proc Lab, Nakahara Ku, Kawasaki, Kanagawa 2118666, Japan
来源
2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012) | 2012年
关键词
Automatic Speech Recognition; AURORA2; Noise Suppression; Noise Estimation; VTS; Model Adaptation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a noise-robust speech recognition method composed of weak noise suppression (NS) and weak Vector Taylor Series Adaptation (VTSA). The proposed method compensates defects of NS and VTSA, and gains only the advantages by them. The weak NS reduces distortion by over-suppression that may accompany noise-suppressed speech. The weak VTSA avoids over-adaptation by offsetting a part of acoustic-model adaptation that corresponds to the suppressed noise. Evaluation results with the AURORA2 database show that the proposed method achieves as much as 1.2 points higher word accuracy (87.4%) than a method with VTSA alone (86.2%) that is always better than its counterpart with NS.
引用
收藏
页码:103 / 106
页数:4
相关论文
共 8 条
[1]  
[Anonymous], 2000, INTERSPEECH, DOI DOI 10.1016/S0167-6393(03)00016-5
[2]   SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].
BOLL, SF .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120
[3]   On using acoustic environment classification for statistical model-based speech enhancement [J].
Choi, Jae-Hun ;
Chang, Joon-Hyuk .
SPEECH COMMUNICATION, 2012, 54 (03) :477-490
[4]  
Hansen J. H. L, 1999, ENCY ELECT ELECT ENG
[5]  
Hirsch H. G., 2000, P ISCA ITRW ASR OCT
[6]  
Kato M., 2001, PROC IWAENC2001, P183
[7]  
Li JY, 2007, 2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, P65, DOI 10.1109/ASRU.2007.4430085
[8]  
Martin R., 1994, Signal Processing VII, Theories and Applications. Proceedings of EUSIPCO-94. Seventh European Signal Processing Conference, P1182