A discriminative and robust training algorithm for noisy speech recognition

Cited by: 0
Authors
Hong, WT
Source
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I | 2003
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A combined discriminative and robust training technique, referred to as D-REST (Discriminative and Robust Environment-effects Suppression Training), is proposed for noisy speech recognition. D-REST models environmental characteristics and phonetic information separately, so it can train speech models discriminatively on phonetic variability while eliminating the disturbance of environment-specific effects. In experiments on a Taiwan stock name recognition task over a wireless network, the proposed D-REST algorithm shows the potential to improve performance both on diverse training data and when the noise types in training and testing are mismatched. Furthermore, D-REST achieved a 60% reduction in average word error rate relative to the conventional MCE/GPD-based training approach without environment-effects suppression training.
Pages: 8-11
Page count: 4