A discriminative and robust training algorithm for noisy speech recognition

Cited by: 0
Authors
Hong, WT
Source
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I | 2003
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A technique that combines discriminative and robust training, referred to as D-REST (Discriminative and Robust Environment-effects Suppression Training), is proposed for noisy speech recognition. D-REST models environmental characteristics and phonetic information separately, so it can train speech models discriminatively on phonetic variability while eliminating the disturbance of environment-specific effects. In experiments on a Taiwan stock name recognition task over a wireless network, the proposed D-REST algorithm showed the potential to improve performance both on diverse training data and when the noise types in training and testing are mismatched. Furthermore, D-REST achieved a 60% reduction in average word error rate compared with the conventional MCE/GPD-based training approach without environment-effects suppression training.
Pages: 8-11
Page count: 4