A discriminative and robust training algorithm for noisy speech recognition

被引：0

作者：

Hong, WT

机构：

来源：

2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I | 2003年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A combined technique of discriminative and robust training algorithms, referred to as the D-REST (Discriminative and Robust Environment-effects Suppression Training), is proposed for noisy speech recognition. The D-REST technique can separately model the environmental characteristics and phonetic information and thus it can train speech models discriminatively on phonetic variability by eliminating the disturbance of environment-specific effects. According to the experimental results of Taiwan stock name recognition task over wireless network, the proposed D-REST algorithm has the potential to improve performance not only on diverse training data but also on noise-type unmatched environments between training and testing. Furthermore, the usage of the D-REST algorithm amounted to a 60% reduction in average word error rate over the performance by the conventional MCE/GPD-based training approach without environment-effects suppression training technique.

引用

页码：8 / 11

页数：4

共 11 条

[1]

Chou W., 1992, ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech and Signal Processing (Cat. No.92CH3103-9), P473, DOI 10.1109/ICASSP.1992.225869

[2] CEPSTRAL PARAMETER COMPENSATION FOR HMM RECOGNITION IN NOISE [J].

GALES, MJF ;

YOUNG, SJ .

SPEECH COMMUNICATION, 1993, 12 (03) :231-239

[3] A robust training algorithm for adverse speech recognition [J].

Hong, WT ;

Chen, SH .

SPEECH COMMUNICATION, 2000, 30 (04) :273-293

[4]

HONG WT, 1997, EUROSPEECH 97, V3, P1083

[5]

HONG WT, 1999, EUROSPEECH 1999, V6, P2495

[6]

KATAGIRI S, 1991, IEEE WORKSH N EUR NE, P229

[7]

LEE LS, 1994, IEEE SIGNAL PROC MAG, P17

[8]

MEYER C, 2001, ICASSP 2001, V1, P293

[9]

Rahim MG, 1996, IEEE T SPEECH AUDI P, V4, P19

[10] Noise compensation methods for hidden Markov model speech recognition in adverse environments [J].

Vaseghi, SV ;

Milner, BP .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (01) :11-21

← 1 2 →