Discriminative Training of Variable-Parameter HMMs for Noise Robust Speech Recognition

被引:0
作者
Yu, Dong [1 ]
Deng, Li [1 ]
Gong, Yifan [1 ]
Acero, Alex [1 ]
机构
[1] Microsoft Corp, Redmond, WA 98052 USA
来源
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年
关键词
speech recognition; variable-parameter hidden Markov model; discriminative training; cubic spline; growth transformation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new type of variable-parameter hidden Markov model (VPHMM) whose mean and variance parameters vary each as a continuous function of additional environment-dependent parameters. Different from the polynomial-function-based VPHMM proposed by Cui and Gong (2007), the new VPHMM uses cubic splines to represent the dependency of the means and variances of Gaussian mixture's on the environment parameters. Importantly, the new model no longer requires quantization in estimating the model parameters and it supports parameter sharing and instantaneous conditioning parameters directly. We develop and describe a growth-transformation algorithm that discriminatively learns the parameters in our cubic-spline-based VPHMM (CS-VPHMM), and evaluate the model on the Aurora-3 corpus with our recently developed MFCC-MMSE noise suppressor applied. Our experiments show that the proposed CS-VPHMM outperforms the discriminatively trained and maximum-likelihood trained conventional HMMs with relative word error rate (WER) reduction of 14% and 20% respectively under the well-matched conditions when both mean and variances are updated.
引用
收藏
页码:285 / 288
页数:4
相关论文
共 6 条
[1]   Noise estimation by minima controlled recursive averaging for robust speech enhancement [J].
Cohen, I ;
Berdugo, B .
IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) :12-15
[2]   A study of variable-parameter Gaussian mixture hidden Markov modeling for noisy speech recognition [J].
Cui, Xiaodong ;
Gong, Yifan .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04) :1366-1376
[3]  
He X., 2008, IEEE SIGNAL IN PRESS
[4]  
HE X, 2006, P INT WORKSH MULT SI
[5]  
YU D, 2008, PARAMETER C IN PRESS
[6]  
YU D, 2008, IEEE T AUDI IN PRESS