Variational Bayesian learning of speech GMMs for feature enhancement based on Algonquin

被引:0
作者
Pettersen, Svein G. [1 ]
Johnsen, Magne H. [1 ]
Wellekens, Christian [2 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, NO-7491 Trondheim, Norway
[2] Inst Eurecom, F-06904 Sophia Antipolis, France
来源
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年
关键词
speech recognition; speech enhancement; robustness; variational methods;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many feature enhancement methods make use of probabilistic models of speech and noise in order to improve performance of speech recognizers in the presence of background noise. The traditional approach for training such models is maximum likelihood estimation. This paper investigates the novel application of variational Bayesian learning for front-end models under the Algonquin denoising framework. Compared to maximum likelihood training, it is shown that variational Bayesian learning has advantages both in terms of increased robustness with respect to choice of model complexity, as well as increased performance.
引用
收藏
页码:905 / +
页数:2
相关论文
共 10 条
[1]  
Attias H, 2000, ADV NEUR IN, V12, P209
[2]  
ATTIAS H, 2000, P NIPS, P758
[3]   A Bayesian prediction approach to robust speech recognition and online environmental learning [J].
Chien, JT .
SPEECH COMMUNICATION, 2002, 37 (3-4) :321-334
[4]  
Deng L., 2000, P ANN C INT SPEECH C, P806
[5]  
FREY BJ, 2002, P NIPS
[6]  
Hirsch H.G, 2000, P ASR2000 AUT SPEECH
[7]  
KRISTJANSSON TT, 2002, THESIS U WATERLOO
[8]  
MYRVOLL TA, 2004, P ISCLP, P117
[9]  
Valente F., 2003, P EUROSPEECH, P441
[10]   Automatic determination of acoustic model topology using variational Bayesian estimation and clustering for large vocabulary continuous speech recognition [J].
Watanabe, S ;
Sako, A ;
Nakamura, A .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03) :855-872