Improving Viterbi Bayesian predictive classification via sequential Bayesian learning in robust speech recognition

Cited by: 11
Authors
Jiang, H
Hirose, K
Huo, Q
Affiliations
[1] Univ Tokyo, Dept Informat & Commun Engn, Bunkyo Ku, Tokyo 1138656, Japan
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[3] Univ Hong Kong, Dept Informat Syst & Comp Sci, Hong Kong, Peoples R China
Keywords
Bayesian predictive classification (BPC); Viterbi BPC (VBPC); sequential Bayesian learning; robust speech recognition; natural conjugate prior
DOI
10.1016/S0167-6393(99)00018-7
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we extend our proposed Viterbi Bayesian predictive classification (VBPC) algorithm to a new class of prior probability density functions (pdfs), namely the family of natural conjugate prior pdfs of the complete-data density in the continuous density hidden Markov model (CDHMM), and their mixtures. This lets us adapt the prior pdf on-line via a sequential Bayesian learning algorithm whenever new data become available, so that the performance of VBPC can be continuously improved. We also study a sequential Bayesian learning strategy for CDHMM based on a finite mixture approximation of its prior/posterior density, which attempts to derive a more accurate prior pdf to describe the unknown mismatches. Experimental results on a speaker-independent recognition task of isolated Japanese digits confirm the viability and usefulness of the proposed method. (C) 1999 Elsevier Science B.V. All rights reserved.
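The idea of adapting a natural conjugate prior on-line can be illustrated with a toy case. The sketch below is not the paper's CDHMM algorithm: it assumes a single Gaussian with known variance and a normal prior on its mean, and all names (`NormalPrior`, `update`, `sequential_learning`, `batch_learning`) are hypothetical. It only shows that, with a conjugate prior, each posterior has the same form as the prior and can serve as the prior for the next observation, giving the same result as processing all the data at once.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NormalPrior:
    """Normal density N(mean, var) over an unknown Gaussian mean (toy assumption)."""
    mean: float
    var: float


def update(prior: NormalPrior, x: float, noise_var: float) -> NormalPrior:
    """Fold one observation x ~ N(mu, noise_var) into the prior over mu."""
    # Precisions (inverse variances) add under the conjugate normal-normal model.
    precision = 1.0 / prior.var + 1.0 / noise_var
    mean = (prior.mean / prior.var + x / noise_var) / precision
    return NormalPrior(mean=mean, var=1.0 / precision)


def sequential_learning(prior: NormalPrior, observations, noise_var: float) -> NormalPrior:
    """Process observations one at a time; each posterior becomes the next prior."""
    posterior = prior
    for x in observations:
        posterior = update(posterior, x, noise_var)
    return posterior


def batch_learning(prior: NormalPrior, observations, noise_var: float) -> NormalPrior:
    """Closed-form posterior from all observations at once, for comparison."""
    n = len(observations)
    precision = 1.0 / prior.var + n / noise_var
    mean = (prior.mean / prior.var + sum(observations) / noise_var) / precision
    return NormalPrior(mean=mean, var=1.0 / precision)
```

Because the posterior stays in the same family, only the hyperparameters (here a mean and a variance) need to be stored between updates, which is what makes on-line adaptation practical in this simplified setting.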
Pages: 313-326
Number of pages: 14
Related papers
15 in total
[1] Bernardo J., 1988, BAYESIAN STAT, V3, P67
[2] Furui S., 1997, P ESCA NATO TUT RES, P11
[3] Gauvain, Jean-Luc; Lee, Chin-Hui. Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): 291-298
[4] Huo Q, 1997, IEEE T SPEECH AUDI P, V5, P161, DOI 10.1109/89.554778
[5] HUO Q, 1997, P INT C AC SPEECH SI
[6] HUO Q, 1997, UNPUB IEEE T SPEECH
[7] HUO Q, 1997, P EUR C SPEECH COMM, P1847
[8] JIANG H, 1999, IN PRESS IEEE T SPEE, V7
[9] JIANG H, 1997, P INT C AC SPEECH SI
[10] Lee, CH. On stochastic feature and model compensation approaches to robust speech recognition [J]. SPEECH COMMUNICATION, 1998, 25 (1-3): 29-47