BAYESIAN ADAPTIVE LEARNING OF THE PARAMETERS OF HIDDEN MARKOV MODEL FOR SPEECH RECOGNITION

被引:41
作者
HUO, Q [1 ]
CHAN, C [1 ]
LEE, CH [1 ]
机构
[1] UNIV HONG KONG,DEPT COMP SCI,HONG KONG,HONG KONG
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 05期
关键词
D O I
10.1109/89.466661
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a theoretical framework for Bayesian adaptive training of the parameters of discrete hidden Markov model (DHMM) and of semi-continuous HMM (SCHMM) with Gaussian mixture state observation densities is presented, In addition to formulating the forward-backward MAP (maximum a posteriori) and the segmental MAP algorithms for estimating the above HMM parameters, a computationally efficient segmental quasi-Bayes algorithm for estimating the state-specific mixture coefficients in SCHMM is developed, For estimating the parameters of the prior densities, a new empirical Bayes method based on the moment estimates is also proposed. The MAP algorithms and the prior parameter specification are directly applicable to training speaker adaptive HMM's, Practical issues related to the use of the proposed techniques for HMM-based speaker adaptation are studied, The proposed MAP algorithms are shown to be effective especially in the cases in which the training or adaptation data are limited.
引用
收藏
页码:334 / 345
页数:12
相关论文
共 46 条
[1]  
Aitchison J., Shen S.M., Logistic-normal distributions: Some properties and uses, Biometrika, 67, pp. 261-272, (1980)
[2]  
Baum L.E., An inequality and associated maximization techniques in statistical estimation for probabilistic functions of Markov processes, Inequalities, 3, pp. 1-8, (1972)
[3]  
Baum L.E., Egon J.A., An inequality with applications to statistical estimation for probabilistic functions of a Markov process and to a model for ecology, Bull. Amer. Math. Soc., 73, pp. 360-363, (1967)
[4]  
Baum L.E., Petrie T., Soules G., Weiss N., A maximization technique occurring in the statistical analysis of probabilistic function of Markov chains, Annals Math. Statis., 41, 1, pp. 164-171, (1970)
[5]  
Bellegarda J.R., Nahamoo D., x2018
[6]  
Tied mixture continuous parameter modeling for speech recognition, IEEE Trans Acoust., 38, 12, pp. 2033-2045, (1990)
[7]  
Brown P.F., Lee C.-H., Spohrer J.C., Bayesian adaptation in speech recognition, Proc. ICASSP-83, pp. 761-764, (1983)
[8]  
DeGroot M.H., Optimal Statistical Decisions. New York: McGraw-Hill, (1970)
[9]  
Dempster A.P., Laird N.M., Rubin D.B., Maximum likelihood from incomplete data via the EM algorithm, Royal Statist. Soc., 39, 1, pp. 1-38, (1977)
[10]  
Ferretti M., Scarci S., Large-vocabulary speech recognition with speaker-adapted codebook and HMM parameters, Proc. Eurospeech89, pp. 154-156, (1989)