A hybrid algorithm for speaker adaptation using MAP transformation and adaptation

被引:24
作者
Chien, JT [1 ]
Lee, CH [1 ]
Wang, HC [1 ]
机构
[1] AT&T BELL LABS,DIALOGUE SYST RES DEPT,MURRAY HILL,NJ 07974
关键词
D O I
10.1109/97.586038
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a hybrid algorithm for adapting a set of speaker-independent hidden Markov models (HMM's) to a new speaker based on a combination of maximum a posteriori (MAP) parameter transformation and adaptation. The algorithm is developed by first transforming clusters of HMM parameters through a class of transformation functions, Then, the transformed HMM parameters are further smoothed via Bayesian adaptation, The proposed transformation/adaptation process can be iterated for any given amount of adaptation data, and it converges rapidly in terms of likelihood improvement, The algorithm also gives a better speech recognition performance than that obtained using transformation or adaptation alone for almost any practical amount of adaptation data.
引用
收藏
页码:167 / 169
页数:3
相关论文
共 4 条
[1]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[2]   Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains [J].
Gauvain, Jean-Luc ;
Lee, Chin-Hui .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02) :291-298
[3]   MAXIMUM-LIKELIHOOD LINEAR-REGRESSION FOR SPEAKER ADAPTATION OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS [J].
LEGGETTER, CJ ;
WOODLAND, PC .
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (02) :171-185
[4]   Maximum-likelihood approach to stochastic matching for robust speech recognition [J].
Sankar, A ;
Lee, CH .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (03) :190-202