A maximum model distance approach for HMM-based speech recognition

被引:12
作者
Kwong, S [1 ]
He, QH
Man, KF
Tang, KS
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong
[2] S China Univ Technol, Dept Elect Engn, Guangzhou, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong
关键词
hidden Markov mode; maximum likelihood; corrective training; speech recognition; stochastic process;
D O I
10.1016/S0031-3203(97)00042-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new approach for HMM-training which is based on the maximum model distance (MMD) criterion for different similar utterances. This approach differs from the traditional maximum likelihood (ML) approach in that the ML only considers the likelihood P(O-nu \ lambda(nu)) for a single utterance, while the MMD compares the likelihood P(O-nu \ lambda(nu)) against those similar utterances and maximizes their likelihood differences. Theoretical and practical issues concerning this approach are investigated. In addition, the corrective training [Bahl, Brown, de Souza and Mercer, IEEE Trans. Speech Audio Process. 1(1), (1993)] of the MMD was also included in this paper and we proved that the corrective training proposed by Bahl et al. (1993) is a special case of our MMD approach. Both speaker-dependent and multi-speaker experiments have bean carried out on the Chinese An-set syllables and also the 599 most common utterances from the TIMIT database. Experimental results showed that significant error reduction can be achieved through the proposed approach. (C) 1997 Pattern Recognition Society. Published by Elsevier Science Ltd.
引用
收藏
页码:219 / 229
页数:11
相关论文
共 19 条
[1]   A MAXIMUM-LIKELIHOOD APPROACH TO CONTINUOUS SPEECH RECOGNITION [J].
BAHL, LR ;
JELINEK, F ;
MERCER, RL .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1983, 5 (02) :179-190
[2]  
BAHL R, 1993, IEEE T SPEECH AUDIO, V1
[3]   STATISTICAL INFERENCE FOR PROBABILISTIC FUNCTIONS OF FINITE STATE MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T .
ANNALS OF MATHEMATICAL STATISTICS, 1966, 37 (06) :1554-&
[4]   A MAXIMIZATION TECHNIQUE OCCURRING IN STATISTICAL ANALYSIS OF PROBABILISTIC FUNCTIONS OF MARKOV CHAINS [J].
BAUM, LE ;
PETRIE, T ;
SOULES, G ;
WEISS, N .
ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (01) :164-&
[5]  
BHAL R, 1986, P 1986 IEEE INT C AC, P49
[6]  
CHANG PC, 1992, P ICASSP 92 SAN FRAN, V1, P493
[7]  
CHOU W, 1992, P INT C AC SPEECH SI, V1, P473
[8]  
CHU W, 1994, J PATTERN RECOGNITIO, V8
[9]  
EPHRAIM Y, 1990, IEEE T INFORM THEORY, V36
[10]  
EPHRAIM Y, 1989, IEEE T INFORM THEORY, V35