HIDDEN MARKOV-MODELS FOR SPEECH RECOGNITION

被引:361
作者
JUANG, BH
RABINER, LR
机构
[1] Speech Research Department, ATandT Bell Laboratories, Murray Hill, NJ, 07974, United States
关键词
BAUM-WELCH ALGORITHM; INCOMPLETE DATA PROBLEM; MAXIMUM A-POSTERIORI DECODING; MAXIMUM LIKELIHOOD;
D O I
10.2307/1268779
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The use of hidden Markov models for speech recognition has become predominant in the last several years, as evidenced by the number of published papers and talks at major speech conferences. The reasons this method has become so popular are the inherent statistical (mathematically precise) framework; the ease and availability of training algorithms for estimating the parameters of the models from finite training sets of speech data; the flexibility of the resulting recognition system in which one can easily change the size, type, or architecture of the models to suit particular words, sounds, and so forth; and the ease of implementation of the overall recognition system. In this expository article, we address the role of statistical methods in this powerful technology as applied to speech recognition and discuss a range of theoretical and practical issues that are as yet unsolved in terms of their importance and their effect on performance for different system implementations.
引用
收藏
页码:251 / 272
页数:22
相关论文
共 84 条
[1]   UNIFIED APPROACH TO SHORT-TIME FOURIER-ANALYSIS AND SYNTHESIS [J].
ALLEN, JB ;
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1977, 65 (11) :1558-1564
[2]   SPEECH ANALYSIS AND SYNTHESIS BY LINEAR PREDICTION OF SPEECH WAVE [J].
ATAL, BS ;
HANAUER, SL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 50 (02) :637-+
[3]  
AVERBUCH A, 1987, APR IEEE INT C AC SP, P701
[4]  
Bahl L. R., 1988, P ICASSP 88 NEW YORK, P40
[5]   A MAXIMUM-LIKELIHOOD APPROACH TO CONTINUOUS SPEECH RECOGNITION [J].
BAHL, LR ;
JELINEK, F ;
MERCER, RL .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1983, 5 (02) :179-190
[6]  
BAHL LR, 1986, P IEEE INT C AC SPEE, P49
[7]  
BAHL LR, 1980, P INT C ACOUSTICS SP, P872
[8]  
BAHL LR, 1988, P ICASSP 88 NEW YORK, P493
[9]   DRAGON SYSTEM - OVERVIEW [J].
BAKER, JK .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1975, AS23 (01) :24-29
[10]  
BAKIS R, 1976, UNPUB APR M AC SOC A