A genetic algorithm-aided Hidden Markov Model topology estimation for phoneme recognition of Thai continuous speech

Cited by: 9
Authors
Bhuriyakorn, Pattana [1 ]
Punyabukkana, Proadpran [1 ]
Suchato, Atiwong [1 ]
Affiliations
[1] Chulalongkorn Univ, Fac Engn, Spoken Language Syst Res Grp, Dept Comp Engn, Bangkok 10330, Thailand
Source
PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING | 2008
DOI
10.1109/SNPD.2008.73
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hidden Markov Models (HMMs) are now widely used in pattern recognition tasks, and most Automatic Speech Recognition systems rely on HMM acoustic models. In such systems, recognition performance is significantly affected by the topology of those models. In this paper, we propose an HMM topology estimation approach for Thai phoneme recognition whose process is divided into two stages. First, a set of suitable candidate topologies is constructed from combinations of different objective functions and topology generation methods. Second, a Genetic Algorithm is deployed as the topology selection algorithm: it considers global fitness and selects, for each phoneme, the most suitable topology from the candidates proposed in the first stage. The well-trained topologies yield a maximum error reduction of 4.36% over predefined left-to-right models. The estimated topologies still work well when topology estimation is performed on speech utterances whose recording environments differ from those of the utterances being recognized.
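The second stage described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record gives no details of the encoding, operators, or fitness measure, so everything below is an assumption. Each chromosome holds one candidate-topology index per phoneme, selection is by tournament with single-point crossover and elitism, and `toy_fitness` (with its `TARGET` vector) is a hypothetical stand-in for what would presumably be recognition accuracy measured after training HMMs with the chosen topologies.

```python
import random

# Hypothetical "ideal" candidate index for each of 5 phonemes (illustration only).
TARGET = [2, 0, 1, 3, 1]


def toy_fitness(chromosome):
    """Stand-in for global fitness: counts genes that match TARGET.

    A real system would train HMMs with the selected topologies and
    score recognition accuracy instead.
    """
    return sum(1 for gene, want in zip(chromosome, TARGET) if gene == want)


def select_topologies(num_phonemes, num_candidates, fitness, *,
                      pop_size=30, generations=50, mutation_rate=0.1, seed=0):
    """Pick one candidate topology index per phoneme with a simple GA.

    Assumes num_phonemes >= 2 (needed for single-point crossover).
    """
    rng = random.Random(seed)
    # Each individual assigns a candidate index to every phoneme.
    population = [[rng.randrange(num_candidates) for _ in range(num_phonemes)]
                  for _ in range(pop_size)]
    best = max(population, key=fitness)

    def tournament():
        # Binary tournament: the fitter of two random individuals wins.
        a, b = rng.sample(population, 2)
        return a if fitness(a) >= fitness(b) else b

    for _ in range(generations):
        next_population = [best[:]]  # elitism: carry the best individual over
        while len(next_population) < pop_size:
            p1, p2 = tournament(), tournament()
            cut = rng.randrange(1, num_phonemes)  # single-point crossover
            child = p1[:cut] + p2[cut:]
            for i in range(num_phonemes):
                if rng.random() < mutation_rate:
                    child[i] = rng.randrange(num_candidates)  # random reset
            next_population.append(child)
        population = next_population
        best = max(population, key=fitness)
    return best


if __name__ == "__main__":
    chosen = select_topologies(len(TARGET), 4, toy_fitness)
    print("chosen candidate per phoneme:", chosen)
    print("fitness:", toy_fitness(chosen))
```

Because the fitness is evaluated on the whole chromosome rather than per phoneme, the search can in principle account for interactions between the topologies of different phonemes, which is one plausible reading of "global fitness" in the abstract.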
Pages: 475-480
Page count: 6
Related papers
13 in total
[1]  
BHURIYAKORN P, 2007, 7 INT SYM NAT LANG P
[2]  
Biem A, 2003, PROC INT CONF DOC, P104
[3]  
Biem A., 2002, P ICASSP, V1, P1
[4]  
DANFENG L, 2001, P IEEE ICASSP, P1521
[5]  
Han S, 2007, PROC INT CONF DOC, P604
[6]  
HOLLAND JH, 1998, GENETIC PROGRAMMING, V6
[7]   Variational Bayesian approach for automatic generation of HMM topologies [J].
Jitsuhiro, T ;
Nakamura, S .
ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, :77-82
[8]  
KASURIYAM S, 2003, P COCOSDA 03
[9]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[10]   ESTIMATING DIMENSION OF A MODEL [J].
SCHWARZ, G .
ANNALS OF STATISTICS, 1978, 6 (02) :461-464