A genetic algorithm-aided Hidden Markov Model topology estimation for phoneme recognition of Thai continuous speech

被引：9

作者：

Bhuriyakorn, Pattana ^{[1
]}

Punyabukkana, Proadpran ^{[1
]}

Suchato, Atiwong ^{[1
]}

机构：

[1] Chulalongkorn Univ, Fac Engn, Spoken Language Syst Res Grp, Dept Comp Engn, Bangkok 10330, Thailand

来源：

PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING | 2008年

关键词：

D O I：

10.1109/SNPD.2008.73

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The use of Hidden Markov Models (HMM) in many pattern recognition tasks is now very common. Like other pattern recognitions, most Automatic Speech Recognition systems rely on HMM acoustic models. In such systems, recognition performances are significantly affected by their topologies. In this paper, we propose an HMM topology estimation approach for Thai phoneme recognition tasks whose process is divided into 2 stages. First, a set of suitable topologies are constructed by combinations of different objective functions and topology generation methods. Second, a Genetic Algorithm is deployed as the topology selection algorithm which considers global fitness and selects the most suitable topology from the candidates proposed in the previous stage for each phoneme. As a result, the well-trained topology yields a maximum of 4.36% error reduction over predefined left-to-right models. The estimated topologies still work well when the topology estimation was performed on speech utterances whose recording environments differ from the ones recognized.

引用

页码：475 / 480

页数：6

共 13 条

[1]

BHURIYAKORN P, 2007, 7 INT SYM NAT LANG P

[2]

Biem A, 2003, PROC INT CONF DOC, P104

[3]

Biem A., 2002, P ICASSP, V1, P1

[4]

DANFENG L, 2001, P IEEE ICASSP, P1521