A method for isolated Thai tone recognition using a combination of neural networks

被引:6
作者
Thubthong, N [1 ]
Kijsirikul, B
Pusittrakul, A
机构
[1] Chulalongkorn Univ, Dept Phys, Bangkok 10330, Thailand
[2] Chulalongkorn Univ, Dept Comp Engn, Bangkok 10330, Thailand
关键词
Thai tone; tone recognition; combination of neural networks; combination rules; voting techniques;
D O I
10.1111/0824-7935.00193
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tone information is very important to speech recognition in a tonal language such as Thai. In this article, we present a method for isolated Thai tone recognition. First, we define three sets of tone features to capture the characteristics of Thai tones and employ a feedforward neural network to classify tones based on these features. Next, we describe several experiments using the proposed features. The experiments are designed to study the effect of initial consonants, vowels, and final consonants on tone recognition. We find that there are some correlations between tones and other phonemes, and the recognition performances are satisfying. A human perception test is then conducted to judge the recognition rate. The recognition rate of a human is much lower than that of a machine. Finally, we explore various combination schemes to enhance the recognition rate. Further improvements are found in most experiments.
引用
收藏
页码:313 / 335
页数:23
相关论文
共 31 条
[1]  
Abramson A. S., 1976, Tai Linguistics in Honor of Fang-Kuei Li, P1
[2]  
[Anonymous], 1993, P 1 S NATURAL LANGUA
[3]  
CHANG PC, 1990, INT C AC SPEECH SIGN, V1, P517
[4]  
CHEN SH, 1995, IEEE T SPEECH AUDIO, V3, P150
[5]  
CHO S, 1997, IEEE INT C EV COMP, P647
[6]  
Duin RPW, 2000, LECT NOTES COMPUT SC, V1857, P16
[7]   TONAL COARTICULATION IN THAI [J].
GANDOUR, J ;
POTISUK, S ;
DECHONGKIT, S .
JOURNAL OF PHONETICS, 1994, 22 (04) :477-492
[8]  
Gerald C F, 1994, APPL NUMERICAL ANAL
[9]   NEURAL NETWORK ENSEMBLES [J].
HANSEN, LK ;
SALAMON, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (10) :993-1001
[10]  
HO TK, 1994, IEEE T PATTERN ANAL, V16, P66, DOI 10.1109/34.273716