Kannada Word Recognition System Using HTK

被引:0
作者
Ananthakrishna, T. [1 ]
Maithri, M. [1 ]
Shama, Kumara [1 ]
机构
[1] Manipal Inst Technol, Dept E&C, Manipal, Karnataka, India
来源
2015 ANNUAL IEEE INDIA CONFERENCE (INDICON) | 2015年
关键词
Hidden Markov Tool Kit (HTK); Mel frequency cepstral coefficients (MFCC); Kannada language; Syllable-level and Phone-level modeling;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In the present work, speech recognition system for Kannada language has been implemented using the Hidden Markov Tool Kit (HTK). The system performance is comparatively studied and evaluated for syllable and phone level models. The Kannada word dictionary of size about 110 words is used in the study and Mel frequency cepstral coefficients (MFCC) are computed in acoustic front-end processing. The system is designed to recognize isolated utterances of Kannada words, which are recorded from a Kannada short story. Baum-Welch algorithm is used to train the Hidden Markov Model (HMM) and Viterbi algorithm for decoding process. The objective of this study is to compare the performances of phone-level and syllable-level acoustical models for small to medium sized Kannada language vocabulary. The results are part of the on-going research work on large vocabulary continuous speech recognition system for Kannada language. Average word recognition accuracy of 97.1% for syllable-level modeling and 98.6% for phone-level modeling has been reported. Analysis of system performance also carried out based on the confusion matrices.
引用
收藏
页数:5
相关论文
共 21 条
[1]  
Aggarwal R. K., 2011, INT J SIGNAL PROCESS, V4
[2]  
[Anonymous], 1993, Discrete-Time Processing of Speech Signals
[3]  
[Anonymous], 1 ORDER HIDDEN MARKO
[4]  
[Anonymous], 2011, INT J SIGNAL PROCESS
[5]  
[Anonymous], INT J COMPUTER APPL
[6]  
[Anonymous], 2013, INT J ENG TRENDS TEC
[7]  
Anusuya MA, 2012, IJCA P NAT C ADV EL, P32
[8]  
Davis Steven B., 1980, IEEE T ACOUSTICS SPE, V4
[9]  
Gawali Bharti W., 2011, ACEEE INT J INFORM T, V01
[10]  
Hegde S, 2012, COMM COM INF SC, V292, P262