SVM and HMM Modeling Techniques for Speech Recognition Using LPCC and MFCC Features

被引:11
作者
Ananthi, S. [1 ]
Dhanalakshmi, P. [1 ]
机构
[1] Annamalai Univ, Dept Comp Sci & Engn, Chidambaram, Tamil Nadu, India
来源
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1 | 2015年 / 327卷
关键词
Acoustic Feature Extraction; Hidden Markov Model (HMM); Isolated word recognition; LPCC; MFCC; Support Vector Machine (SVM); Voice Activity Detection (VAD);
D O I
10.1007/978-3-319-11933-5_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech Recognition approach intends to recognize the text from the speech utterance which can be more helpful to the people with hearing disabled. Support Vector Machine (SVM) and Hidden Markov Model (HMM) are widely used techniques for speech recognition system. Acoustic features namely Linear Predictive Coding (LPC), Linear Prediction Cepstral Coefficient (LPCC) and Mel Frequency Cepstral Coefficients (MFCC) are extracted. Modeling techniques such as SVM and HMM were used to model each individual word thus owing to 620 models which are trained to the system. Each isolated word segment from the test sentence is matched against these models for finding the semantic representation of the test input speech. The performance of the system is evaluated for the words related to computer domain and the system shows an accuracy of 91.46% for SVM 98.92% for HMM. From the exhaustive analysis, it is evident that HMM performs better than other modeling techniques such as SVM.
引用
收藏
页码:519 / 526
页数:8
相关论文
共 12 条
[1]   Front end analysis of speech recognition: a review [J].
Anusuya, M. ;
Katti, S. .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2011, 14 (02) :99-145
[2]  
Baum L.E., 1972, EQUALITIES, V3, P1
[3]  
Bhattacharjee U., 2013, Int. J. Eng. Res. Technol., V2, P1
[4]  
BOUROBUA H, 2006, INFORMATICA, V30, P373
[5]   Choosing multiple parameters for support vector machines [J].
Chapelle, O ;
Vapnik, V ;
Bousquet, O ;
Mukherjee, S .
MACHINE LEARNING, 2002, 46 (1-3) :131-159
[6]  
Das B.P., 2012, INT J MODERN ENG RES, V2, P854
[7]   Evaluation of simple performance measures for tuning SVM hyperparameters [J].
Duan, K ;
Keerthi, SS ;
Poo, AN .
NEUROCOMPUTING, 2003, 51 :41-59
[8]  
HERSCHER MB, 1972, P C SPEECH COMM P BO, P89
[9]  
Ramirez J. M., 2007, ROBUST SPEECH RECOGN, V6, P1, DOI [10.5772/4740, DOI 10.5772/4740]
[10]   Connectionist Probability Estimators in HMM Speech Recognition [J].
Renals, Steve ;
Morgan, Nelson ;
Bourlard, Herve ;
Cohen, Michael ;
Franco, Horacio .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01) :161-174