Urdu spoken digits recognition using classified MFCC and backpropagation neural network

Cited by: 7
Authors
Azam, S. M.
Mansoor, Z. A.
Mughal, M. Shahzad
Mohsin, S.
Affiliations
Source
COMPUTER GRAPHICS, IMAGING AND VISUALISATION: NEW ADVANCES | 2007
Keywords
Mel Frequency Cepstral Coefficients; Urdu spoken digits recognition; Backpropagation;
DOI
10.1109/CGIV.2007.85
Chinese Library Classification number
TP31 [Computer software];
Subject classification code
081202 ; 0835 ;
Abstract
Neural networks have found profound success in the area of pattern recognition, and in recent years they have been applied to speech recognition. In this paper a backpropagation neural network is used for isolated spoken Urdu digit recognition. Mel Frequency Cepstral Coefficients (MFCC) are used to represent the speech signal. The dimensions of the speech features are reduced to a vector of 39 values. Only these 39 values from the MFCC speech features are fed to the neural network, which has more than one hidden layer with a varying number of neurons, for training and recognition. An analysis compares different numbers of hidden layers and different numbers of neurons in the hidden layers. The results for these 39 values are found to be similar to those obtained using the complete MFCC features, which range from 804 to 67x39. With 39 values on the input layer, the computational complexity and the time needed for training and recognition are reduced. To evaluate the significance of the proposed method on data other than Urdu digits, 30 English words were trained and recognized, with a 98% recognition rate. All of the implementation was done in MATLAB.
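The pipeline in the abstract (a variable-length MFCC matrix reduced to 39 values, then fed to a backpropagation network with several hidden layers) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not say how the 39 values are derived, so averaging each coefficient over frames is an assumption, and the names `reduce_mfcc` and `BackpropNet`, the layer sizes, the sigmoid activations, and the learning rate are all hypothetical. The paper's own code was written in MATLAB; Python with NumPy is used here only for illustration.

```python
import numpy as np


def reduce_mfcc(frames):
    """Collapse a (T, 39) MFCC matrix into one 39-value vector.

    ASSUMPTION: per-coefficient averaging over frames. The record does
    not state the reduction actually used in the paper.
    """
    frames = np.asarray(frames, dtype=float)
    return frames.mean(axis=0)


class BackpropNet:
    """Fully connected sigmoid network trained with plain backpropagation."""

    def __init__(self, sizes, seed=0):
        # sizes, e.g. [39, 16, 16, 10]: 39 inputs, two hidden layers,
        # 10 outputs (one per digit). These sizes are hypothetical.
        rng = np.random.default_rng(seed)
        self.W = [rng.normal(0.0, 1.0 / np.sqrt(m), (m, n))
                  for m, n in zip(sizes[:-1], sizes[1:])]
        self.b = [np.zeros(n) for n in sizes[1:]]

    def forward(self, X):
        # Return the activations of every layer, input included.
        acts = [np.asarray(X, dtype=float)]
        for W, b in zip(self.W, self.b):
            acts.append(1.0 / (1.0 + np.exp(-(acts[-1] @ W + b))))
        return acts

    def train_step(self, X, Y, lr=0.5):
        """One full-batch gradient step on squared error; returns the MSE."""
        acts = self.forward(X)
        out = acts[-1]
        # Output-layer error term for sigmoid units with squared error.
        delta = (out - Y) * out * (1.0 - out)
        for i in reversed(range(len(self.W))):
            grad_W = acts[i].T @ delta / len(acts[0])
            grad_b = delta.mean(axis=0)
            if i > 0:
                # Propagate the error backwards before updating W[i].
                prev = (delta @ self.W[i].T) * acts[i] * (1.0 - acts[i])
            self.W[i] -= lr * grad_W
            self.b[i] -= lr * grad_b
            if i > 0:
                delta = prev
        return float(np.mean((out - Y) ** 2))

    def predict(self, X):
        # Index of the most active output unit for each input row.
        return self.forward(X)[-1].argmax(axis=1)
```

Under these assumptions, each utterance would be passed through `reduce_mfcc` once, and the resulting 39-value rows stacked into the matrix `X` given to `train_step` together with one-hot digit labels `Y`. Changing the `sizes` list is how one would vary the number of hidden layers and neurons, which is the comparison the abstract describes.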
Pages: 414-418
Page count: 5