DWT features performance analysis for automatic speech recognition of Urdu

被引:15
作者
Ali, Hazrat [1 ,2 ]
Ahmad, Nasir [3 ]
Zhou, Xianwei [2 ]
Iqbal, Khalid [2 ]
Ali, Sahibzada Muhammad [4 ]
机构
[1] City Univ London, Dept Comp, Machine Learning Grp, London EC1V 0HB, England
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[3] Univ Engn & Technol Peshawar, Dept Comp Syst Engn, Peshawar 25120, Pakistan
[4] N Dakota State Univ, Dept Elect & Comp Engn, Fargo, ND 58108 USA
来源
SPRINGERPLUS | 2014年 / 3卷
关键词
Automatic speech recognition; Discrete wavelet transforms; Linear discriminant analysis; Mel-frequency cepstral coefficients; Urdu isolated words recognition; WAVELET;
D O I
10.1186/2193-1801-3-204
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents the work on Automatic Speech Recognition of Urdu language, using a comparative analysis for Discrete Wavelets Transform (DWT) based features and Mel Frequency Cepstral Coefficients (MFCC). These features have been extracted for one hundred isolated words of Urdu, each word uttered by ten different speakers. The words have been selected from the most frequently used words of Urdu. A variety of age and dialect has been covered by using a balanced corpus approach. After extraction of features, the classification has been achieved by using Linear Discriminant Analysis. After the classification task, the confusion matrix obtained for the DWT features has been compared with the one obtained for Mel-Frequency Cepstral Coefficients based speech recognition. The framework has been trained and tested for speech data recorded under controlled environments. The experimental results are useful in determination of the optimum features for speech recognition task.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 31 条
  • [1] Akram MU, 2004, INMIC 2004: 8TH INTERNATIONAL MULTITOPIC CONFERENCE, PROCEEDINGS, P91
  • [2] Ali H, 2013, INT MULT C IMTIC 13
  • [3] Ali H., 2012, 2012 INT C EL COMP T, P473
  • [4] [Anonymous], 2010, P 8 INT C FRONTIERS
  • [5] [Anonymous], 2010, 2010 7 INT C INF SYS, DOI DOI 10.1007/978-3-642-13881-2_14
  • [6] Linear Discriminant Analysis for signal processing problems
    Balakrishnama, S
    Ganapathiraju, A
    Picone, J
    [J]. IEEE SOUTHEASTCON '99, PROCEEDINGS, 1999, : 78 - 81
  • [7] Balakrishnama S., 1998, LINEAR DISCRIMINANT
  • [8] Speech feature extracted from adaptive wavelet for speech recognition
    Chang, SW
    Kwon, Y
    Yang, SI
    [J]. ELECTRONICS LETTERS, 1998, 34 (23) : 2211 - 2213
  • [9] Criado C., 2011, 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2011), P1012, DOI 10.1109/FSKD.2011.6019637
  • [10] Phoneme recognition using wavelet based features
    Farooq, O
    Datta, S
    [J]. INFORMATION SCIENCES, 2003, 150 (1-2) : 5 - 15