WAVELET SUB-BAND BASED TEMPORAL FEATURES FOR ROBUST HINDI PHONEME RECOGNITION

被引:21
作者
Farooq, O. [1 ]
Datta, S. [2 ]
Shrotriya, M. C. [1 ]
机构
[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh 202002, Uttar Pradesh, India
[2] Loughborough Univ Technol, Dept Elect Engn, Loughborough LE11 3TU, Leics, England
关键词
Feature extraction; Hindi speech; phoneme recognition; wavelet transform; SPEECH; SYSTEM;
D O I
10.1142/S0219691310003845
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes the use of wavelet transform-based feature extraction technique for Hindi speech recognition application. The new proposed features take into account temporal as well as frequency band energy variations for the task of Hindi phoneme recognition. The recognition performance achieved by the proposed features is compared with the standard MFCC and 24-band admissible wavelet packet-based features using a linear discriminant function based classifier. To evaluate robustness of these features, the NOISEX database is used to add different types of noise into phonemes to achieve signal-to-noise ratios in the range of 20 dB to -5 dB. The recognition results show that under noisy background the proposed technique always achieves a better performance over MFCC-based features.
引用
收藏
页码:847 / 859
页数:13
相关论文
共 28 条
[1]  
[Anonymous], 2000, INTERSPEECH
[2]   Speech feature extracted from adaptive wavelet for speech recognition [J].
Chang, SW ;
Kwon, Y ;
Yang, SI .
ELECTRONICS LETTERS, 1998, 34 (23) :2211-2213
[3]   DISCRETE WAVELET TRANSFORM APPLIED ON PERSONAL IDENTITY VERIFICATION WITH ECG SIGNAL [J].
Chiu, Chuang-Chien ;
Chuang, Chou-Min ;
Hsu, Chih-Yu .
INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2009, 7 (03) :341-355
[4]   STOP VOICING IN HINDI [J].
DAVIS, K .
JOURNAL OF PHONETICS, 1994, 22 (02) :177-193
[5]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[6]   Wavelet based robust sub-band features for phoneme recognition [J].
Farooq, O ;
Datta, S .
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (03) :187-193
[7]   Mel filter-like admissible wavelet packet structure for speech recognition [J].
Farooq, O ;
Datta, S .
IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) :196-198
[8]   Wavelet time-frequency analysis and least squares support vector machines for the identification of voice disorders [J].
Fonseca, Everthon Silva ;
Guido, Rodrigo Capobianco ;
Scalassara, Paulo Rogerio ;
Maciel, Carlos Dias ;
Pereira, Jose Carlos .
COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (04) :571-578
[9]   A neural-wavelet architecture for voice conversion [J].
Guido, Rodrigo Capobianco ;
Vieira, Lucimar Sasso ;
Barbon Junior, Sylvio ;
Sanchez, Fabricio Lopes ;
Maciel, Carlos Dias ;
Fonseca, Everthon Silva ;
Pereira, Jose Carlos .
NEUROCOMPUTING, 2007, 71 (1-3) :174-180
[10]   PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH [J].
HERMANSKY, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) :1738-1752