Admissible wavelet packet features based on human inner ear frequency response for Hindi consonant recognition

被引:24
作者
Biswas, Astik [1 ]
Sahu, P. K. [1 ]
Chandra, Mahesh [2 ]
机构
[1] Natl Inst Technol, Dept Elect Engn, Rourkela, India
[2] Birla Inst Technol, Dept ECE, Ranchi, Bihar, India
关键词
DESIGN;
D O I
10.1016/j.compeleceng.2014.01.008
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It was observed that for non-stationary and quasi-stationary signals, wavelet transform has been found to be an effective tool for the time-frequency analysis. In the recent years wavelet transform being used for feature extraction in speech recognition applications. Here a new filter structure using admissible wavelet packet analysis is proposed for Hindi phoneme recognition. These filters have the benefit of having frequency bands spacing similar to the auditory Equivalent Rectangular Bandwidth (ERB) scale whose central frequencies are equally distributed along the frequency response of human cochlea. The phoneme recognition performance of proposed feature is compared with the standard baseline features and 24-band admissible wavelet packet-based features using a Hidden Markov Model (HMM) based classifier. Proposed feature shows better performance compared to conventional features for Hindi consonant recognition. To evaluate the robustness of proposed feature in the noisy environment NOISEX-92 database has been used. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1111 / 1122
页数:12
相关论文
共 25 条
[1]  
[Anonymous], 2009, WAVELET TOUR SIGNAL
[2]   An implementation of rational wavelets and filter design for phonetic classification [J].
Choueiter, Ghinwa F. ;
Glass, James R. .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03) :939-948
[3]  
Coifman R.R., 1992, Wavelet analysis and signal processing
[4]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[5]   Separability-based multiscale basis selection and feature extraction for signal and image classification [J].
Etemad, K ;
Chellappa, R .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (10) :1453-1465
[6]   WAVELET SUB-BAND BASED TEMPORAL FEATURES FOR ROBUST HINDI PHONEME RECOGNITION [J].
Farooq, O. ;
Datta, S. ;
Shrotriya, M. C. .
INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2010, 8 (06) :847-859
[7]   Mel filter-like admissible wavelet packet structure for speech recognition [J].
Farooq, O ;
Datta, S .
IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) :196-198
[8]   Auditory-based wavelet packet filterbank for speech recognition using neural network [J].
Gandhiraj, R. ;
Sathidevi, P. S. .
ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, :666-+
[9]   PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH [J].
HERMANSKY, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) :1738-1752
[10]   SPEAKER-INDEPENDENT PHONE RECOGNITION USING HIDDEN MARKOV-MODELS [J].
LEE, KF ;
HON, HW .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (11) :1641-1648