WAVELET SUB-BAND BASED TEMPORAL FEATURES FOR ROBUST HINDI PHONEME RECOGNITION

被引：21

作者：

Farooq, O. ^{[1
]}

Datta, S. ^{[2
]}

Shrotriya, M. C. ^{[1
]}

机构：

[1] Aligarh Muslim Univ, Dept Elect Engn, Aligarh 202002, Uttar Pradesh, India

[2] Loughborough Univ Technol, Dept Elect Engn, Loughborough LE11 3TU, Leics, England

来源：

INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING | 2010年 / 8卷 / 06期

关键词：

Feature extraction; Hindi speech; phoneme recognition; wavelet transform; SPEECH; SYSTEM;

D O I：

10.1142/S0219691310003845

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper proposes the use of wavelet transform-based feature extraction technique for Hindi speech recognition application. The new proposed features take into account temporal as well as frequency band energy variations for the task of Hindi phoneme recognition. The recognition performance achieved by the proposed features is compared with the standard MFCC and 24-band admissible wavelet packet-based features using a linear discriminant function based classifier. To evaluate robustness of these features, the NOISEX database is used to add different types of noise into phonemes to achieve signal-to-noise ratios in the range of 20 dB to -5 dB. The recognition results show that under noisy background the proposed technique always achieves a better performance over MFCC-based features.

引用

页码：847 / 859

页数：13

共 28 条

[1]

[Anonymous], 2000, INTERSPEECH

[2] Speech feature extracted from adaptive wavelet for speech recognition [J].

Chang, SW ;

Kwon, Y ;

Yang, SI .

ELECTRONICS LETTERS, 1998, 34 (23) :2211-2213

[3] DISCRETE WAVELET TRANSFORM APPLIED ON PERSONAL IDENTITY VERIFICATION WITH ECG SIGNAL [J].

Chiu, Chuang-Chien ;

Chuang, Chou-Min ;

Hsu, Chih-Yu .

INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2009, 7 (03) :341-355

[4] STOP VOICING IN HINDI [J].

DAVIS, K .

JOURNAL OF PHONETICS, 1994, 22 (02) :177-193

[5] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].

DAVIS, SB ;

MERMELSTEIN, P .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366

[6] Wavelet based robust sub-band features for phoneme recognition [J].

Farooq, O ;

Datta, S .

IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (03) :187-193

[7] Mel filter-like admissible wavelet packet structure for speech recognition [J].

Farooq, O ;

Datta, S .

IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (07) :196-198

[8] Wavelet time-frequency analysis and least squares support vector machines for the identification of voice disorders [J].

Fonseca, Everthon Silva ;

Guido, Rodrigo Capobianco ;

Scalassara, Paulo Rogerio ;

Maciel, Carlos Dias ;

Pereira, Jose Carlos .

COMPUTERS IN BIOLOGY AND MEDICINE, 2007, 37 (04) :571-578

[9] A neural-wavelet architecture for voice conversion [J].

Guido, Rodrigo Capobianco ;

Vieira, Lucimar Sasso ;

Barbon Junior, Sylvio ;

Sanchez, Fabricio Lopes ;

Maciel, Carlos Dias ;

Fonseca, Everthon Silva ;

Pereira, Jose Carlos .

NEUROCOMPUTING, 2007, 71 (1-3) :174-180

[10] PERCEPTUAL LINEAR PREDICTIVE (PLP) ANALYSIS OF SPEECH [J].

HERMANSKY, H .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 87 (04) :1738-1752

← 1 2 3 →