Sub-band Feature Statistics Compensation Techniques Based on Discrete Wavelet Transform for Robust Speech Recognition

被引:0
|
作者
Fan, Hao-Teng [1 ]
Hung, Jeih-weih [1 ]
机构
[1] Natl Chi Nan Univ, Dept Elect Engn, Puli, Taiwan
关键词
discrete wavelet transform; speech recognition; robust speech feature;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes a novel scheme in performing feature statistics normalization techniques for robust speech recognition. In the proposed approach, the processed temporal-domain feature sequence is first decomposed into non-uniform sub-bands using discrete wavelet transform (DWT), and then each sub-band stream is individually processed by the well-known normalization methods, like mean and variance normalization (MVN) and histogram equalization (HEQ). Finally, we reconstruct the feature stream with all the modified sub-band streams using inverse DWT. With this process, the components that correspond to more important modulation spectral bands in the feature sequence can be processed separately. For the Aurora-2 clean-condition training task, the new proposed sub-band MVN and HEQ provide relative error rate reductions of 20.18% and 19.65% over the conventional MVN and HEQ.
引用
收藏
页码:586 / 589
页数:4
相关论文
共 50 条
  • [1] Subband Feature Statistics Normalization Techniques Based on a Discrete Wavelet Transform for Robust Speech Recognition
    Hung, Jeih-weih
    Fan, Hao-Teng
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (09) : 806 - 809
  • [2] Sub-band Modulation Spectrum Compensation for Robust Speech Recognition
    Tu, Wen-hsiang
    Huang, Sheng-Yuan
    Hung, Jeih-weih
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 261 - 265
  • [3] Wavelet based robust sub-band features for phoneme recognition
    Farooq, O
    Datta, S
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (03): : 187 - 193
  • [4] Overlapped sub-band modulation spectrum normalization techniques for robust speech recognition
    Fan, Hao-teng
    Yeh, Wei-jeih
    Hung, Jeih-weih
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 1035 - 1039
  • [5] A probabilistic union model for sub-band based robust speech recognition
    Ming, J
    Smith, FJ
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1787 - 1790
  • [6] Sub-band speech recognition
    Primor, D
    Furst-Yust, M
    22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 10 - 12
  • [7] Sub-band based recognition of noisy speech
    Tibrewala, S
    Hermansky, H
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1255 - 1258
  • [8] WAVELET SUB-BAND BASED TEMPORAL FEATURES FOR ROBUST HINDI PHONEME RECOGNITION
    Farooq, O.
    Datta, S.
    Shrotriya, M. C.
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2010, 8 (06) : 847 - 859
  • [9] Sub-band level Histogram Equalization for Robust Speech Recognition
    Joshi, Vikas
    Bilgi, Raghavendra
    Umesh, S.
    Garcia, L.
    Benitez, C.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1672 - +
  • [10] Modeling Sub-Band Information Through Discrete Wavelet Transform to Improve Intelligibility Assessment of Dysarthric Speech
    Sahu, Laxmi Priya
    Pradhan, Gayadhar
    Singh, Jyoti Prakash
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (07): : 56 - 64