Phase Autocorrelation Bark Wavelet Transform (PACWT) Features for Robust Speech Recognition

被引:4
作者
Majeed, Sayf A. [1 ]
Husain, Hafizah [1 ]
Samad, Salina A. [1 ]
机构
[1] UKM, Natl Univ Malaysia, Fac Engn & Built Environm, Dept Elect Elect & Syst Engn, Bangi, Selangor, Malaysia
关键词
speech recognition; feature extraction; phase autocorrelation; wavelet transform; NOISE;
D O I
10.1515/aoa-2015-0004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a new feature-extraction method is proposed to achieve robustness of speech recognition systems. This method combines the benefits of phase autocorrelation (PAC) with bark wavelet transform. PAC uses the angle to measure correlation instead of the traditional autocorrelation measure, whereas the bark wavelet transform is a special type of wavelet transform that is particularly designed for speech signals. The extracted features from this combined method are called phase autocorrelation bark wavelet transform (PACWT) features. The speech recognition performance of the PACWT features is evaluated and compared to the conventional feature extraction method mel frequency cepstrum coefficients (MFCC) using TI-Digits database under different types of noise and noise levels. This database has been divided into male and female data. The result shows that the word recognition rate using the PACWT features for noisy male data (white noise at 0 dB SNR) is 60%, whereas it is 41.35% for the MFCC features under identical conditions.
引用
收藏
页码:25 / 31
页数:7
相关论文
共 27 条
  • [1] Addison P., 2010, ILLUSTRATED WAVELET
  • [2] [Anonymous], 2002, Series: Springer Series in Statistics
  • [3] [Anonymous], IEEE INT C AC SPEECH
  • [4] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [5] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [6] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
    DAVIS, SB
    MERMELSTEIN, P
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366
  • [7] ACOUSTICAL QUANTA AND THE THEORY OF HEARING
    GABOR, D
    [J]. NATURE, 1947, 159 (4044) : 591 - 594
  • [8] Phase Auto Correlation (PAC) features for noise robust speech recognition
    Ikbal, Shajith
    Misra, Hemant
    Hermansky, Hynek
    Magimai-Doss, Mathew
    [J]. SPEECH COMMUNICATION, 2012, 54 (07) : 867 - 880
  • [9] JIE Y., 2009, COMP COMM CONTR MAN
  • [10] LEONARD R, 1984, IEEE INT C AC SPEECH