Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC

被引:0
作者
Han Zhiyan [1 ]
Wang Jian [1 ]
Wang Xu [2 ]
Lun Shuxian [1 ]
机构
[1] Bohai Univ, Coll Informat Sci & Engn, Jinzhou 121000, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2011年 / 20卷 / 01期
关键词
Speech recognition; Multiple signal classification (MUSIC); Canonical correlation based on compensation (CCBC); Feature extraction; SPECTRUM ESTIMATION; MVDR; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A novel feature extraction algorithm was proposed to improve the robustness of speech recognition. Core technology was incorporating perceptual information into the Multiple signal classification (MUSIC) spectrum, it provided improved robustness and computational efficiency comparing with the Mel frequency cepstral coefficient (MFCC) technique, then the cepstrum coefficients were extracted as the feature parameter. The effectiveness of the parameter was discussed in view of the class separability and speaker variability properties. To improve the robustness, we considered incorporating Canonical correlation based compensation (CCBC) to cope with the mismatch between training and test set. We evaluated the technique using improved Back-propagation neural networks (BPNN) in three different tasks: in different speakers, different recording channels and different noisy environments. The experimental results show that the novel feature has well robustness and effectiveness relative to MFCC and the CCBC algorithm can make speech recognition system robust in all three kinds of mismatch.
引用
收藏
页码:105 / 110
页数:6
相关论文
共 11 条
  • [1] MUSIC algorithm for two-dimensional inverse problems with special characteristics of cylinders
    Chen, Xudong
    Agarwal, Krishna
    [J]. IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2008, 56 (06) : 1808 - 1812
  • [2] Robust feature extraction for continuous speech recognition using the MVDR spectrum estimation method
    Dharanipragada, Satya
    Yapanel, Umit H.
    Rao, Bhaskar D.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 224 - 234
  • [3] FSF MUSIC for joint DOA and frequency estimation and its performance analysis
    Lin, Jen-Der
    Fang, Wen-Hsien
    Wang, Yung-Yi
    Chen, Jiunn-Tsair
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (12) : 4529 - 4542
  • [4] Long Qian, 2006, Journal of Data Acquisition & Processing, V21, P297
  • [5] Spectrum estimation, notch filters, and MUSIC
    Mahata, K
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2005, 53 (10) : 3727 - 3737
  • [6] Combining evidence from residual phase and MFCC features for speaker recognition
    Murty, KR
    Yegnanarayana, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (01) : 52 - 55
  • [7] Generalizing MUSIC and MVDR for multiple noncoherent arrays
    Rieken, DW
    Fuhrmann, DR
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (09) : 2396 - 2406
  • [8] SUN CB, 2005, THESIS JILIN U CHINA
  • [9] Wang Yue, 2010, Acta Electronica Sinica, V38, P525
  • [10] HBP: Improvement in BP algorithm for an adaptive MLP decision feedback equalizer
    Yang, SS
    Ho, CL
    Lee, CM
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2006, 53 (03): : 240 - 244