Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC

被引：0

作者：

Han Zhiyan ^{[1
]}

Wang Jian ^{[1
]}

Wang Xu ^{[2
]}

Lun Shuxian ^{[1
]}

机构：

[1] Bohai Univ, Coll Informat Sci & Engn, Jinzhou 121000, Peoples R China

[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Peoples R China

来源：

CHINESE JOURNAL OF ELECTRONICS | 2011年 / 20卷 / 01期

关键词：

Speech recognition; Multiple signal classification (MUSIC); Canonical correlation based on compensation (CCBC); Feature extraction; SPECTRUM ESTIMATION; MVDR; ALGORITHM;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A novel feature extraction algorithm was proposed to improve the robustness of speech recognition. Core technology was incorporating perceptual information into the Multiple signal classification (MUSIC) spectrum, it provided improved robustness and computational efficiency comparing with the Mel frequency cepstral coefficient (MFCC) technique, then the cepstrum coefficients were extracted as the feature parameter. The effectiveness of the parameter was discussed in view of the class separability and speaker variability properties. To improve the robustness, we considered incorporating Canonical correlation based compensation (CCBC) to cope with the mismatch between training and test set. We evaluated the technique using improved Back-propagation neural networks (BPNN) in three different tasks: in different speakers, different recording channels and different noisy environments. The experimental results show that the novel feature has well robustness and effectiveness relative to MFCC and the CCBC algorithm can make speech recognition system robust in all three kinds of mismatch.

引用

页码：105 / 110

页数：6

共 11 条

[1] MUSIC algorithm for two-dimensional inverse problems with special characteristics of cylinders
Chen, Xudong
Agarwal, Krishna
[J]. IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2008, 56 (06) : 1808 - 1812
[2] Robust feature extraction for continuous speech recognition using the MVDR spectrum estimation method
Dharanipragada, Satya
Yapanel, Umit H.
Rao, Bhaskar D.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (01): : 224 - 234
[3] FSF MUSIC for joint DOA and frequency estimation and its performance analysis
Lin, Jen-Der
Fang, Wen-Hsien
Wang, Yung-Yi
Chen, Jiunn-Tsair
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (12) : 4529 - 4542
[4] Long Qian, 2006, Journal of Data Acquisition & Processing, V21, P297
[5] Spectrum estimation, notch filters, and MUSIC
Mahata, K
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2005, 53 (10) : 3727 - 3737
[6] Combining evidence from residual phase and MFCC features for speaker recognition
Murty, KR
Yegnanarayana, B
[J]. IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (01) : 52 - 55
[7] Generalizing MUSIC and MVDR for multiple noncoherent arrays
Rieken, DW
Fuhrmann, DR
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (09) : 2396 - 2406
[8] SUN CB, 2005, THESIS JILIN U CHINA
[9] Wang Yue, 2010, Acta Electronica Sinica, V38, P525
[10] HBP: Improvement in BP algorithm for an adaptive MLP decision feedback equalizer
Yang, SS
Ho, CL
Lee, CM
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2006, 53 (03): : 240 - 244

← 1 2 →