Robust feature extraction using subband spectral centroid histograms

被引:0
|
作者
Gajic, B [1 ]
Paliwal, KK [1 ]
机构
[1] Griffith Univ, Sch Microelect Engn, Brisbane, Qld 4111, Australia
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose a new framework for utilizing frequency information from the short-term power spectrum of speech. Feature extraction is based on the cepstral coefficients derived from the histograms of subband spectral centroids (SSC). Two new feature extraction algorithms are proposed, one based on frequency information alone, and the other which efficiently combines the frequency and amplitude information from the speech power spectrum. Experimental study on an automatic speech recognition task hag shown that the proposed methods outperform the conventional speech front-ends in presence of additive white noise, while they perform comparably in the noise-free conditions.
引用
收藏
页码:85 / 88
页数:4
相关论文
共 50 条
  • [1] Robust speech recognition in noisy environments based on subband spectral centroid histograms
    Gajic, B
    Paliwal, KK
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 600 - 608
  • [2] Model compensation for features based on subband spectral centroid histograms
    Pettersen, Svein G.
    Gajic, Bojana
    2006 7TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2006, : 306 - +
  • [3] Robust speech feature extraction based on dynamic minimum subband spectral subtraction
    Ma, Xin
    Zhou, Weidong
    Ju, Fang
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 1056 - 1061
  • [4] Spectral subband centroid features for speech recognition
    Paliwal, KK
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 617 - 620
  • [5] Feature Extraction Using Spectral Centroid and Mel Frequency Cepstral Coefficient for Quranic Accent Automatic Identification
    Kamarudin, Noraziahtulhidayu
    Al-Haddad, S. A. R.
    Hashim, Shaiful Jahari
    Nematollahi, Mohammad Ali
    Bin Hassan, Abd Rauf
    2014 IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2014,
  • [6] Binaural Bark subband preprocessing of nonstationary signals for noise robust speech feature extraction
    Peters, M
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 281 - 284
  • [7] Binaural bark subband preprocessing of nonstationary signals for noise robust speech feature extraction
    Peters, M
    PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1998, : 609 - 612
  • [8] Subband feature extraction using lapped orthogonal transform for speech recognition
    Tufekci, Z
    Gowdy, JN
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 149 - 152
  • [9] Robust spatial and spectral feature extraction for multispectral and hyperspectral imagery
    Pinzon, JE
    Ustin, SL
    Castaneda, CM
    Pierce, JF
    Costick, LA
    ALGORITHMS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY IV, 1998, 3372 : 199 - 210
  • [10] Spectral Deconvolution and Feature Extraction With Robust Adaptive Tikhonov Regularization
    Liu, Hai
    Yan, Luxin
    Chang, Yi
    Fang, Houzhang
    Zhang, Tianxu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2013, 62 (02) : 315 - 327