Performance evaluation of front-end algorithms for robust speech recognition

被引:0
|
作者
Cheng, O [1 ]
Abdulla, W [1 ]
Salcic, Z [1 ]
机构
[1] Univ Auckland, Dept Elect & Comp Engn, Auckland 1, New Zealand
来源
ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS | 2005年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional speech feature extraction front-end algorithms (LPCC, PLP and MFCC) suffer severe performance degradation in noisy environment, especially when there is a noise level mismatch between the training and testing environments. Two more recently developed algorithms, namely Gammatone Cepstral Coefficients (GTCC) and Zero-crossings with Peak Amplitude (ZCPA), are claimed to have better performance than the conventional algorithms. To verify the claim, HMM-based speaker-independent continuous speech recognition experiments are conducted using TIMIT database. In these experiments, training data is kept in clean condition while various levels of white Gaussian noise are added to the testing data. Results suggest that GTCC outperforms PLP, which is the best amongst the three conventional algorithms.. by 1.6% in 0dB SNR to 4.4% in 20dB SNR. While ZCPA does not perform well in clean conditions, it performs better than PLP by 1. 5% in 20dB SNR to 4. 1 % in 0dB SNR. However, it has much higher computational complexity than all other evaluated algorithms.
引用
收藏
页码:711 / 714
页数:4
相关论文
共 50 条
  • [1] A robust front-end for telephone speech recognition
    Cho, HY
    Chi, SM
    Oh, YH
    PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
  • [2] A comparison of front-end configurations for robust speech recognition
    Milner, B
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 797 - 800
  • [3] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
    Wang, Haikun
    Ye, Zhongfu
    Chen, Jingdong
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
  • [4] Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
    Narayanan, Arun
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 826 - 835
  • [5] Robust Front-End Processing For Emotion Recognition In Noisy Speech
    Pandharipande, Meghna
    Chakraborty, Rupayan
    Panda, Ashish
    Kopparapu, Sunil Kumar
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 324 - 328
  • [6] ROBUST FRONT-END PROCESSING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
    Das, Biswajit
    Panda, Ashish
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5235 - 5239
  • [7] A Reassigned Front-End for Speech Recognition
    Tryfou, Georgina
    Omologo, Maurizio
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 553 - 557
  • [8] Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End
    Tan, Qun Feng
    Georgiou, Panayiotis G.
    Narayanan, Shrikanth
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2418 - 2429
  • [9] Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments
    Dimitriadis, Dimitrios
    Segura, Jose C.
    Garcia, Luz
    Potamianos, Alexandros
    Maragos, Petros
    Pitsikalis, Vassilis
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2221 - +
  • [10] Auditory masking based acoustic front-end for robust speech recognition
    Paliwal, KK
    Lilly, BT
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 165 - 168