Performance evaluation of front-end algorithms for robust speech recognition

被引：0

作者：

Cheng, O ^{[1
]}

Abdulla, W ^{[1
]}

Salcic, Z ^{[1
]}

机构：

[1] Univ Auckland, Dept Elect & Comp Engn, Auckland 1, New Zealand

来源：

ISSPA 2005: THE 8TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS | 2005年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Conventional speech feature extraction front-end algorithms (LPCC, PLP and MFCC) suffer severe performance degradation in noisy environment, especially when there is a noise level mismatch between the training and testing environments. Two more recently developed algorithms, namely Gammatone Cepstral Coefficients (GTCC) and Zero-crossings with Peak Amplitude (ZCPA), are claimed to have better performance than the conventional algorithms. To verify the claim, HMM-based speaker-independent continuous speech recognition experiments are conducted using TIMIT database. In these experiments, training data is kept in clean condition while various levels of white Gaussian noise are added to the testing data. Results suggest that GTCC outperforms PLP, which is the best amongst the three conventional algorithms.. by 1.6% in 0dB SNR to 4.4% in 20dB SNR. While ZCPA does not perform well in clean conditions, it performs better than PLP by 1. 5% in 20dB SNR to 4. 1 % in 0dB SNR. However, it has much higher computational complexity than all other evaluated algorithms.

引用

页码：711 / 714

页数：4

共 50 条

[1] A robust front-end for telephone speech recognition
Cho, HY
Chi, SM
Oh, YH
PRICAI'98: TOPICS IN ARTIFICIAL INTELLIGENCE, 1998, 1531 : 636 - 644
[2] A comparison of front-end configurations for robust speech recognition
Milner, B
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 797 - 800
[3] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
Wang, Haikun
Ye, Zhongfu
Chen, Jingdong
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
[4] Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
Narayanan, Arun
Wang, DeLiang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (04) : 826 - 835
[5] Robust Front-End Processing For Emotion Recognition In Noisy Speech
Pandharipande, Meghna
Chakraborty, Rupayan
Panda, Ashish
Kopparapu, Sunil Kumar
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 324 - 328
[6] ROBUST FRONT-END PROCESSING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
Das, Biswajit
Panda, Ashish
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5235 - 5239
[7] A Reassigned Front-End for Speech Recognition
Tryfou, Georgina
Omologo, Maurizio
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 553 - 557
[8] Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End
Tan, Qun Feng
Georgiou, Panayiotis G.
Narayanan, Shrikanth
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (08): : 2418 - 2429
[9] Advanced Front-end for Robust Speech Recognition in Extremely Adverse Environments
Dimitriadis, Dimitrios
Segura, Jose C.
Garcia, Luz
Potamianos, Alexandros
Maragos, Petros
Pitsikalis, Vassilis
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2221 - +
[10] Auditory masking based acoustic front-end for robust speech recognition
Paliwal, KK
Lilly, BT
IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 165 - 168

← 1 2 3 4 5 →