An Improved Switch Speech Enhancement Algorithm for Automatic Speech Recognition

Cited by: 0

Authors:

Ma, Yongbao [1]

Zhou, Yi [1]

Liu, Jingang [1]

Xia, Jie [1]

Liu, Hongqing [1]

Affiliations:

[1] Chongqing University of Posts and Telecommunications, School of Communication and Information Engineering, Chongqing, People's Republic of China

Source:

2015 IEEE International Conference on Computer and Communications (ICCC) | 2015

Keywords:

ASR; speech enhancement; switch algorithm; OM-LSA; TCS; spectral amplitude estimator

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

To improve the performance and reduce the computational cost of automatic speech recognition (ASR) systems in noisy environments, this paper studies a new switch speech enhancement algorithm. First, a new approach is built on the optimally modified log-spectral amplitude (OM-LSA) estimator and uses an improved a priori signal-to-noise ratio (SNR) estimate based on temporal cepstrum smoothing (TCS), which improves its noise reduction performance. Next, this approach and the conventional Wiener algorithm are combined into a switch algorithm whose switching mechanism is driven by the maximum likelihood (ML) estimate of the a priori SNR and by voice activity detection (VAD). The proposed switch algorithm achieves both a performance improvement and a reduction in computational cost. Computer simulations with an ASR system verify its effectiveness.
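The switching idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record does not give the actual switching rule, so the function `choose_enhancer`, the 5 dB threshold, the frame-averaged SNR criterion, and the labels `"wiener"` and `"om_lsa_tcs"` are all hypothetical. Only the ML a priori SNR estimate, max(γ − 1, 0) with γ the a posteriori SNR, and the Wiener gain ξ / (1 + ξ) are standard textbook forms.

```python
import math


def ml_a_priori_snr(noisy_power, noise_power):
    """ML estimate of the a priori SNR for one frequency bin.

    Standard form: xi = max(gamma - 1, 0), where gamma = |Y|^2 / noise PSD
    is the a posteriori SNR.
    """
    if noise_power <= 0:
        raise ValueError("noise_power must be positive")
    return max(noisy_power / noise_power - 1.0, 0.0)


def wiener_gain(xi):
    """Wiener spectral gain for a priori SNR xi (linear scale)."""
    return xi / (1.0 + xi)


def choose_enhancer(noisy_powers, noise_powers, speech_active,
                    snr_threshold_db=5.0):
    """Pick an enhancer label for one frame.

    ASSUMPTION: this rule is illustrative only. It uses the cheaper Wiener
    filter when the VAD flags no speech or when the frame-averaged ML SNR is
    high, and the costlier OM-LSA/TCS branch otherwise.
    """
    xis = [ml_a_priori_snr(y, n) for y, n in zip(noisy_powers, noise_powers)]
    mean_xi = sum(xis) / len(xis)
    # Floor the mean before taking the log to avoid log10(0).
    mean_xi_db = 10.0 * math.log10(max(mean_xi, 1e-10))
    if not speech_active or mean_xi_db >= snr_threshold_db:
        return "wiener"
    return "om_lsa_tcs"
```

In this sketch, a frame is routed to the more expensive branch only when speech is present at low estimated SNR, which is one plausible way a switch could trade accuracy against computation.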

Citation

Pages: 430-435

Page count: 6

References (9 in total):

[1] Amehraye A., 1988, IEEE INT C AC SPEECH, P2081
[2] Breithaupt, Colin; Gerkmann, Timo; Martin, Rainer. A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing [J]. 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-12, 2008: 4897-4900
[3] Cohen, I; Berdugo, B. Noise estimation by minima controlled recursive averaging for robust speech enhancement [J]. IEEE Signal Processing Letters, 2002, 9 (01): 12-15
[4] Ephraim, Y; Malah, D. Speech enhancement using a minimum mean-square error log-spectral amplitude estimator [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1985, 33 (02): 443-445
[5] Ephraim, Y; Malah, D. Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator [J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1984, 32 (06): 1109-1121
[6] Hendriks, Richard C.; Heusdens, Richard; Jensen, Jesper. MMSE based noise PSD tracking with low complexity [J]. 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2010: 4266-4269
[7] Loizou P. C., 2013, SPEECH ENHANCEMENT T
[8] Varga, A; Steeneken, HJM. Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems [J]. Speech Communication, 1993, 12 (03): 247-251
[9] Virtanen T, 2013, TECHNIQUES NOISE ROB
