An Improved Switch Speech Enhancement Algorithm for Automatic Speech Recognition

Times cited: 0
Authors
Ma, Yongbao [1]
Zhou, Yi [1]
Liu, Jingang [1]
Xia, Jie [1]
Liu, Hongqing [1]
Affiliations
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing, Peoples R China
Source
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC) | 2015
Keywords
ASR; speech enhancement; switch algorithm; OM-LSA; TCS; SPECTRAL AMPLITUDE ESTIMATOR
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
To improve the performance and reduce the computational cost of automatic speech recognition (ASR) systems in noisy environments, this paper studies a new switch speech enhancement algorithm. First, a new approach is built on the optimally modified log-spectral amplitude (OM-LSA) estimator, using an improved a priori signal-to-noise ratio (SNR) estimate based on temporal cepstrum smoothing (TCS); this approach achieves better noise reduction. Next, the new approach and the conventional Wiener algorithm are combined into a switch algorithm, whose switching mechanism is designed from the maximum likelihood (ML) estimate of the a priori SNR and voice activity detection (VAD). The proposed switch algorithm both improves performance and reduces computational cost. Computer simulations with an ASR system verify the effectiveness of the proposed algorithm.
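The abstract does not spell out the switching rule, so the following is only a minimal sketch of how such a switch could be organized, not the authors' method. It uses the standard ML estimate of the a priori SNR, max(|Y|^2 / noise power - 1, 0), and the standard Wiener gain xi / (1 + xi). The function names, the threshold value, the branch labels, and the decision to send noise-only or high-SNR frames to the cheaper Wiener branch are all assumptions made for illustration.

```python
def ml_a_priori_snr(noisy_power, noise_power):
    """ML estimate of the a priori SNR per frequency bin: max(|Y|^2 / noise - 1, 0)."""
    return [max(y / n - 1.0, 0.0) for y, n in zip(noisy_power, noise_power)]


def wiener_gain(xi):
    """Wiener gain G = xi / (1 + xi) for each bin."""
    return [x / (1.0 + x) for x in xi]


def choose_branch(noisy_power, noise_power, speech_present, snr_threshold=4.0):
    """Hypothetical switching rule (an assumption, not taken from the paper).

    speech_present is the VAD decision for the frame. Noise-only frames and
    frames whose mean ML a priori SNR reaches snr_threshold go to the cheap
    Wiener branch; low-SNR speech frames go to the costlier OM-LSA/TCS branch.
    """
    if not speech_present:
        return "wiener"
    xi = ml_a_priori_snr(noisy_power, noise_power)
    mean_xi = sum(xi) / len(xi)
    return "wiener" if mean_xi >= snr_threshold else "om_lsa_tcs"


# One frame with two bins: noisy power 2.0 against noise power 1.0 in each bin.
branch = choose_branch([2.0, 2.0], [1.0, 1.0], speech_present=True)
```

In this sketch the expensive estimator runs only on frames where the cheap one is expected to fall short, which is one way a switch could lower the average cost per frame.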
Pages: 430-435
Number of pages: 6
Related papers
9 in total
  • [1] Amehraye A., 1988, IEEE INT C AC SPEECH, P2081
  • [2] A novel a priori SNR estimation approach based on selective cepstro-temporal smoothing
    Breithaupt, Colin
    Gerkmann, Timo
    Martin, Rainer
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008: 4897 - 4900
  • [3] Noise estimation by minima controlled recursive averaging for robust speech enhancement
    Cohen, I
    Berdugo, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
  • [4] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) : 443 - 445
  • [5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) : 1109 - 1121
  • [6] MMSE BASED NOISE PSD TRACKING WITH LOW COMPLEXITY
    Hendriks, Richard C.
    Heusdens, Richard
    Jensen, Jesper
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010: 4266 - 4269
  • [7] Loizou P. C., 2013, SPEECH ENHANCEMENT T
  • [8] ASSESSMENT FOR AUTOMATIC SPEECH RECOGNITION .2. NOISEX-92 - A DATABASE AND AN EXPERIMENT TO STUDY THE EFFECT OF ADDITIVE NOISE ON SPEECH RECOGNITION SYSTEMS
    VARGA, A
    STEENEKEN, HJM
    [J]. SPEECH COMMUNICATION, 1993, 12 (03) : 247 - 251
  • [9] Virtanen T, 2013, TECHNIQUES NOISE ROB