Likelihood-Maximizing-Based Multiband Spectral Subtraction for Robust Speech Recognition

被引:0
|
作者
Bagher BabaAli
Hossein Sameti
Mehran Safayani
机构
[1] Sharif University of Technology,Department of Computer Engineering
关键词
Speech Recognition; Speech Signal; Recognition Accuracy; Automatic Speech Recognition; Speech Quality;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic speech recognition performance degrades significantly when speech is affected by environmental noise. Nowadays, the major challenge is to achieve good robustness in adverse noisy conditions so that automatic speech recognizers can be used in real situations. Spectral subtraction (SS) is a well-known and effective approach; it was originally designed for improving the quality of speech signal judged by human listeners. SS techniques usually improve the quality and intelligibility of speech signal while speech recognition systems need compensation techniques to reduce mismatch between noisy speech features and clean trained acoustic model. Nevertheless, correlation can be expected between speech quality improvement and the increase in recognition accuracy. This paper proposes a novel approach for solving this problem by considering SS and the speech recognizer not as two independent entities cascaded together, but rather as two interconnected components of a single system, sharing the common goal of improved speech recognition accuracy. This will incorporate important information of the statistical models of the recognition engine as a feedback for tuning SS parameters. By using this architecture, we overcome the drawbacks of previously proposed methods and achieve better recognition accuracy. Experimental evaluations show that the proposed method can achieve significant improvement of recognition rates across a wide range of signal to noise ratios.
引用
收藏
相关论文
共 50 条
  • [21] The synergy between bounded-distance HMM and spectral subtraction for robust speech recognition
    Vicente-Pena, Jesus
    Diaz-de-Maria, Fernando
    Kleijn, W. Bastiaan
    SPEECH COMMUNICATION, 2010, 52 (02) : 123 - 133
  • [22] Direct control on modulation spectrum for noise-robust speech recognition and spectral subtraction
    Wada, Naoya
    Hayasaka, Noboru
    Yoshizawa, Shingo
    Miyanaga, Yoshikazu
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2533 - +
  • [23] OPTIMIZING SPECTRAL SUBTRACTION AND WIENER FILTERING FOR ROBUST SPEECH RECOGNITION IN REVERBERANT AND NOISY CONDITIONS
    Gomez, Randy
    Kawahara, Tatsuya
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4566 - 4569
  • [24] Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction
    Mine, R
    Kobayashi, T
    Shirai, K
    SYSTEMS AND COMPUTERS IN JAPAN, 1996, 27 (14) : 37 - 44
  • [25] Spectral Subtraction Based on Non-extensive Statistics for Speech Recognition
    Pardede, Hilman
    Iwano, Koji
    Shinoda, Koichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2013, E96D (08): : 1774 - 1782
  • [26] Robust speech recognition based on spectral adjusting and warping
    Zhao, R
    Wang, Z
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 553 - 556
  • [27] Multiband, Multisensor Robust Features for Noisy Speech Recognition
    Dimitriadis, Dimitrios
    Maragos, Petros
    Lefkimmiatis, Stamatios
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 889 - 892
  • [28] Robust speech feature extraction based on dynamic minimum subband spectral subtraction
    Ma, Xin
    Zhou, Weidong
    Ju, Fang
    INTELLIGENT COMPUTING IN SIGNAL PROCESSING AND PATTERN RECOGNITION, 2006, 345 : 1056 - 1061
  • [29] CEPSTRAL NOISE SUBTRACTION FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Rehr, Robert
    Gerkmann, Timo
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 375 - 378
  • [30] FPGA Implementation of Spectral Subtraction for Automotive Speech Recognition
    Whittington, Jim
    Deo, Kapeel
    Kleinschmidt, Tristan
    Mason, Michael
    CIVVS: 2009 IEEE WORKSHOP ON COMPUTATIONAL INTELLIGENCE IN VEHICLES AND VEHICULAR SYSTEMS, 2009, : 72 - +