Combining standard and throat microphones for robust speech recognition

被引:67
作者
Graciarena, M
Franco, H
Sonmez, K
Bratt, H
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina
关键词
noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;
D O I
10.1109/LSP.2003.808549
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.
引用
收藏
页码:72 / 74
页数:3
相关论文
共 50 条
  • [21] Multistream Bandpass Modulation Features for Robust Speech Recognition
    Nemala, Sridhar Krishna
    Patil, Kailash
    Elhilali, Mounya
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1284 - 1287
  • [22] Robust speech recognition using probabilistic union models
    Ming, J
    Jancovic, P
    Smith, FJ
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (06): : 403 - 414
  • [23] Normalizing the speech modulation spectrum for robust speech recognition
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1021 - +
  • [24] Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction
    Abe, Akihiro
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2849 - 2853
  • [25] Toward Robust Speech Recognition and Understanding
    Sadaoki Furui
    Journal of VLSI signal processing systems for signal, image and video technology, 2005, 41 : 245 - 254
  • [26] Toward robust speech recognition and understanding
    Furui, S
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2005, 41 (03): : 245 - 254
  • [27] Issues with uncertainty decoding for noise robust automatic speech recognition
    Liao, H.
    Gales, M. J. F.
    SPEECH COMMUNICATION, 2008, 50 (04) : 265 - 277
  • [28] Stereo-based stochastic mapping for robust speech recognition
    Afify, Mohamed
    Cui, Xiaodong
    Gao, Yuqing
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 377 - +
  • [29] Stereo-Based Stochastic Mapping for Robust Speech Recognition
    Afify, Mohamed
    Cui, Xiaodong
    Gao, Yuqing
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (07): : 1325 - 1334
  • [30] Discriminative classifiers with adaptive kernels for noise robust speech recognition
    Gales, M. J. F.
    Flego, F.
    COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04) : 648 - 662