Combining standard and throat microphones for robust speech recognition

被引:67
作者
Graciarena, M
Franco, H
Sonmez, K
Bratt, H
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina
关键词
noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;
D O I
10.1109/LSP.2003.808549
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.
引用
收藏
页码:72 / 74
页数:3
相关论文
共 50 条
  • [41] Robust speech recognition for car environment noise
    Kokubo, H
    Amano, A
    Hataoka, N
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
  • [42] ROBUST FEATURE EXTRACTORS FOR CONTINUOUS SPEECH RECOGNITION
    Alam, M. J.
    Kenny, P.
    Dumouchel, P.
    O'Shaughnessy, D.
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 944 - 948
  • [43] Noise Suppression based on nonnegative matrix factorization for robust speech recognition
    Fan, Hao-teng
    Lin, Pao-han
    Hung, Jeih-weih
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1731 - +
  • [44] Magnitude Spectrum Enhancement for Robust Speech Recognition
    Tu, Wen-hsiang
    Hung, Jeih-weih
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4586 - 4589
  • [45] Feature Adaptation for Robust Mobile Speech Recognition
    Lee, Hyeopwoo
    Yook, Dongsuk
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1393 - 1398
  • [46] Multi-candidate missing data imputation for robust speech recognition
    Wang, Yujun
    Van Hamme, Hugo
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
  • [47] Multi-candidate missing data imputation for robust speech recognition
    Yujun Wang
    Hugo Van hamme
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [48] Modulation Spectrum Augmentation for Robust Speech Recognition
    Yan, Bi-Cheng
    Liu, Shih-Hung
    Chen, Berlin
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,
  • [49] Semantic Enhancement Framework for Robust Speech Recognition
    Yang, Baochen
    Yu, Kai
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 81 - 88
  • [50] A noise-robust speech recognition approach incorporating normalized speech/non-speech likelihood into hypothesis scores
    Oonishi, Tasuku
    Iwano, Koji
    Furui, Sadaoki
    SPEECH COMMUNICATION, 2013, 55 (02) : 377 - 386