Combining standard and throat microphones for robust speech recognition

被引:67
作者
Graciarena, M
Franco, H
Sonmez, K
Bratt, H
机构
[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA
[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina
关键词
noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;
D O I
10.1109/LSP.2003.808549
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.
引用
收藏
页码:72 / 74
页数:3
相关论文
共 50 条
  • [11] FUSION OF STANDARD AND ALTERNATIVE ACOUSTIC SENSORS FOR ROBUST AUTOMATIC SPEECH RECOGNITION
    Heracleous, Panikos
    Even, Jani
    Ishi, Carlos T.
    Miyashita, Takahiro
    Hagita, Norihiro
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4837 - 4840
  • [12] Throat Microphone Speech Recognition using MFCC
    Vijayan, Amritha
    Mathai, Bipil Mary
    Valsalan, Karthik
    Johnson, Riyanka Raji
    Mathew, Lani Rachel
    Gopakumar, K.
    2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 392 - 395
  • [13] Subband correlation and robust speech recognition
    McAuley, J
    Ming, J
    Stewart, D
    Hanna, P
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 956 - 964
  • [14] Speech enhancement for robust speech recognition in car environments using grifriths-jim ANC based on two-paired microphones
    Cho, YS
    Ko, HS
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, PROCEEDINGS, 2004, : 123 - 127
  • [15] COMBINING MISSING-DATA RECONSTRUCTION AND UNCERTAINTY DECODING FOR ROBUST SPEECH RECOGNITION
    Gonzalez, Jose A.
    Peinado, Antonio M.
    Gomez, Angel M.
    Ma, Ning
    Barker, Jon
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4693 - 4696
  • [16] An analysis of the effect of combining standard and alternate sensor signals on recognition of syllabic units for multimodal speech recognition
    Radha, N.
    Shahina, A.
    Prabha, P.
    Sri, Preethi B. T.
    Khan, Nayeemulla A.
    PATTERN RECOGNITION LETTERS, 2018, 115 : 39 - 49
  • [17] Robust Speech Recognition using Generalized Distillation Framework
    Markov, Konstantin
    Matsui, Tomoko
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2364 - 2368
  • [18] A novel channel estimate for noise robust speech recognition
    Vanderreydt, Geoffroy
    Demuynck, Kris
    COMPUTER SPEECH AND LANGUAGE, 2024, 86
  • [19] Robust Speech Recognition by Nonlocal Means Denoising Processing
    Xu, Haitian
    Tan, Zheng-Hua
    Dalsgaard, Paul
    Lindberg, Brge
    IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 701 - 704
  • [20] Extended VTS for Noise-Robust Speech Recognition
    van Dalen, Rogier C.
    Gales, Mark J. F.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 733 - 743