Combining standard and throat microphones for robust speech recognition

被引：67

作者：

Graciarena, M

Franco, H

Sonmez, K

Bratt, H

机构：

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina

来源：

IEEE SIGNAL PROCESSING LETTERS | 2003年 / 10卷 / 03期

关键词：

noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;

D O I：

10.1109/LSP.2003.808549

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

引用

页码：72 / 74

页数：3

共 50 条

[11] FUSION OF STANDARD AND ALTERNATIVE ACOUSTIC SENSORS FOR ROBUST AUTOMATIC SPEECH RECOGNITION
Heracleous, Panikos
Even, Jani
Ishi, Carlos T.
Miyashita, Takahiro
Hagita, Norihiro
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4837 - 4840
[12] Throat Microphone Speech Recognition using MFCC
Vijayan, Amritha
Mathai, Bipil Mary
Valsalan, Karthik
Johnson, Riyanka Raji
Mathew, Lani Rachel
Gopakumar, K.
2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 392 - 395
[13] Subband correlation and robust speech recognition
McAuley, J
Ming, J
Stewart, D
Hanna, P
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 956 - 964
[14] Speech enhancement for robust speech recognition in car environments using grifriths-jim ANC based on two-paired microphones
Cho, YS
Ko, HS
2004 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS, PROCEEDINGS, 2004, : 123 - 127
[15] COMBINING MISSING-DATA RECONSTRUCTION AND UNCERTAINTY DECODING FOR ROBUST SPEECH RECOGNITION
Gonzalez, Jose A.
Peinado, Antonio M.
Gomez, Angel M.
Ma, Ning
Barker, Jon
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4693 - 4696
[16] An analysis of the effect of combining standard and alternate sensor signals on recognition of syllabic units for multimodal speech recognition
Radha, N.
Shahina, A.
Prabha, P.
Sri, Preethi B. T.
Khan, Nayeemulla A.
PATTERN RECOGNITION LETTERS, 2018, 115 : 39 - 49
[17] Robust Speech Recognition using Generalized Distillation Framework
Markov, Konstantin
Matsui, Tomoko
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2364 - 2368
[18] A novel channel estimate for noise robust speech recognition
Vanderreydt, Geoffroy
Demuynck, Kris
COMPUTER SPEECH AND LANGUAGE, 2024, 86
[19] Robust Speech Recognition by Nonlocal Means Denoising Processing
Xu, Haitian
Tan, Zheng-Hua
Dalsgaard, Paul
Lindberg, Brge
IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 701 - 704
[20] Extended VTS for Noise-Robust Speech Recognition
van Dalen, Rogier C.
Gales, Mark J. F.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 733 - 743

← 1 2 3 4 5 →