Combining standard and throat microphones for robust speech recognition

Cited by: 67

Authors:

Graciarena, M

Franco, H

Sonmez, K

Bratt, H

Affiliations:

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina

Source:

IEEE SIGNAL PROCESSING LETTERS | 2003 / Vol. 10 / No. 03

Keywords:

noise robustness; probabilistic optimum filtering; speech recognition; throat microphone

DOI:

10.1109/LSP.2003.808549

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809

Abstract:

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both with artificially added noise and with recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.
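
The idea of learning a feature mapping from stereo data can be illustrated with a minimal sketch. It is not the authors' POF implementation: as a deliberate simplification it fits a single least-squares affine map from the concatenated noisy features of both microphones to the clean standard-microphone features. The function names, the synthetic data generator, and its noise levels are all hypothetical and chosen only for illustration.

```python
import numpy as np


def _stack_inputs(noisy_std, noisy_throat):
    """Concatenate both microphones' features and append a bias column."""
    bias = np.ones((noisy_std.shape[0], 1))
    return np.hstack([noisy_std, noisy_throat, bias])


def train_affine_mapping(noisy_std, noisy_throat, clean_std):
    """Fit one least-squares affine map (a stand-in for POF, not POF itself).

    Each argument is a (frames, dims) array taken from simultaneous
    ("stereo") recordings; no transcriptions are needed.
    """
    inputs = _stack_inputs(noisy_std, noisy_throat)
    weights, *_ = np.linalg.lstsq(inputs, clean_std, rcond=None)
    return weights


def apply_affine_mapping(weights, noisy_std, noisy_throat):
    """Estimate clean standard-microphone features for new noisy frames."""
    return _stack_inputs(noisy_std, noisy_throat) @ weights


def make_synthetic_stereo(n_frames, dim, seed):
    """Hypothetical toy data: the standard microphone is heavily corrupted,
    the throat microphone is an attenuated but less noisy view."""
    rng = np.random.default_rng(seed)
    clean = rng.normal(size=(n_frames, dim))
    noisy_std = clean + rng.normal(scale=1.0, size=clean.shape)
    noisy_throat = 0.5 * clean + rng.normal(scale=0.2, size=clean.shape)
    return noisy_std, noisy_throat, clean


# Train on one synthetic stereo set, evaluate on another.
train_std, train_throat, train_clean = make_synthetic_stereo(2000, 4, seed=0)
mapping = train_affine_mapping(train_std, train_throat, train_clean)

test_std, test_throat, test_clean = make_synthetic_stereo(500, 4, seed=1)
estimate = apply_affine_mapping(mapping, test_std, test_throat)

mse_unprocessed = np.mean((test_std - test_clean) ** 2)
mse_mapped = np.mean((estimate - test_clean) ** 2)
```

On this toy data the mapped features should sit closer to the clean features than the unprocessed standard-microphone features do. That only illustrates why a second, less noise-sensitive channel can help; it says nothing about the recognition results reported in the paper.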

Pages: 72-74

Page count: 3
