Combining standard and throat microphones for robust speech recognition

被引：67

作者：

Graciarena, M

Franco, H

Sonmez, K

Bratt, H

机构：

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina

来源：

IEEE SIGNAL PROCESSING LETTERS | 2003年 / 10卷 / 03期

关键词：

noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;

D O I：

10.1109/LSP.2003.808549

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

引用

页码：72 / 74

页数：3

共 50 条

[31] Noise-Robust speech recognition of Conversational Telephone Speech
Chen, Gang
Tolba, Hesham
O'Shaughnessy, Douglas
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1101 - 1104
[32] Robust speech detection method for telephone speech recognition system
Kuroiwa, S
Naito, M
Yamamoto, S
Higuchi, N
SPEECH COMMUNICATION, 1999, 27 (02) : 135 - 148
[33] ACQUIRING VARIABLE LENGTH SPEECH BASES FOR FACTORISATION-BASED NOISE ROBUST SPEECH RECOGNITION
Hurmalainen, Antti
Virtanen, Tuomas
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[34] Auditory processing of speech signals for robust speech recognition in real-world noisy environments
Kim, DS
Lee, SY
Kil, RM
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01): : 55 - 69
[35] Adaptive nonlinear regression using multiple distributed microphones for in-car speech recognition
Li, WF
Miyajima, C
Nishino, T
Itou, K
Takeda, K
Itakura, F
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1716 - 1723
[36] Robust recognition of simultaneous speech by a mobile robot
Valin, Jean-Marc
Yamamoto, Shun'ichi
Rouat, Jean
Michaud, Francois
Nakadai, Kazuhiro
Okuno, Hiroshi G.
IEEE TRANSACTIONS ON ROBOTICS, 2007, 23 (04) : 742 - 752
[37] Auditory contrast spectrum for robust speech recognition
Lu, Xugang
Dang, Jianwu
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 325 - +
[38] Speaker and Noise Factorization for Robust Speech Recognition
Wang, Yongqiang
Gales, Mark J. F.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2149 - 2158
[39] Robust speech recognition using harmonic features
Goh, Yeh Huann
Raveendran, Paramesran
Jamuar, Sudhanshu Shekhar
IET SIGNAL PROCESSING, 2014, 8 (02) : 167 - 175
[40] Perceptual wavelet filtering for robust speech recognition
Van Pham, Tuan
Stark, Michael
Kubin, Gernot
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4385 - 4388

← 1 2 3 4 5 →