Combining standard and throat microphones for robust speech recognition

被引：67

作者：

Graciarena, M

Franco, H

Sonmez, K

Bratt, H

机构：

[1] SRI Int, Speech Technol & Res Lab, Menlo Pk, CA 94025 USA

[2] Univ Buenos Aires, Sch Engn, Inst Biomed Engn, RA-1053 Buenos Aires, DF, Argentina

来源：

IEEE SIGNAL PROCESSING LETTERS | 2003年 / 10卷 / 03期

关键词：

noise robustness; probabilistic optimum filtering; speech recognition; throat microphone;

D O I：

10.1109/LSP.2003.808549

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the. probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

引用

页码：72 / 74

页数：3

共 50 条

[41] Robust speech recognition for car environment noise
Kokubo, H
Amano, A
Hataoka, N
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
[42] ROBUST FEATURE EXTRACTORS FOR CONTINUOUS SPEECH RECOGNITION
Alam, M. J.
Kenny, P.
Dumouchel, P.
O'Shaughnessy, D.
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 944 - 948
[43] Noise Suppression based on nonnegative matrix factorization for robust speech recognition
Fan, Hao-teng
Lin, Pao-han
Hung, Jeih-weih
2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1731 - +
[44] Magnitude Spectrum Enhancement for Robust Speech Recognition
Tu, Wen-hsiang
Hung, Jeih-weih
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4586 - 4589
[45] Feature Adaptation for Robust Mobile Speech Recognition
Lee, Hyeopwoo
Yook, Dongsuk
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1393 - 1398
[46] Multi-candidate missing data imputation for robust speech recognition
Wang, Yujun
Van Hamme, Hugo
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2012,
[47] Multi-candidate missing data imputation for robust speech recognition
Yujun Wang
Hugo Van hamme
EURASIP Journal on Audio, Speech, and Music Processing, 2012
[48] Modulation Spectrum Augmentation for Robust Speech Recognition
Yan, Bi-Cheng
Liu, Shih-Hung
Chen, Berlin
PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,
[49] Semantic Enhancement Framework for Robust Speech Recognition
Yang, Baochen
Yu, Kai
MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 81 - 88
[50] A noise-robust speech recognition approach incorporating normalized speech/non-speech likelihood into hypothesis scores
Oonishi, Tasuku
Iwano, Koji
Furui, Sadaoki
SPEECH COMMUNICATION, 2013, 55 (02) : 377 - 386

← 1 2 3 4 5 →