FUSION OF STANDARD AND ALTERNATIVE ACOUSTIC SENSORS FOR ROBUST AUTOMATIC SPEECH RECOGNITION

Cited by: 0

Authors:

Heracleous, Panikos [1]

Even, Jani [1]

Ishi, Carlos T. [1]

Miyashita, Takahiro [1]

Hagita, Norihiro [1]

Affiliations:

[1] ATR, Intelligent Robot & Commun Labs, Tokyo, Japan

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Keywords:

Alternative sensors; ear bone microphone; throat microphone; fusion; robust speech recognition

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics]

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper focuses on the problem of environmental noise in human-human communication and in automatic speech recognition. To deal with this problem, the use of alternative acoustic sensors, which are attached to the talker and receive the uttered speech through skin or bone, is investigated. In the current study, throat microphones and ear bone microphones are integrated with standard microphones using several fusion methods. The results show that recognition rates in noisy environments increase drastically when these sensors are integrated with standard microphones. Moreover, the system shows no recognition degradation in clean environments; in fact, recognition rates also increase slightly. Using late fusion to integrate a throat microphone, an ear bone microphone, and a standard microphone, we achieved a 44% relative improvement in recognition rate in a noisy environment and a 24% relative improvement in recognition rate in a clean environment.
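
The abstract does not state how the late fusion is computed. As a minimal sketch, assuming one common form of late fusion (a weighted sum of per-stream log-likelihood scores over a shared list of hypotheses), the idea could look like the following. The function name `late_fusion`, the stream names, the weights, and the scores are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple


def late_fusion(
    stream_scores: Dict[str, List[float]],
    weights: Dict[str, float],
) -> Tuple[int, List[float]]:
    """Fuse per-stream log-scores by a weighted sum (illustrative only).

    stream_scores maps a sensor name to one log-score per hypothesis.
    weights maps the same sensor names to their stream weights.
    Returns the index of the best fused hypothesis and all fused scores.
    """
    names = list(stream_scores)
    if not names:
        raise ValueError("at least one stream is required")
    n_hyp = len(stream_scores[names[0]])
    if n_hyp == 0:
        raise ValueError("at least one hypothesis is required")
    if any(len(stream_scores[name]) != n_hyp for name in names):
        raise ValueError("all streams must score the same hypotheses")

    # Weighted sum of log-scores, computed independently per hypothesis.
    fused = [
        sum(weights[name] * stream_scores[name][i] for name in names)
        for i in range(n_hyp)
    ]
    best = max(range(n_hyp), key=lambda i: fused[i])
    return best, fused


# Hypothetical scores for two competing hypotheses in a noisy setting:
# the standard microphone prefers hypothesis 0, the body-attached
# sensors prefer hypothesis 1.
scores = {
    "standard": [-10.0, -12.0],
    "throat": [-11.0, -9.0],
    "ear_bone": [-10.5, -9.5],
}
stream_weights = {"standard": 0.2, "throat": 0.4, "ear_bone": 0.4}
best_index, fused_scores = late_fusion(scores, stream_weights)
```

In this sketch each recognizer runs on its own sensor stream and only the final scores are combined, which is what distinguishes late fusion from combining features before recognition. How the stream weights would be chosen is left open here.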


Pages: 4837-4840

Page count: 4
