FUSION OF STANDARD AND ALTERNATIVE ACOUSTIC SENSORS FOR ROBUST AUTOMATIC SPEECH RECOGNITION

Cited by: 0

Authors:

Heracleous, Panikos [1]

Even, Jani [1]

Ishi, Carlos T. [1]

Miyashita, Takahiro [1]

Hagita, Norihiro [1]

Affiliations:

[1] ATR, Intelligent Robot & Commun Labs, Tokyo, Japan

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Keywords:

Alternative sensors; ear bone microphone; throat microphone; fusion; robust speech recognition;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper addresses the problem of environmental noise in human-human communication and in automatic speech recognition. To deal with this problem, the use of alternative acoustic sensors, which are attached to the talker and receive the uttered speech through skin or bone, is investigated. In the current study, throat microphones and ear bone microphones are integrated with standard microphones using several fusion methods. The results show that recognition rates in noisy environments increase drastically when these sensors are integrated with standard microphones. Moreover, the system shows no degradation in recognition in clean environments; in fact, recognition rates increase slightly there as well. Using late fusion to integrate a throat microphone, an ear bone microphone, and a standard microphone, we achieved a 44% relative improvement in recognition rate in a noisy environment and a 24% relative improvement in recognition rate in a clean environment.
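
The abstract does not say how the late fusion is computed. As a rough illustration only, one common form of late fusion combines the scores that separate recognizers assign to each hypothesis, using a weighted sum of log-likelihoods. The function name `late_fusion`, the stream weights, and the toy scores below are all hypothetical and are not taken from the paper.

```python
def late_fusion(stream_scores, weights):
    """Fuse per-hypothesis log-likelihoods from several sensor streams.

    stream_scores: list of dicts (one per sensor) mapping hypothesis -> log-likelihood.
    weights: one non-negative weight per stream (hypothetical values).
    Returns the best hypothesis and the fused score of every shared hypothesis.
    """
    if len(stream_scores) != len(weights):
        raise ValueError("one weight per stream is required")
    # Only hypotheses scored by every stream can be compared fairly.
    common = set(stream_scores[0])
    for scores in stream_scores[1:]:
        common &= set(scores)
    if not common:
        raise ValueError("streams share no hypothesis")
    # Weighted sum of log-likelihoods (an assumed fusion rule, not the paper's).
    fused = {
        hyp: sum(w * scores[hyp] for w, scores in zip(weights, stream_scores))
        for hyp in common
    }
    best = max(fused, key=fused.get)
    return best, fused


# Toy scores: the standard microphone is misled by noise, the body-attached
# sensors are not (illustrative numbers only).
standard = {"hello": -12.0, "yellow": -10.0}
throat = {"hello": -8.0, "yellow": -11.0}
ear_bone = {"hello": -9.0, "yellow": -12.0}

best, fused = late_fusion([standard, throat, ear_bone], [0.2, 0.4, 0.4])
print(best, fused)
```

In this toy case the standard microphone alone would prefer "yellow", while the fused decision follows the two body-attached sensors. In practice the weights would have to be tuned on held-out data.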

Pages: 4837-4840

Number of pages: 4
