Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition

被引：0

作者：

Jen-Tzung Chien

Jain-Ray Lai

机构：

[1] National Cheng Kung University,Department of Computer Science and Information Engineering

来源：

Journal of VLSI signal processing systems for signal, image and video technology | 2004年 / 36卷

关键词：

microphone array; delay-and-sum beamformer; coherence measure; model adaptation; speech enhancement; speech recognition;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper presents a combined microphone array and model adaptation algorithm for hands-free speech recognition. Our purpose is to remove the inconvenience of using head-mounted/hand-holding microphone in conventional speech recognizer. To improve the speech quality with car noise interference, a linear microphone array is applied and acted as robust acquisition system. A time-domain coherence measure (TDCM) is applied to reliably estimate the time delay for speech signals collected by different microphones. The estimated delay is adopted in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models to get close to the acoustic conditions of the enhanced test speech for robust speech recognition. In acquisition and recognition experiments using connected Chinese digits, we found that TDCM can effectively estimate the time delay. The increase in the speech sampling rate is helpful to determine the time delay. Incorporating the model adaptation scheme significantly reduces the recognition errors with moderate computation overhead.

引用

页码：141 / 151

页数：10

共 50 条

[41] Template-based Spectral Estimation Using Microphone Array for Speech Recognition
Tamura, Satoshi
Hishikawa, Eriko
Taguchi, Wataru
Hayamizu, Satoru
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2050 - +
[42] COMPARISON OF REFERENCE MICROPHONE SELECTION ALGORITHMS FOR DISTRIBUTED MICROPHONE ARRAY BASED SPEECH ENHANCEMENT IN MEETING RECOGNITION SCENARIOS
Araki, Shoko
Ono, Nobutaka
Kinoshita, Keisuke
Delcroix, Marc
[J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 316 - 320
[43] Adaptive Microphone Array Processing for High-Performance Speech Recognition in Car Environment
Hong, Jungpyo
Han, Seungho
Jeong, Sangbae
Hahn, Minsoo
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (01) : 260 - 266
[44] Speech Interaction to Control a Hands-Free Delivery Robot for High-Risk Health Care Scenarios
Grasse, Lukas
Boutros, Sylvain J.
Tata, Matthew S.
[J]. FRONTIERS IN ROBOTICS AND AI, 2021, 8
[45] A system for speech enhancement in the context of hands-free radiotelephony with combined noise reduction and acoustic echo cancellation
Scalart, P
Benamar, A
[J]. SPEECH COMMUNICATION, 1996, 20 (3-4) : 203 - 214
[46] Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface
Even, Jani
Ishi, Carlos
Saruwatari, Hiroshi
Hagita, Norihiro
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 977 - 980
[47] Model Adaptation Based on Improved Variance Estimation for Robust Speech Recognition
Lu, Yong
Xu, Zongyu
Yan, Qin
Zhou, Lin
[J]. 2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,
[48] TWO-STEP ACOUSTIC MODEL ADAPTATION FOR DYSARTHRIC SPEECH RECOGNITION
Takashima, Ryoichi
Takiguchi, Tetsuya
Ariki, Yasuo
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6104 - 6108
[49] A novel HMM model adaptation and compensation method for robust speech recognition
Ning, GX
Wei, G
[J]. INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 274 - 277
[50] Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots
Ding, Ing-Jr
Shi, Jia-Yi
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 719 - 729

← 1 2 3 4 5 →