Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition

被引:0
作者
Jen-Tzung Chien
Jain-Ray Lai
机构
[1] National Cheng Kung University,Department of Computer Science and Information Engineering
来源
Journal of VLSI signal processing systems for signal, image and video technology | 2004年 / 36卷
关键词
microphone array; delay-and-sum beamformer; coherence measure; model adaptation; speech enhancement; speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a combined microphone array and model adaptation algorithm for hands-free speech recognition. Our purpose is to remove the inconvenience of using head-mounted/hand-holding microphone in conventional speech recognizer. To improve the speech quality with car noise interference, a linear microphone array is applied and acted as robust acquisition system. A time-domain coherence measure (TDCM) is applied to reliably estimate the time delay for speech signals collected by different microphones. The estimated delay is adopted in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models to get close to the acoustic conditions of the enhanced test speech for robust speech recognition. In acquisition and recognition experiments using connected Chinese digits, we found that TDCM can effectively estimate the time delay. The increase in the speech sampling rate is helpful to determine the time delay. Incorporating the model adaptation scheme significantly reduces the recognition errors with moderate computation overhead.
引用
收藏
页码:141 / 151
页数:10
相关论文
共 50 条
  • [41] Template-based Spectral Estimation Using Microphone Array for Speech Recognition
    Tamura, Satoshi
    Hishikawa, Eriko
    Taguchi, Wataru
    Hayamizu, Satoru
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2050 - +
  • [42] COMPARISON OF REFERENCE MICROPHONE SELECTION ALGORITHMS FOR DISTRIBUTED MICROPHONE ARRAY BASED SPEECH ENHANCEMENT IN MEETING RECOGNITION SCENARIOS
    Araki, Shoko
    Ono, Nobutaka
    Kinoshita, Keisuke
    Delcroix, Marc
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 316 - 320
  • [43] Adaptive Microphone Array Processing for High-Performance Speech Recognition in Car Environment
    Hong, Jungpyo
    Han, Seungho
    Jeong, Sangbae
    Hahn, Minsoo
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (01) : 260 - 266
  • [44] Speech Interaction to Control a Hands-Free Delivery Robot for High-Risk Health Care Scenarios
    Grasse, Lukas
    Boutros, Sylvain J.
    Tata, Matthew S.
    [J]. FRONTIERS IN ROBOTICS AND AI, 2021, 8
  • [45] A system for speech enhancement in the context of hands-free radiotelephony with combined noise reduction and acoustic echo cancellation
    Scalart, P
    Benamar, A
    [J]. SPEECH COMMUNICATION, 1996, 20 (3-4) : 203 - 214
  • [46] Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface
    Even, Jani
    Ishi, Carlos
    Saruwatari, Hiroshi
    Hagita, Norihiro
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 977 - 980
  • [47] Model Adaptation Based on Improved Variance Estimation for Robust Speech Recognition
    Lu, Yong
    Xu, Zongyu
    Yan, Qin
    Zhou, Lin
    [J]. 2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,
  • [48] TWO-STEP ACOUSTIC MODEL ADAPTATION FOR DYSARTHRIC SPEECH RECOGNITION
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6104 - 6108
  • [49] A novel HMM model adaptation and compensation method for robust speech recognition
    Ning, GX
    Wei, G
    [J]. INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 274 - 277
  • [50] Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots
    Ding, Ing-Jr
    Shi, Jia-Yi
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 719 - 729