Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition

Cited by: 0
Authors
Jen-Tzung Chien
Jain-Ray Lai
Affiliation
[1] National Cheng Kung University,Department of Computer Science and Information Engineering
Source
Journal of VLSI Signal Processing Systems for Signal, Image and Video Technology | 2004 / Vol. 36
Keywords
microphone array; delay-and-sum beamformer; coherence measure; model adaptation; speech enhancement; speech recognition;
DOI
Not available
CLC number
Subject classification number
Abstract
This paper presents a combined microphone array and model adaptation algorithm for hands-free speech recognition. Our purpose is to remove the inconvenience of the head-mounted or hand-held microphones that conventional speech recognizers require. To improve speech quality under car noise interference, a linear microphone array is used as a robust acquisition system. A time-domain coherence measure (TDCM) is applied to reliably estimate the time delay between speech signals collected by different microphones. The estimated delay is then used in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models toward the acoustic conditions of the enhanced test speech for robust speech recognition. In acquisition and recognition experiments using connected Chinese digits, we found that TDCM estimates the time delay effectively and that increasing the speech sampling rate helps in determining the time delay. Incorporating the model adaptation scheme significantly reduces recognition errors with moderate computational overhead.
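
The delay estimation and delay-and-sum steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' TDCM: the abstract does not give its definition, so a plain normalized cross-correlation peak is used here as a stand-in. The function names (`shift`, `estimate_delay`, `delay_and_sum`), the white-noise source, the four-microphone setup, and the integer-sample delays are all assumptions made for illustration.

```python
import numpy as np

def shift(x, d):
    """Advance x by d samples (negative d delays it), zero-filling the gap."""
    y = np.zeros_like(x)
    if d > 0:
        y[:len(x) - d] = x[d:]
    elif d < 0:
        y[-d:] = x[:len(x) + d]
    else:
        y[:] = x
    return y

def estimate_delay(ref, sig, max_lag):
    """Lag (in samples) by which sig trails ref, taken at the peak of the
    normalized cross-correlation. Stand-in for TDCM, not the paper's measure."""
    lags = np.arange(-max_lag, max_lag + 1)
    scores = []
    for lag in lags:
        if lag >= 0:
            a, b = ref[:len(ref) - lag], sig[lag:]
        else:
            a, b = ref[-lag:], sig[:len(sig) + lag]
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        scores.append(np.dot(a, b) / denom if denom > 0 else 0.0)
    return int(lags[int(np.argmax(scores))])

def delay_and_sum(channels, delays):
    """Time-align each channel by its estimated delay, then average."""
    aligned = [shift(ch, d) for ch, d in zip(channels, delays)]
    return np.mean(aligned, axis=0)

# Hypothetical test signal: a white-noise source (stand-in for speech)
# reaching four microphones with assumed delays, plus independent noise.
rng = np.random.default_rng(0)
n = 4000
source = rng.standard_normal(n)
true_delays = [0, 3, 6, 9]
channels = [shift(source, -d) + 0.5 * rng.standard_normal(n)
            for d in true_delays]

# Estimate each channel's delay relative to the first microphone.
est = [estimate_delay(channels[0], ch, max_lag=20) for ch in channels]
enhanced = delay_and_sum(channels, est)

# Mean squared error against the clean source, before and after beamforming.
err_single = float(np.mean((channels[0] - source) ** 2))
err_beamformed = float(np.mean((enhanced - source) ** 2))
```

Because this sketch searches integer lags only, its delay resolution is one sample period. That is consistent with the abstract's observation that a higher sampling rate helps in determining the time delay, although the paper's own reasoning may differ.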
Pages: 141 - 151
Page count: 10
Related papers
50 in total
  • [31] Noise reduction based on microphone array and post-filtering for robust speech recognition
    Li, Junfeng
    Akagi, Masato
    Suzuki, Yoiti
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 680 - +
  • [32] A SPEECH PRESENCE MICROPHONE ARRAY BEAMFORMER USING MODEL BASED SPEECH PRESENCE PROBABILITY ESTIMATION
    Yu, Tao
    Hansen, John H. L.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 213 - 216
  • [33] Intentional Voice Command Detection for Completely Hands-Free Speech Interface in Home Environments
    Obuchi, Yasunari
    Togami, Masahito
    Sumiyoshi, Takashi
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 119 - 122
  • [34] Comparison of Hands-Free Speech-Based Navigation Techniques for Virtual Reality Training
    Calandra, Davide
    Prattico, Filippo Gabriele
    Lamberti, Fabrizio
    2022 IEEE 21ST MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (IEEE MELECON 2022), 2022, : 85 - 90
  • [35] Speech recognition based on HMM decomposition and composition method with a microphone array in noisy reverberant environments
    Miki, K
    Nishiura, T
    Nakamura, S
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2002, 85 (09): : 13 - 22
  • [36] Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array
    Li, Weifeng
    Wang, Longbiao
    Zhou, Yicong
    Dines, John
    Magimai-Doss, Mathew
    Bourlard, Herve
    Liao, Qingmin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) : 2244 - 2255
  • [37] Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech
    Wang, Quandong
    Wang, Sicheng
    Ge, Fengpei
    Han, Chang Woo
    Lee, Jaewon
    Guo, Lianghao
    Lee, Chin-Hui
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 21 - 25
  • [38] Acoustic Model Adaptation for Speech Recognition
    Shinoda, Koichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2348 - 2362
  • [39] Design and implementation of a MEMS microphone array system for real-time speech acquisition
    Hafizovic, Ines
    Nilsen, Carl-Inge Colombo
    Kjolerbakken, Morgan
    Jahr, Vibeke
    APPLIED ACOUSTICS, 2012, 73 (02) : 132 - 143
  • [40] Microphone Array Processing Strategies for Distant-Based Automatic Speech Recognition
    Khoubrouy, Soudeh A.
    Hansen, John H. L.
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (10) : 1344 - 1348