Closely coupled array processing and model-based compensation for microphone array speech recognition

Cited by: 12
Authors
Zhao, Xianyu [1]
Ou, Zhijian [1]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
array signal processing; microphone array; model-based compensation; robust speech recognition
DOI
10.1109/TASL.2006.881673
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
In conventional microphone array speech recognition, the array processor and the speech recognizer are loosely coupled. The only connection between the two modules is the enhanced target signal output from the array processor, which is then treated as a single input to the recognizer. In this approach, useful environmental information, which the array processor can provide and the recognizer needs to exploit, is ignored. Inherently, the array processor can generate multiple outputs of spatially filtered signals, acting as a multi-input-multi-output (MIMO) module. In this paper, a closely coupled approach is proposed, in which a recognizer with model-based noise compensation exploits the reference noise outputs from a MIMO array processor. Specifically, a multichannel model-based noise compensation is presented, including the compensation procedure using the vector Taylor series (VTS) expansion and parameter estimation using the expectation-maximization (EM) algorithm. It is also shown how to construct MIMO array processors from conventional beamformers. A number of practical implementations of the conventional loosely coupled approach and the proposed closely coupled approach were tested on a publicly available database, the Multichannel Overlapping Number Corpus (MONC). Experimental results showed that the proposed closely coupled approach significantly improved speech recognition performance in overlapping speech situations.
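The abstract does not give the paper's actual construction, so the following is only an illustrative sketch of what a MIMO array processor built from a conventional beamformer might look like. It assumes a generic delay-and-sum beamformer with a pairwise-difference blocking stage (in the style of a generalized sidelobe canceller), and it assumes the channels are already time-aligned toward the target. The function name `mimo_array_outputs` and the input layout are hypothetical.

```python
import numpy as np


def mimo_array_outputs(x):
    """Illustrative MIMO array processor (assumed design, not the paper's).

    x: array of shape (M, T) holding M microphone channels that are
       assumed to be already time-aligned toward the target speaker.
    Returns (target, noise_refs):
       target     -- shape (T,), delay-and-sum (channel average) output
       noise_refs -- shape (M - 1, T), adjacent-channel differences
    """
    x = np.asarray(x, dtype=float)
    if x.ndim != 2 or x.shape[0] < 2:
        raise ValueError("x must have shape (M, T) with M >= 2")

    # Averaging aligned channels reinforces the target component.
    target = x.mean(axis=0)

    # Differences of adjacent channels cancel any component that is
    # identical across channels (the aligned target), leaving signals
    # dominated by noise and interference: the reference noise outputs.
    noise_refs = x[1:] - x[:-1]
    return target, noise_refs


# Small demonstration with synthetic data.
rng = np.random.default_rng(0)
num_mics, num_samples = 4, 1000
speech = np.sin(2 * np.pi * 0.01 * np.arange(num_samples))
noise = 0.5 * rng.standard_normal((num_mics, num_samples))
target, noise_refs = mimo_array_outputs(speech + noise)
print(target.shape, noise_refs.shape)
```

In a loosely coupled system only `target` would reach the recognizer. In the closely coupled approach the abstract describes, the reference noise outputs (here `noise_refs`) would also be passed on for model-based noise compensation.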
Pages: 1114-1122
Number of pages: 9
Related papers
50 in total
  • [41] Speech recognition in the blind condition based on multiple directivity patterns using a microphone array
    Sekiya, T
    Kobayashi, T
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 373 - 376
  • [42] A two-element-microphone-array-based speech recognition system in vehicle environment
    Zhang, Heng
    Fu, Qiang
    Yan, Yonghong
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2009, 30 (01) : 51 - 54
  • [43] Noise reduction based on microphone array and post-filtering for robust speech recognition
    Li, Junfeng
    Akagi, Masato
    Suzuki, Yoiti
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 680 - +
  • [44] Microphone array speech enhancement based on optimized IMCRA
    Li, Qiuying
    Zhang, Tao
    Geng, Yanzhang
    Gao, Zhen
    NOISE CONTROL ENGINEERING JOURNAL, 2021, 69 (06) : 468 - 476
  • [45] Microphone Array Speech Separation Algorithm based on DNN
    Wu, Chaoyan
    Zhou, Lin
    Chen, Xijin
    Chen, Liyuan
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1305 - 1310
  • [46] MODEL-BASED PROCESSING FOR A LARGE-APERTURE ARRAY
    CANDY, JV
    SULLIVAN, EJ
    IEEE JOURNAL OF OCEANIC ENGINEERING, 1994, 19 (04) : 519 - 528
  • [47] A Robust Speech Enhancement Method Based on Microphone Array
    Zhang, Qiquan
    Wang, Mingjiang
    Zhang, Lu
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1673 - 1678
  • [48] SOURCE NUMBER ESTIMATION BASED ON CLUSTERING OF SPEECH ACTIVITY SEQUENCES FOR MICROPHONE ARRAY PROCESSING
    Jafari, Ingrid
    Ito, Nobutaka
    Souden, Mehrez
    Araki, Shoko
    Nakatani, Tomohiro
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [49] COMPARISON OF REFERENCE MICROPHONE SELECTION ALGORITHMS FOR DISTRIBUTED MICROPHONE ARRAY BASED SPEECH ENHANCEMENT IN MEETING RECOGNITION SCENARIOS
    Araki, Shoko
    Ono, Nobutaka
    Kinoshita, Keisuke
    Delcroix, Marc
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 316 - 320
  • [50] Speech enhancement and recognition using circular microphone array for service robots
    Choi, C
    Kong, D
    Kim, J
    Bang, S
    IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 3516 - 3521