A Posterior Approach for Microphone Array Based Speech Recognition

被引:0
|
作者
Wang, Dong [1 ]
Himawan, Ivan [1 ]
Frankel, Joe [1 ]
King, Simon [1 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9YL, Midlothian, Scotland
关键词
speech recognition; microphone array; beamforming; tandem approach;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic speech recognition (ASR) is difficult in environments such as multiparty meetings because of adverse acoustic conditions: background noise, reverberation and cross-talk. Microphone arrays can increase ASR accuracy dramatically in such situations. However, most existing beamforming techniques use time-domain signal processing theory and are based on a geometric analysis of the relationship between sources and microphones. This limits their application, and leads to performance degradation when the geometric properties are unavailable, or heterogeneous channels are used. We present a new posterior-based approach for microphone array speech recognition. Instead of enhancing speech signals, we enhance posterior phone probabilities which are used in a tandem ANN-HMM system. Significant improvements were achieved over a single channel baseline. Combining beamforming and our method is significantly better than beamforming alone, especially in a moving speakers scenario.
引用
收藏
页码:996 / 999
页数:4
相关论文
共 50 条
  • [31] Kinect microphone array-based speech and speaker recognition for the exhibition control of humanoid robots
    Ding, Ing-Jr
    Shi, Jia-Yi
    COMPUTERS & ELECTRICAL ENGINEERING, 2017, 62 : 719 - 729
  • [32] Enhanced robot speech recognition based on microphone array source separation and missing feature theory
    Yamamoto, S
    Valin, JM
    Nakadai, K
    Rouat, J
    Michaud, F
    Ogata, T
    Okuno, HG
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 1477 - 1482
  • [33] Microphone array speech enhancement based on optimized IMCRA
    Li, Qiuying
    Zhang, Tao
    Geng, Yanzhang
    Gao, Zhen
    NOISE CONTROL ENGINEERING JOURNAL, 2021, 69 (06) : 468 - 476
  • [34] Microphone Array Speech Separation Algorithm based on DNN
    Wu, Chaoyan
    Zhou, Lin
    Chen, Xijin
    Chen, Liyuan
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1305 - 1310
  • [35] Speech dereverberation and noise reduction with a combined microphone array approach
    Gonzalez-Rodriguez, J
    Sanchez-Bote, JL
    Ortega-Garcia, J
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1037 - 1040
  • [36] A Robust Speech Enhancement Method Based on Microphone Array
    Zhang, Qiquan
    Wang, Mingjiang
    Zhang, Lu
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1673 - 1678
  • [37] Microphone Array Speech Processing
    Nordholm, Sven
    Abhayapala, Thushara
    Doclo, Simon
    Gannot, Sharon
    Naylor, Patrick
    Tashev, Ivan
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [38] Microphone Array Speech Processing
    Sven Nordholm
    ThusharaD Abhayapala
    Simon Doclo
    Sharon Gannot
    P Naylor
    Ivan Tashev
    EURASIP Journal on Advances in Signal Processing, 2010
  • [39] MWF-based speech dereverberation with a local microphone array and an external microphone
    Ali, Randall
    van Waterschoot, Toon
    Moonen, Marc
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [40] COMBINING CEPSTRAL NORMALIZATION AND COCHLEAR IMPLANT-LIKE SPEECH PROCESSING FOR MICROPHONE ARRAY-BASED SPEECH RECOGNITION
    Cong-Thanh Do
    Taghizadeh, Mohammad J.
    Garner, Philip N.
    2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 137 - 142