MEETING RECOGNITION WITH ASYNCHRONOUS DISTRIBUTED MICROPHONE ARRAY

被引:0
|
作者
Araki, Shoko [1 ]
Ono, Nobutaka [2 ]
Kinoshita, Keisuke [1 ]
Delcroix, Marc [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Seika, Kyoto 6190237, Japan
[2] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
来源
2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2017年
基金
日本学术振兴会;
关键词
Meeting recognition; distributed microphones; blind synchronization; mask-based MVDR beamformer; independent vector analysis; BLIND SOURCE SEPARATION; SPEECH RECOGNITION; ICA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, recognition of conversational speech such as meetings has widely been studied. However, most existing approaches rely on using a single close talking microphone or a distant microphone array where all the microphones are synchronous. In contrast, this paper tackles a recognition task of conversational speech recorded with asynchronous distributed microphones, to which conventional array processing is not directly applicable. We demonstrate that we can significantly improve recognition performance even when microphones are asynchronous by combining blind synchronization and state-of-the-art microphone array speech enhancement techniques such as independent vector analysis (IVA) and a time-frequency mask based minimum variance distortionless response (MVDR) beamformer. Using such a front-end, we could reduce the word error rate from 42.2 % to 29.9 % for real meeting recordings.
引用
收藏
页码:32 / 39
页数:8
相关论文
共 50 条
  • [1] MEETING RECOGNITION WITH ASYNCHRONOUS DISTRIBUTED MICROPHONE ARRAY USING BLOCK-WISE REFINEMENT OF MASK-BASED MVDR BEAMFORMER
    Araki, Shoko
    Ono, Nobutaka
    Kinoshita, Keisuke
    Delcroix, Marc
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5694 - 5698
  • [2] COMPARISON OF REFERENCE MICROPHONE SELECTION ALGORITHMS FOR DISTRIBUTED MICROPHONE ARRAY BASED SPEECH ENHANCEMENT IN MEETING RECOGNITION SCENARIOS
    Araki, Shoko
    Ono, Nobutaka
    Kinoshita, Keisuke
    Delcroix, Marc
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 316 - 320
  • [3] On the robustness of distributed EM based BSS in asynchronous distributed microphone array scenarios
    Uezu, Yasufumi
    Kinoshita, Keisuke
    Souden, Mehrez
    Nakatani, Tomohiro
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3297 - 3301
  • [4] Circular microphone array for meeting system
    Tamai, Y
    Kagami, S
    Mizoguchi, H
    Sakaya, K
    Nagashima, K
    Takano, T
    PROCEEDINGS OF THE IEEE SENSORS 2003, VOLS 1 AND 2, 2003, : 1100 - 1105
  • [5] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [6] Acoustic Source Localization With Distributed Asynchronous Microphone Networks
    Canclini, A.
    Antonacci, F.
    Sarti, A.
    Tubaro, S.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02): : 439 - 443
  • [7] Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices
    Corey, Ryan M.
    Mittal, Manan
    Sarkar, Kanad
    Singer, Andrew C.
    INTERSPEECH 2022, 2022, : 5398 - 5402
  • [8] SLAM-based Online Calibration for Asynchronous Microphone Array
    Miura, Hiroki
    Yoshida, Takami
    Nakamura, Keisuke
    Nakadai, Kazuhiro
    ADVANCED ROBOTICS, 2012, 26 (17) : 1941 - 1965
  • [9] Simultaneous asynchronous microphone array calibration and sound source localisation
    Su, Daobilige
    Vidal-Calleja, Teresa
    Mins, Jaime Valls
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 5561 - 5567
  • [10] Asynchronous Microphone Array Calibration using Hybrid TDOA Information
    The Shenzhen Key Laboratory of Control Theory and Intelligent Systems, Southern University of Science and Technology , Shenzhen
    518055, China
    arXiv, 1600,