COMPARISON OF REFERENCE MICROPHONE SELECTION ALGORITHMS FOR DISTRIBUTED MICROPHONE ARRAY BASED SPEECH ENHANCEMENT IN MEETING RECOGNITION SCENARIOS

被引:0
作者
Araki, Shoko [1 ]
Ono, Nobutaka [2 ]
Kinoshita, Keisuke [1 ]
Delcroix, Marc [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, 2-4 Hikaridai,Seika Cho, Kyoto 6190237, Japan
[2] Tokyo Metropolitan Univ, Fac Syst Design, 6-6 Asahigaoka, Hino, Tokyo 1910065, Japan
来源
2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC) | 2018年
基金
日本学术振兴会;
关键词
meeting recognition; distributed microphones; reference microphone selection; speech enhancement; independent vector analysis (IVA); BLIND SOURCE SEPARATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses a front-end system for speech recognition of meeting conversations that are recorded with distributed microphones such as smartphones. When using distributed microphones, one of the microphones may be closer to the speaker than the others and thus provide high speech recognition accuracy due to a high signal-to-noise ratio and low reverberation. It is important to select such a microphone as a reference microphone channel in widely studied speech enhancement approaches, which estimate source images at a reference microphone. However, the reference microphone selection is still an open problem, especially for a distributed microphone array, where the sensitivity may differ among the microphones. In this paper, we discuss several approaches to select a reference microphone for multi-channel speech enhancement, such as independent vector analysis (IVA), and compare the performance of these approaches in terms of speech recognition accuracy.
引用
收藏
页码:316 / 320
页数:5
相关论文
共 23 条
[21]   Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming [J].
Ochiai, Tsubasa ;
Watanabe, Shinji ;
Hori, Takaaki ;
Hershey, John R. ;
Xiao, Xiong .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) :1274-1288
[22]  
Ono N, 2011, 2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), P189, DOI 10.1109/ASPAA.2011.6082320
[23]   Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays [J].
Souden, Mehrez ;
Kinoshita, Keisuke ;
Delcroix, Marc ;
Nakatani, Tomohiro .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) :354-367