ROBUST MULTIPLE SPEECH SOURCE LOCALIZATION USING TIME DELAY HISTOGRAM

被引:0
作者
Huang, Zhaoqiong [1 ]
Zhan, Ge [1 ]
Ying, Dongwen [1 ]
Yan, Yonghong [1 ]
机构
[1] Chinese Acad Sci, Key Lab Speech Acoust & Content Understanding, Beijing 100864, Peoples R China
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
Speech source localization; time delay histogram; spatial aliasing; spatial resolution; direction of arrival;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spatial aliasing and spatial resolution are the two issues faced by most multiple speech source localization methods. The histogram of time delays is a simple but effective method to deal with these two issues on linear arrays. But few methods were capable of applying the time delay histogram to directional-of-arrivals (DOAs) estimation using a planar array. This paper proposes a novel method to estimate DOAs of multiple speech sources based on time delay histograms across all microphones of a planar array. The pairwise time delays of different sources are firstly obtained from each time delay histogram, and then, the time delays are identified with variant speech sources. Eventually, the DOA of each source is estimated by regression over its associated time delays. We conducted some experiments in both simulated and real environments to evaluate the proposed method using an eight-element circular array. The experimental results confirmed not only its high computational efficiency, but also its superiority in spatial resolution and spatial anti-aliasing.
引用
收藏
页码:3191 / 3195
页数:5
相关论文
共 16 条
[1]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[2]  
Araki S., 2006, IEEE International Conference on Acoustics, Speech and Signal Processing, P33
[3]  
Chen J., 2006, EURASIP J APPL SIG P, p1C
[4]  
Garofolo J., 1988, Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database
[5]   Two decades of array signal processing research - The parametric approach [J].
Krim, H ;
Viberg, M .
IEEE SIGNAL PROCESSING MAGAZINE, 1996, 13 (04) :67-94
[6]   MULTIPLE BROAD-BAND SOURCE LOCATION USING STEERED COVARIANCE MATRICES [J].
KROLIK, J ;
SWINGLER, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (10) :1481-1494
[7]   Robust Source Localization in Reverberant Environments Based on Weighted Fuzzy Clustering [J].
Kuehne, Marco ;
Togneri, Roberto ;
Nordholm, Sven .
IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (1-3) :85-88
[8]  
Lathoud G., 2004, P 1 INT WORKSH MACH, p192C
[9]   Localization of multiple sound sources with two microphones [J].
Liu, C ;
Wheeler, BC ;
O'Brien, WD ;
Bilger, RC ;
Lansing, CR ;
Feng, AS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04) :1888-1905
[10]   A Novel Multiple Sparse Source Localization Using Triangular Pyramid Microphone Array [J].
Ren, Mengqi ;
Zou, Yue Xian .
IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (02) :83-86