TDOA Estimation for Multiple Sound Sources in Noisy and Reverberant Environments Using Broadband Independent Component Analysis

被引:75
作者
Lombard, Anthony [1 ]
Zheng, Yuanhang [1 ]
Buchner, Herbert [2 ]
Kellermann, Walter [1 ]
机构
[1] Univ Erlangen Nurnberg, Chair Multimedia Commun & Signal Proc, D-91058 Erlangen, Germany
[2] Tech Univ Berlin, Deutsch Telekom Labs, D-10587 Berlin, Germany
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 06期
关键词
Acoustic source localization; adaptive filters; independent component analysis; microphone arrays; reverberation; time difference of arrival (TDOA) estimation; TIME-DELAY ESTIMATION; PERFORMANCE; ALGORITHM; LOCATION;
D O I
10.1109/TASL.2010.2092765
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we show that minimization of the statistical dependence using broadband independent component analysis (ICA) can be successfully exploited for acoustic source localization. As the ICA signal model inherently accounts for the presence of several sources and multiple sound propagation paths, the ICA criterion offers a theoretically more rigorous framework than conventional techniques based on an idealized single-path and single-source signal model. This leads to algorithms which outperform other localization methods, especially in the presence of multiple simultaneously active sound sources and under adverse conditions, notably in reverberant environments. Three methods are investigated to extract the time difference of arrival (TDOA) information contained in the filters of a two-channel broadband ICA scheme. While for the first, the blind system identification (BSI) approach, the number of sources should be restricted to the number of sensors, the other methods, the averaged directivity pattern (ADP) and composite mapped filter (CMF) approaches can be used even when the number of sources exceeds the number of sensors. To allow fast tracking of moving sources, the ICA algorithm operates in block-wise batch mode, with a proportionate weighting of the natural gradient to speed up the convergence of the algorithm. The TDOA estimation accuracy of the proposed schemes is assessed in highly noisy and reverberant environments for two, three, and four stationary noise sources with speech-weighted spectral envelopes as well as for moving real speech sources.
引用
收藏
页码:1490 / 1503
页数:14
相关论文
共 45 条
[1]   A real-time blind source separation scheme and its application to reverberant and noisy acoustic environments [J].
Aichner, R ;
Buchner, H ;
Yan, F ;
Kellermann, W .
SIGNAL PROCESSING, 2006, 86 (06) :1260-1277
[2]   Exploiting narrowband efficiency for broadband convolutive blind source separation [J].
Aichner, Robert ;
Buchner, Herbert ;
Kellermann, Andwalter .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
[3]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[4]  
[Anonymous], P IEEE INT C AC SPEE
[5]  
[Anonymous], 2001, MICROPHONE ARRAYS SI
[6]  
[Anonymous], P JOINT WORKSH HANDS
[7]  
[Anonymous], P IEEE INT C AC SPEE
[8]  
[Anonymous], P 3 INT WORKSH COMP
[9]  
[Anonymous], P INT S IND COMP AN
[10]  
[Anonymous], P APSURI