Distributed multiple speaker tracking based on time delay estimation in microphone array network

被引:1
作者
Wang, Rong [1 ]
Chen, Zhe [1 ]
Yin, Fuliang [1 ]
机构
[1] Dalian Univ Technol, Sch Informat & Commun Engn, Dalian 116023, Peoples R China
基金
中国国家自然科学基金;
关键词
delay estimation; reverberation; microphone arrays; speaker recognition; Kalman filters; sensor fusion; distributed microphone array network; DMA; multiple speaker scenarios; ambiguous observation; noisy environments; distributed multiple speaker tracking method; time delay estimation strategy; reliable time delays; distributed Kalman filter framework; SOURCE LOCALIZATION;
D O I
10.1049/iet-spr.2019.0613
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multiple speaker tracking in distributed microphone array (DMA) network is a challenging task. A critical issue for multiple speaker scenarios is to distinguish the ambiguous observation and associate it to the corresponding speaker, especially under reverberant and noisy environments. To address the problem, a distributed multiple speaker tracking method based on time delay estimation in DMA is proposed in this study. Specifically, the time delay estimated by the generalised cross-correlation function is treated as an observation. In order to distinguish the observation for each speaker, the possible time delays, refer to as candidates, are extracted based on data association technique. Considering the ambient influence, a time delay estimation strategy is designed to calculate the time delay for each speaker from the candidates. Finally, only the reliable time delays in DMA are propagated throughout the whole network by diffusion fusion algorithm and used for updating the speakers' state within the distributed Kalman filter framework. The proposed approach can track multiple speakers successfully in a non-centralised manner under reverberant and noisy environments. Simulation results indicate that, compared with other methods, the proposed method can achieve a smaller root mean square error for multiple speaker tracking, especially in adverse conditions.
引用
收藏
页码:591 / 601
页数:11
相关论文
共 27 条
[1]   Performance Improvement of TDOA-Based Speaker Localization in Joint Noisy and Reverberant Conditions [J].
Abutalebi, Hamid Reza ;
Momenzadeh, Hossein .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011,
[2]   Multiple Sound Source Location Estimation in Wireless Acoustic Sensor Networks Using DOA Estimates: The Data-Association Problem [J].
Alexandridis, Anastasios ;
Mouchtaris, Athanasios .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (02) :342-356
[3]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[4]  
[Anonymous], TIMIT ACOUSTIC PHONE
[5]  
Bechler Dirk, 2004, 2004 12th European Signal Processing Conference (EUSIPCO), P1987
[6]   Diffusion Strategies for Distributed Kalman Filtering and Smoothing [J].
Cattivelli, Federico S. ;
Sayed, Ali H. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2010, 55 (09) :2069-2084
[7]   Maximum likelihood sound source localization and beamforming for directional microphone arrays in distributed meetings [J].
Cha Zhang ;
Florencio, Dinei ;
Ba, Demba E. ;
Zhang, Zhengyou .
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (03) :538-548
[8]  
Gannot S., 2006, EURASIP Journal on Applied Signal Process, V2006, P1
[9]   Source localization in reverberant environments: Modeling and statistical analysis [J].
Gustafsson, T ;
Rao, BD ;
Trivedi, M .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06) :791-803
[10]  
Johnson DonH., Signal processing information databse