The Neural-SRP method for positional sound source localization

被引:1
作者
Grinstein, Eric [1 ]
van Waterschoot, Toon [2 ]
Brookes, Mike [1 ]
Naylor, Patrick A. [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London, England
[2] Katholieke Univ Leuven, Dept Elect Engn ESAT, Leuven, Belgium
来源
FIFTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, IEEECONF | 2023年
基金
欧洲研究理事会;
关键词
Sound Source Localization (SSL); Deep Neural Network (DNN); Steered Response Power (SRP); Distributed Microphone Array (DMA);
D O I
10.1109/IEEECONF59524.2023.10476973
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Steered Response Power (SRP) is a widely used method for the task of sound source localization using microphone arrays, showing satisfactory localization performance on many practical scenarios. However, its performance is diminished under highly reverberant environments. Although Deep Neural Networks (DNNs) have been previously proposed to overcome this limitation, most are trained for a specific number of microphones with fixed spatial coordinates. This restricts their practical application on scenarios frequently observed in wireless acoustic sensor networks, where each application has an ad-hoc microphone topology. We propose Neural-SRP, a DNN which combines the flexibility of SRP with the performance gains of DNNs. We train our network using simulated data and transfer learning, and evaluate our approach on recorded and simulated data. Results verify that Neural-SRP's localization performance significantly outperforms the baselines.
引用
收藏
页码:1318 / 1323
页数:6
相关论文
共 34 条
[1]  
Adavanne S, 2018, EUR SIGNAL PR CONF, P1462, DOI 10.23919/EUSIPCO.2018.8553182
[2]   IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].
ALLEN, JB ;
BERKLEY, DA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950
[3]   Multimodal fusion for multimedia analysis: a survey [J].
Atrey, Pradeep K. ;
Hossain, M. Anwar ;
El Saddik, Abdulmotaleb ;
Kankanhalli, Mohan S. .
MULTIMEDIA SYSTEMS, 2010, 16 (06) :345-379
[4]  
Bertrand A., 2011, IEEE S COMMUN VEH TE, P1, DOI DOI 10.1109/SCVT.2011.6101302
[5]  
Brandstein M.S., 2001, Microphone arrays: signal processing techniques and applications, DOI DOI 10.1007/978-3-662-04619-7
[6]  
Cao Y., 2019, PROC DETECTION CLASS, P30
[7]  
Chakrabarty S, 2017, IEEE WORK APPL SIG, P136, DOI 10.1109/WASPAA.2017.8170010
[8]  
Chinaev A, 2019, INT CONF ACOUST SPEE, P641, DOI [10.1109/ICASSP.2019.8683605, 10.1109/icassp.2019.8683605]
[9]  
Choi K, 2017, INT CONF ACOUST SPEE, P2392, DOI 10.1109/ICASSP.2017.7952585
[10]  
Chung J., [No title captured]