The Neural-SRP method for positional sound source localization

D O I：

10.1109/IEEECONF59524.2023.10476973

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Steered Response Power (SRP) is a widely used method for the task of sound source localization using microphone arrays, showing satisfactory localization performance on many practical scenarios. However, its performance is diminished under highly reverberant environments. Although Deep Neural Networks (DNNs) have been previously proposed to overcome this limitation, most are trained for a specific number of microphones with fixed spatial coordinates. This restricts their practical application on scenarios frequently observed in wireless acoustic sensor networks, where each application has an ad-hoc microphone topology. We propose Neural-SRP, a DNN which combines the flexibility of SRP with the performance gains of DNNs. We train our network using simulated data and transfer learning, and evaluate our approach on recorded and simulated data. Results verify that Neural-SRP's localization performance significantly outperforms the baselines.

引用

页码：1318 / 1323

页数：6

共 34 条

[1]

Adavanne S, 2018, EUR SIGNAL PR CONF, P1462, DOI 10.23919/EUSIPCO.2018.8553182

[2] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].

ALLEN, JB ;

BERKLEY, DA .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950

[3] Multimodal fusion for multimedia analysis: a survey [J].

Atrey, Pradeep K. ;

Hossain, M. Anwar ;

El Saddik, Abdulmotaleb ;

Kankanhalli, Mohan S. .

MULTIMEDIA SYSTEMS, 2010, 16 (06) :345-379

[4]

Bertrand A., 2011, IEEE S COMMUN VEH TE, P1, DOI DOI 10.1109/SCVT.2011.6101302

[5]

Brandstein M.S., 2001, Microphone arrays: signal processing techniques and applications, DOI DOI 10.1007/978-3-662-04619-7

[6]

Cao Y., 2019, PROC DETECTION CLASS, P30

[7]

Chakrabarty S, 2017, IEEE WORK APPL SIG, P136, DOI 10.1109/WASPAA.2017.8170010

[8]

Chinaev A, 2019, INT CONF ACOUST SPEE, P641, DOI [10.1109/ICASSP.2019.8683605, 10.1109/icassp.2019.8683605]

[9]

Choi K, 2017, INT CONF ACOUST SPEE, P2392, DOI 10.1109/ICASSP.2017.7952585

[10]

Chung J., [No title captured]

← 1 2 3 4 →