Composite Model Particle Filter for Indoor Sound Source Location Based on Multi-feature

Cited by: 0

Authors:

Liu W. [1]

Pan H. [1]

Wang M. [2]

Affiliations:

[1] School of Information Science and Engineering, Zhejiang Sci-Tech University, Zhejiang, Hangzhou

[2] Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Ministry of Education, Zhejiang University of Technology, Zhejiang, Hangzhou

Source:

Binggong Xuebao/Acta Armamentarii | 2024 / Vol. 45 / No. 03

Keywords:

composite model; indoor sound source localization; multi-feature; particle filter; time delay estimation

DOI:

10.12382/bgxb.2022.0849

CLC number:

Subject classification code:

Abstract:

A multi-feature composite-model particle filter algorithm is proposed to improve the accuracy and robustness of sound source localization in noisy, reverberant environments. The likelihood function of the particle filter is constructed from multiple features of the received microphone signals: a convolutional neural network (CNN) extracts deep features from multiple-hypothesis time-delay estimation images, and a time-delay estimation model based on support vector regression (SVR) is established. Because a single feature cannot suppress noise and reverberation simultaneously, a beam output energy fusion mechanism is introduced. To handle the randomness of speaker motion, a composite model for sound source tracking is established, which improves the robustness of the speaker tracking system. Simulation and experimental results show that, with the composite model, the average position root mean square error (RMSE) of the multi-feature algorithm is more than 83% lower than that of the steered response power and time delay estimation (SRPTDE) algorithm, and that, under multi-feature observation, the average position RMSE of the composite model is more than 46% lower than that of the Langevin model and the random walk model. The proposed algorithm effectively tracks randomly moving sound sources in complex environments. © 2024 China Ordnance Industry Corporation. All rights reserved.
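
As a rough illustration of the ideas in the abstract, the following Python sketch runs a bootstrap particle filter whose motion step switches at random between a Langevin model and a random walk model, and whose likelihood multiplies two per-feature terms. It is not the paper's implementation. Everything concrete is an assumption made for illustration: the equal-probability mixing of the two motion models, the parameter values (`beta`, `v_bar`, `sigma_rw`, `sigma`), the 4 m x 3 m room, and the two noisy position observations `obs_a` and `obs_b`, which merely stand in for the time-delay and beam output energy features.

```python
import math
import random


def propagate(particle, dt, rng, p_langevin=0.5, beta=2.0, v_bar=1.0, sigma_rw=0.1):
    """Move one particle (x, y, vx, vy) under a randomly chosen motion model."""
    x, y, vx, vy = particle
    if rng.random() < p_langevin:
        # Langevin model: velocity decays toward zero and is driven by Gaussian noise.
        a = math.exp(-beta * dt)
        b = v_bar * math.sqrt(1.0 - a * a)
        vx = a * vx + b * rng.gauss(0.0, 1.0)
        vy = a * vy + b * rng.gauss(0.0, 1.0)
        x, y = x + dt * vx, y + dt * vy
    else:
        # Random walk model: position jitters, velocity is reset to zero.
        x, y = x + rng.gauss(0.0, sigma_rw), y + rng.gauss(0.0, sigma_rw)
        vx = vy = 0.0
    return (x, y, vx, vy)


def fused_likelihood(particle, obs_a, obs_b, sigma=0.3):
    """Multiply two Gaussian pseudo-likelihoods, one per observed feature (assumed form)."""
    d_a = (particle[0] - obs_a[0]) ** 2 + (particle[1] - obs_a[1]) ** 2
    d_b = (particle[0] - obs_b[0]) ** 2 + (particle[1] - obs_b[1]) ** 2
    return math.exp(-(d_a + d_b) / (2.0 * sigma ** 2))


def pf_step(particles, obs_a, obs_b, dt, rng):
    """One predict / weight / estimate / resample cycle."""
    moved = [propagate(p, dt, rng) for p in particles]
    weights = [fused_likelihood(p, obs_a, obs_b) for p in moved]
    total = sum(weights)
    if total == 0.0:  # every weight underflowed: fall back to uniform weights
        weights, total = [1.0] * len(moved), float(len(moved))
    est_x = sum(w * p[0] for w, p in zip(weights, moved)) / total
    est_y = sum(w * p[1] for w, p in zip(weights, moved)) / total
    resampled = rng.choices(moved, weights=weights, k=len(moved))
    return resampled, (est_x, est_y)


def rmse(estimates, truths):
    """Root mean square position error over a track."""
    sq = [(ex - tx) ** 2 + (ey - ty) ** 2 for (ex, ey), (tx, ty) in zip(estimates, truths)]
    return math.sqrt(sum(sq) / len(sq))


def run_demo(seed=0, n_particles=500, n_steps=30, dt=0.1):
    """Track a stationary source at (2.0, 1.5) in a hypothetical 4 m x 3 m room."""
    rng = random.Random(seed)
    truth = (2.0, 1.5)
    particles = [(rng.uniform(0, 4), rng.uniform(0, 3), 0.0, 0.0) for _ in range(n_particles)]
    estimates = []
    for _ in range(n_steps):
        # Synthetic stand-in observations: the true position plus small Gaussian noise.
        obs_a = (truth[0] + rng.gauss(0, 0.05), truth[1] + rng.gauss(0, 0.05))
        obs_b = (truth[0] + rng.gauss(0, 0.05), truth[1] + rng.gauss(0, 0.05))
        particles, est = pf_step(particles, obs_a, obs_b, dt, rng)
        estimates.append(est)
    return estimates, rmse(estimates, [truth] * n_steps)
```

Calling `run_demo()` returns the per-step position estimates and their RMSE against the synthetic ground truth. Multiplying the two likelihood terms is one simple fusion choice; the fusion mechanism and composite model described in the abstract may be constructed differently.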

Pages: 975-985

Page count: 10

