Composite Model Particle Filter for Indoor Sound Source Location Based on Multi-feature

被引：0

作者：

Liu W. ^{[1
]}

Pan H. ^{[1
]}

Wang M. ^{[2
]}

机构：

[1] School ofInformation Science and Engineering, Zhejiang Sci-Tech University, Zhejiang, Hangzhou

[2] Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Ministry of Education, Zhejiang University of Technology, Zhejiang, Hangzhou

来源：

Binggong Xuebao/Acta Armamentarii | 2024年 / 45卷 / 03期

关键词：

composite model; indoor sound source localization; multi-feature; particle filter; time delay estimation;

D O I：

10.12382/bgxb.2022.0849

中图分类号：

学科分类号：

摘要：

A multi-feature-based composite model particle filter algorithm is proposed to improve the accuracy and robustness of sound source location in reverberation and noise environment. In this algorithm, the likelihood function of the particle filter is constructed based on the multiple features of signal received by a microphone, where the depth features of multiple hypothesis time-delay estimated image are extracted by convolutional neural network (CNN), and a time-delay estimation model based on support vector regression (SVR) is established. Furthermore, the deficiency that single feature can't suppress noise and reverberation simultaneously is remedied by introducing the beam output energy fusion mechanism. For the randomness of speaker motion, a composite model for sound source tracking is established to improve the robustness of speaker tracking system. The simulated and experimental results show that, based on the composite model, the position average root mean square error (RMSE) of multi-feature algorithm is reduced by more than 83% compared with that of steered response power and time delay estimation (SRPTDE) algorithm, and under multi-feature observation, the position average RMSE of composite model is reduced by more than 46% compared with that of Langevin model and the random walking model. The proposed algorithm realizes the effective tracking of random moving sound sources in complex environment. © 2024 China Ordnance Industry Corporation. All rights reserved.

引用

页码：975 / 985

页数：10

共 23 条

[21] LEHMANNEA, JOHANSSONA M., Prediction of energy decay in room impulse responses simulated with an image-source model [J], Journal of the Acoustical Society of America, 124, 1, pp. 269-277, (2008)
[22] TIAN Y, CHEN Z, YIN F L., Distributed Kalman filter-based speaker tracking in microphone array networks [J], Applied Acoustics, 89, 3, pp. 71-77, (2015)
[23] TRANSFELD P, MARTENS U, BINDER H, Et al., Acoustic event source localization for surveillance in reverberant environments supported by an event onset detection, Proceedings of IEEE International Conferenceon Acoustics, Speech and Signal Processing, pp. 2629-2633, (2015)

← 1 2 3 →