Composite Model Particle Filter for Indoor Sound Source Location Based on Multi-feature

被引:0
|
作者
Liu W. [1 ]
Pan H. [1 ]
Wang M. [2 ]
机构
[1] School ofInformation Science and Engineering, Zhejiang Sci-Tech University, Zhejiang, Hangzhou
[2] Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Ministry of Education, Zhejiang University of Technology, Zhejiang, Hangzhou
来源
Binggong Xuebao/Acta Armamentarii | 2024年 / 45卷 / 03期
关键词
composite model; indoor sound source localization; multi-feature; particle filter; time delay estimation;
D O I
10.12382/bgxb.2022.0849
中图分类号
学科分类号
摘要
A multi-feature-based composite model particle filter algorithm is proposed to improve the accuracy and robustness of sound source location in reverberation and noise environment. In this algorithm, the likelihood function of the particle filter is constructed based on the multiple features of signal received by a microphone, where the depth features of multiple hypothesis time-delay estimated image are extracted by convolutional neural network (CNN), and a time-delay estimation model based on support vector regression (SVR) is established. Furthermore, the deficiency that single feature can't suppress noise and reverberation simultaneously is remedied by introducing the beam output energy fusion mechanism. For the randomness of speaker motion, a composite model for sound source tracking is established to improve the robustness of speaker tracking system. The simulated and experimental results show that, based on the composite model, the position average root mean square error (RMSE) of multi-feature algorithm is reduced by more than 83% compared with that of steered response power and time delay estimation (SRPTDE) algorithm, and under multi-feature observation, the position average RMSE of composite model is reduced by more than 46% compared with that of Langevin model and the random walking model. The proposed algorithm realizes the effective tracking of random moving sound sources in complex environment. © 2024 China Ordnance Industry Corporation. All rights reserved.
引用
收藏
页码:975 / 985
页数:10
相关论文
共 23 条
  • [11] VERA-DIAZ J M, PIZARRO D, MACIAS-GUARASA J., Acoustic source localization with deep generalized cross correlations, Signal Processing, 187, 2, pp. 1-22, (2021)
  • [12] PERTILA P, PARVIAINEN M., Time difference of arrival estimation of speech signals using deep neural networks with integrated time-frequency masking, Proceedings of IEEE International Conferenceon Acoustics, Speech and Signal Processing, pp. 436-440, (2019)
  • [13] JOHANSSON A M, LEHMANN E A., Evolutionary optimization of dynamics models in sequential Monte Carlo target tracking, IEEE Transactions on Evolutionary Computation, 13, 4, pp. 879-894, (2009)
  • [14] HOU D W, YIN F L., An IMM particle filtering method for speaker tracking, Acta Electronica Sinica, 38, 4, pp. 835-841, (2010)
  • [15] WANG W X, FU Y T, DONG F, Et al., Infrared ship target detection method based on deep convolution neural network, Acta Optica Sinica, 38, 7, pp. 160-166, (2018)
  • [16] HAYKIN S., Neural networksand learning machines, (2009)
  • [17] CAI W P, WU Z Y., Robust acoustic source tracking algorithm based on particle filtering, Journal of Electronic Measurement and Instrument, 24, 5, pp. 407-413, (2010)
  • [18] BOORAR, DHULLS K., Iterative modified SRP-PHAT with adaptive search space for acoustic source localization[J], IETE Technical Review, 39, 1, pp. 28-36, (2022)
  • [19] CAO J, LI J, LI W, Et al., Speaker tracking based on adaptive finite-difference particle filteration, Journal of Lanzhou University of Technology, 38, 5, pp. 93-97, (2012)
  • [20] ARULAMPALAM M S, MASKELL S, GORDON N, Et al., A tutorial on particle filters for online nonlinear/non-Gaussian Bayesial tracking [J], IEEE Transactions Signal Processing, 50, 2, pp. 174-188, (2002)