An adaptive IMM particle filter algorithm based on multi-feature for sound source tracking in reverberant and noisy environments

被引:0
作者
Liu W.-S. [1 ]
Pan H.-P. [1 ]
Wang M.-H. [2 ]
机构
[1] School of Mechanical Engineering and Automation, Zhejiang Sci-Tech University, Zhejiang, Hangzhou
[2] Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Ministry of Education, Zhejiang University of Technology, Zhejiang, Hangzhou
来源
Kongzhi Lilun Yu Yingyong/Control Theory and Applications | 2023年 / 40卷 / 03期
基金
中国国家自然科学基金;
关键词
interacting multiple model; microphone array; multi-feature; particle filter; room reverberation; sound source localization;
D O I
10.7641/CTA.2022.10544
中图分类号
学科分类号
摘要
To deal with the problems of low accuracy and weak robustness of sound source localization in reverberant and noisy environments, an adaptive IMM particle filter algorithm based on the multi-feature is proposed. In this algorithm, the mechanisms of time delay selection and beam output energy optimization are established, by using the space-time correlation and iterative filtering, where the multi-features of the signals received by the microphones are exploited as the observation information. Subsequently, the reasonable sound source position information is obtained from the likelihood function, which is constructed on the basis of both. Meanwhile, considering the randomness of speaker motion, an adaptive IMM algorithm is given. By generating online particle set and interacting the models with different process variances, the speaker’s different motion modes are fitted, which improves the robustness of the speaker tracking system. The simulation and experimental results show that the complementarity of the location information based on the multi-feature is employed in the proposed algorithm, and the influence of the uncertainty of the observation error on sound source position estimation is reduced. Simultaneously, the robustness of random moving sound source tracking system is enhanced and the positioning accuracy of the system is improved. © 2023 South China University of Technology. All rights reserved.
引用
收藏
页码:477 / 484
页数:7
相关论文
共 15 条
  • [1] TROWITZSCH I, SCHYMURA C, KOLOSSA D, Et al., Joining sound event detection and localization through spatial segregation, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, pp. 487-502, (2020)
  • [2] POLITIS A, MESAROS A, ADAVANNE S, Et al., Overview and evaluation of sound event localization and detection in DCASE 2019, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, pp. 684-698, (2021)
  • [3] LI Baowei, ZHANG Xinggan, Improved microphone array sound source localization method based on generalized cross correlation, Journal of Nanjing University (Natural Sciences), 56, 6, pp. 917-922, (2020)
  • [4] LEE R, AN B, KANG M S, Et al., Sound source localization based on GCC–PHAT with diffuseness mask in noisy and reverberant environments, IEEE Access, Speech, and Language Processing, 8, pp. 7373-7382, (2020)
  • [5] MARXIM R B B, MOHANTY A R., Time delay estimation in reverberant and low SNR environment by EMD based maximum likelihood method, Measurement, 137, pp. 655-663, (2019)
  • [6] ZHAO Xiaoyan, CHEN Shuwen, ZHOU Lin, Sound source localization using SNR-based frequency weighting with microphone array, Journal of Signal Processing, 36, 3, pp. 449-456, (2020)
  • [7] BRENDEL A, ALTMANN I, KELLERMANN W., Acoustic source position estimation based on multi-feature Gaussian processes, European Signal Processing Conference, pp. 1-5, (2019)
  • [8] CHAKRABARTY S, HABETS E A P., Multi-speaker DOA estimation using deep convolutional networks trained with noise signals, IEEE Journal of Selected Topics in Signal Processing, 13, 1, pp. 8-21, (2019)
  • [9] JOHANSSON A M, LEHMANN E A., Evolutionary optimization of dynamics models in sequential Monte Carlo target tracking, IEEE Transactions on Evolutionary Computation, 13, 4, pp. 879-894, (2009)
  • [10] LEVY A, GANNOT S, HABETS E A P., Multiple-hypothesis extended particle filter for acoustic source localization in reverberant environments, IEEE Transactions on Audio, Speech, and Language Processing, 19, 6, pp. 1540-1555, (2011)