Real-time sound source localization using hybrid framework

被引:19
作者
Zhao, Yilu [1 ]
Chen, Xiong [1 ]
Wang, Bin [1 ]
机构
[1] Fudan Univ, Dept Elect Engn, Shanghai 200433, Peoples R China
关键词
Sound source localization; Microphone array; Hybrid framework; Steered response power; Generalized cross correlation; ROOM ACOUSTICS;
D O I
10.1016/j.apacoust.2013.04.010
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Real-time sound source localization with microphone array still remains as a difficult task. The steered response power-phase transform (SRP-PHAT) method is proved to be robust, but it suffers from its high computational cost. In this paper, we propose a hybrid framework for real-time sound source localization with microphone array. The main concept is using the results of generalized cross correlation (GCC) based time difference of arrival (TDOA) estimation to narrow down the search space of SRP-PHAT. A circular clustering algorithm is developed to interlink the above two parts and calculate the extracted search space. The proposed hybrid method maintains the high localization accuracy of traditional SRP-PHAT algorithm, and the computational cost is much reduced. Moreover, any existing improvement of SRP algorithm can be directly incorporated into the hybrid framework to further improve performance. Experiment and simulation results validate the efficiency and accuracy of the proposed method. (c) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1367 / 1373
页数:7
相关论文
共 18 条
  • [1] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS
    ALLEN, JB
    BERKLEY, DA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) : 943 - 950
  • [2] Badali A, 2009, IEEE RSJ INT C INT R
  • [3] Accelerated steered response power method for sound source localization using orthogonal linear array
    Cai, Weiping
    Wang, Shikui
    Wu, Zhenyang
    [J]. APPLIED ACOUSTICS, 2010, 71 (02) : 134 - 139
  • [4] A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling
    Cobos, Maximo
    Marti, Amparo
    Lopez, Jose J.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (01) : 71 - 74
  • [5] DiBiase J. H., 2000, A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays
  • [6] Do H., 2007, IEEE WORKSH APPL SIG
  • [7] Do H., 2007, IEEE INT C AC SPEECH
  • [8] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [9] GENERALIZED CORRELATION METHOD FOR ESTIMATION OF TIME-DELAY
    KNAPP, CH
    CARTER, GC
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (04): : 320 - 327
  • [10] Loesch B. Y. B., 2008, 11 INT WORKSH AC ECH