A Steered-Response Power Algorithm Employing Hierarchical Search for Acoustic Source Localization Using Microphone Arrays

Cited by: 57
Authors
Nunes, Leonardo O. [1]
Martins, Wallace A. [1]
Lima, Markus V. S. [1]
Biscainho, Luiz W. P. [1]
Costa, Mauricio V. M. [1]
Goncalves, Felipe M. [1]
Said, Amir [2]
Lee, Bowon [2]
Affiliations
[1] Univ Fed Rio de Janeiro, Signal Multimedia & Telecommun Lab DEL Poli & PEE, BR-21941972 Rio De Janeiro, RJ, Brazil
[2] Hewlett Packard Labs, Palo Alto, CA 94304 USA
Keywords
Sound source localization; steered-response power; microphone array; computational complexity; hierarchical search; branch-and-bound; TIME
DOI
10.1109/TSP.2014.2336636
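Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract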
The localization of a speaker inside a closed environment is often approached by real-time processing of multiple audio signals captured by a set of microphones. One of the leading related methods for sound source localization, the steered-response power (SRP), searches for the point of maximum power over a spatial grid. High-accuracy localization calls for a dense grid and/or many microphones, which tends to impractically increase computational requirements. This paper proposes a new method for sound source localization (called H-SRP), which applies the SRP approach to space regions instead of grid points. This arrangement makes room for the use of a hierarchical search inspired by the branch-and-bound paradigm, which is guaranteed to find the global maximum in anechoic environments and shown experimentally to also work under reverberant conditions. Besides benefiting from the improved robustness of volume-wise search over point-wise search as to reverberation effects, the H-SRP attains high performance with manageable complexity. In particular, an experiment using a 16-microphone array in a typical presentation room yielded localization errors of the order of 7 cm, and for a given fixed complexity, competing methods' errors are two to three times larger.
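For orientation, the conventional point-wise SRP search that the abstract takes as its baseline can be sketched as follows. This is a minimal illustration, not the H-SRP method proposed in the paper: it assumes a 2-D anechoic toy room, four microphones at the corners, a white-noise source, PHAT weighting, and a 10 cm grid. All names (`simulate_spectra`, `srp_phat`, `locate`) and all constants are hypothetical choices made for this example.

```python
import numpy as np

C = 343.0    # speed of sound in m/s (assumed)
FS = 16000   # sampling rate in Hz (assumed)

def simulate_spectra(mics, src, n=512, seed=0):
    """Toy anechoic model: every microphone sees the same white-noise
    spectrum, delayed by its propagation time from the source."""
    rng = np.random.default_rng(seed)
    S = np.fft.rfft(rng.standard_normal(n))
    f = np.fft.rfftfreq(n, d=1.0 / FS)
    delays = np.linalg.norm(mics - src, axis=1) / C            # shape (M,)
    X = S[None, :] * np.exp(-2j * np.pi * f[None, :] * delays[:, None])
    return X, f

def srp_phat(X, f, mics, points):
    """Steered-response power with PHAT weighting at each candidate point."""
    Xw = X / (np.abs(X) + 1e-12)                               # keep phase only
    # Propagation time from every candidate point to every microphone: (P, M)
    tau = np.linalg.norm(points[:, None, :] - mics[None, :, :], axis=2) / C
    # Steering phases that undo the hypothesised delays: (P, M, F)
    steer = np.exp(2j * np.pi * f[None, None, :] * tau[:, :, None])
    beam = np.sum(Xw[None, :, :] * steer, axis=1)              # sum over mics
    return np.sum(np.abs(beam) ** 2, axis=1)                   # sum over bins

def locate(X, f, mics, xs, ys):
    """Exhaustive point-wise search: return the grid point of maximum power."""
    gx, gy = np.meshgrid(xs, ys, indexing="ij")
    points = np.column_stack([gx.ravel(), gy.ravel()])
    power = srp_phat(X, f, mics, points)
    return points[np.argmax(power)]

mics = np.array([[0.0, 0.0], [4.0, 0.0], [4.0, 3.0], [0.0, 3.0]])
src = np.array([1.3, 2.1])
X, f = simulate_spectra(mics, src)
xs = np.linspace(0.0, 4.0, 41)   # 10 cm grid in x
ys = np.linspace(0.0, 3.0, 31)   # 10 cm grid in y
est = locate(X, f, mics, xs, ys)
print(est)
```

The cost of this exhaustive search grows with the number of grid points times the number of microphones times the number of frequency bins, which is the scaling problem the abstract says H-SRP addresses by evaluating regions hierarchically instead of individual points.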
Pages: 5171-5183
Page count: 13