Multi-Speaker Direction of Arrival Estimation using SRP-PHAT Algorithm with a Weighted Histogram

被引:0
|
作者
Hadad, Elior [1 ]
Gannot, Sharon [1 ]
机构
[1] Bar Ilan Univ, Fac Engn, IL-5290002 Ramat Gan, Israel
关键词
LOCALIZATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A direction of arrival (BOA) estimator for concurrent speakers in a reverberant environment is presented. The DOA estimation task is formulated in the short-time Fourier transform (STFT) in two stages. In the first stage, a single narrow-band BOA per time-frequency (T-F) is selected, since the speech sources are assumed to exhibit disjoint activity in the STFT domain. The narrow-band DOA is obtained as the maximum of the narrow-band steered response power phase transform (SRP-PHAT) localization spectrum at that T-F bin. In addition, for each narrow-hand DOA, a quality measure is calculated, which provides the confidence in the estimated decision. In the second stage, the wide-band localization spectrum is calculated using a weighted histogram of the narrow-band DOAs with the quality measures as weight. Finally, the wide band DOA estimation is obtained by selecting the peaks in the wide-band localization spectrum. The results of our experimental study demonstrate the benefit of the proposed algorithm as compared to the wide-band SRP-PHAT algorithm in a reverberant environment.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] MULTI-SPEAKER DOA TRACKING ALGORITHM UTILIZING PROBABILITY HYPOTHESIS DENSITY FILTER AND WEIGHTED HISTOGRAM OF SRP-PHAT
    Soussana, Yosef
    Hadad, Elior
    Gannot, Sharon
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 334 - 338
  • [2] MAXIMUM LIKELIHOOD MULTI-SPEAKER DIRECTION OF ARRIVAL ESTIMATION UTILIZING A WEIGHTED HISTOGRAM
    Hadad, Elior
    Gannot, Sharon
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 586 - 590
  • [3] Direction of Arrival Estimation with Microphone Arrays Using SRP-PHAT and Neural Networks
    Diaz-Guerra, David
    Beltran, Jose R.
    2018 IEEE 10TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2018, : 617 - 621
  • [4] Acoustic direction of arrival estimation, a comparison between root-music and SRP-PHAT
    Johansson, A
    Cook, G
    Nordholm, S
    TENCON 2004 - 2004 IEEE REGION 10 CONFERENCE, VOLS A-D, PROCEEDINGS: ANALOG AND DIGITAL TECHNIQUES IN ELECTRICAL ENGINEERING, 2004, : B629 - B632
  • [5] Equipment Fault Acoustic Source Direction of Arrival Estimation with Microphone Arrays Using SRP-PHAT Method
    Chen, Jingde
    Shen, Xiaofeng
    Lu, Minan
    Wu, Jijian
    Zhou, Nan
    Luo, Lingen
    2020 5TH ASIA CONFERENCE ON POWER AND ELECTRICAL ENGINEERING (ACPEE 2020), 2020, : 1388 - 1392
  • [6] Sound localization method using modified SRP-PHAT algorithm
    School of Electronic and Information Engineering, Dalian University of Technology, Dalian 116024, China
    Dianzi Yu Xinxi Xuebao, 2006, 7 (1223-1227):
  • [7] NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain
    Sushmita Thakallapalli
    Suryakanth V. Gangashetty
    Nilesh Madhu
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [8] NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain
    Thakallapalli, Sushmita
    Gangashetty, Suryakanth V.
    Madhu, Nilesh
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [9] Robust acoustic direction of arrival estimation using Root-SRP-PHAT, a realtime implementation
    Johansson, A
    Nordholm, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 933 - 936
  • [10] Accelerating the SRP-PHAT algorithm on multi- and many-core platforms using OpenCL
    Jose M. Badía
    Jose A. Belloch
    Maximo Cobos
    Francisco D. Igual
    Enrique S. Quintana-Ortí
    The Journal of Supercomputing, 2019, 75 : 1284 - 1297