Multi-Speaker Direction of Arrival Estimation using SRP-PHAT Algorithm with a Weighted Histogram

被引:0
|
作者
Hadad, Elior [1 ]
Gannot, Sharon [1 ]
机构
[1] Bar Ilan Univ, Fac Engn, IL-5290002 Ramat Gan, Israel
关键词
LOCALIZATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A direction of arrival (BOA) estimator for concurrent speakers in a reverberant environment is presented. The DOA estimation task is formulated in the short-time Fourier transform (STFT) in two stages. In the first stage, a single narrow-band BOA per time-frequency (T-F) is selected, since the speech sources are assumed to exhibit disjoint activity in the STFT domain. The narrow-band DOA is obtained as the maximum of the narrow-band steered response power phase transform (SRP-PHAT) localization spectrum at that T-F bin. In addition, for each narrow-hand DOA, a quality measure is calculated, which provides the confidence in the estimated decision. In the second stage, the wide-band localization spectrum is calculated using a weighted histogram of the narrow-band DOAs with the quality measures as weight. Finally, the wide band DOA estimation is obtained by selecting the peaks in the wide-band localization spectrum. The results of our experimental study demonstrate the benefit of the proposed algorithm as compared to the wide-band SRP-PHAT algorithm in a reverberant environment.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] MULTI-STAGE REJECTION SAMPLING (MSRS): A ROBUST SRP-PHAT PEAK DETECTION ALGORITHM FOR LOCALIZATION OF COCKTAIL-PARTY TALKERS
    Khanal, Sarthak
    Silverman, Harvey F.
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [22] Multi-speaker DoA Estimation Using Audio and Visual Modality
    Wu, Yulin
    Hu, Ruimin
    Wang, Xiaochen
    Ke, Shanfa
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 8887 - 8901
  • [23] Weighted maximum likelihood direction of arrival estimation algorithm and its application research
    Zhang, Peng
    Bao, Ming
    Feng, Dahang
    Yang, Jun
    Li, Xiaodong
    Shengxue Xuebao/Acta Acustica, 2010, 35 (02): : 235 - 240
  • [24] Speaker-Attributed Training for Multi-Speaker Speech Recognition Using Multi-Stage Encoders and Attention-Weighted Speaker Embedding
    Kim, Minsoo
    Jang, Gil-Jin
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [25] MULTI-SPEAKER DOA ESTIMATION IN REVERBERATION CONDITIONS USING EXPECTATION-MAXIMIZATION
    Schwartz, Ofer
    Dorfan, Yuval
    Habets, Emanuel A. P.
    Gannot, Sharon
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [26] Estimation of direction of arrival using weighted subspace fitting for wireless communications
    Kim, SC
    Song, I
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2005, E88B (09) : 3717 - 3724
  • [27] DIRECTION-OF-ARRIVAL ESTIMATION AND DETECTION USING WEIGHTED SUBSPACE FITTING
    VIBERG, M
    OTTERSTEN, B
    KAILATH, T
    TWENTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2: CONFERENCE RECORD, 1989, : 604 - 608
  • [28] Estimation of direction of arrival using weighted subspace fitting for wireless communications
    Kim, SC
    Kwon, H
    Song, I
    Kim, HG
    Kim, YH
    MILCOM 2000: 21ST CENTURY MILITARY COMMUNICATIONS CONFERENCE PROCEEDINGS, VOLS 1 AND 2: ARCHITECTURES & TECHNOLOGIES FOR INFORMATION SUPERIORITY, 2000, : 831 - 835
  • [29] A fast algorithm for direction of arrival estimation in multi-path environments
    Tayem, Nizar
    Naraghi-Pour, Mort
    WIRELESS SENSING AND PROCESSING II, 2007, 6577
  • [30] Direction of arrival estimation using super-resolution algorithm
    Xu, Y
    Feng, WF
    Hao, JY
    Hwang, HK
    OCEANS 2001 MTS/IEEE: AN OCEAN ODYSSEY, VOLS 1-4, CONFERENCE PROCEEDINGS, 2001, : 749 - 755