Acoustic source detection and localization using generalized likelihood ratio test in the spherical harmonic domain

被引:0
作者
Oulahcine, Dhiya Eddine Rabia [1 ]
Benssalah, Mustapha [1 ]
Haddad, Nabil [2 ]
Salvati, Daniele [3 ]
Mahfoudia, Osama [4 ]
机构
[1] Ecole Mil Polytech, Lab Traitement Signal, BP 17 Bordj El Bahri, Algiers 16046, Algeria
[2] Ecole Mil Polytech, Lab Antennes & Disposit Microondes, BP 17 Bordj El Bahri, Algiers 16046, Algeria
[3] Univ Udine, Dept Math Comp Sci & Phys, I-33100 Udine, Italy
[4] Ecole Mil Polytech, Lab RADAR, BP 17 Bordj El Bahri, Algiers 16046, Algeria
关键词
Acoustic source localization; Acoustic source detection; Spherical microphone array; Spherical harmonic domain; Generalized likelihood ratio test; OF-ARRIVAL ESTIMATION; SOUND EVENT LOCALIZATION; MICROPHONE ARRAY; NEURAL-NETWORKS;
D O I
10.1016/j.apacoust.2024.110434
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In recent years, spherical microphone arrays have gained significant attention for analyzing acoustic signals due to their consistent spatial resolution in both elevation and azimuth, added to the ability to transform signals into the spherical harmonics domain. This paper deals with the localization of active sound sources in the presence of noisy and silent recordings. In the literature, the aspects of detection and localization of acoustic signals are often addressed separately. However, the present paper proposes the spherical harmonic generalized likelihood ratio test (SH-GLRT) algorithm which concurrently addresses both aspects while maintaining a fixed false-alarm probability (Pfa). In this context, the acoustic source localization is considered as a detection problem, and the necessary mathematical development is provided. Precisely, the resulting detection thresholds for four spherical harmonics orders are considered, and the detection performance is assessed and compared to the state-of-the-art methods. In addition, the LOCATA challenge dataset is employed for validation, which emphasizes the efficiency of the proposed method.
引用
收藏
页数:12
相关论文
共 52 条
  • [1] Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
    Adavanne, Sharath
    Politis, Archontis
    Nikunen, Joonas
    Virtanen, Tuomas
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 34 - 48
  • [2] GLRT Based on Support Estimation for Multiple Scatterers Detection in SAR Tomography
    Budillon, Alessandra
    Schirinzi, Gilda
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (03) : 1086 - 1094
  • [3] HIGH-RESOLUTION FREQUENCY-WAVENUMBER SPECTRUM ANALYSIS
    CAPON, J
    [J]. PROCEEDINGS OF THE IEEE, 1969, 57 (08) : 1408 - &
  • [4] Environmental Sound Recognition With Time-Frequency Audio Features
    Chu, Selina
    Narayanan, Shrikanth
    Kuo, C. -C. Jay
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06): : 1142 - 1158
  • [5] Cobos M, 2023, ICASSP 2023, P1
  • [6] A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling
    Cobos, Maximo
    Marti, Amparo
    Lopez, Jose J.
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (01) : 71 - 74
  • [7] Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement
    Coteli, Mert Burkay
    Olgun, Orhun
    Hacihabiboglu, Huseyin
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (11) : 2215 - 2229
  • [8] Audio Surveillance: A Systematic Review
    Crocco, Marco
    Cristani, Marco
    Trucco, Andrea
    Murino, Vittorio
    [J]. ACM COMPUTING SURVEYS, 2016, 48 (04)
  • [9] Octant Spherical Harmonics Features for Source Localization Using Artificial Intelligence Based on Unified Learning Framework
    Dwivedi P.
    Routray G.
    Hegde R.M.
    [J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (08): : 3845 - 3857
  • [10] A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation
    Gannot, Sharon
    Vincent, Emmanuel
    Markovich-Golan, Shmulik
    Ozerov, Alexey
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 692 - 730