Acoustic source detection and localization using generalized likelihood ratio test in the spherical harmonic domain

被引：0

作者：

Oulahcine, Dhiya Eddine Rabia ^{[1
]}

Benssalah, Mustapha ^{[1
]}

Haddad, Nabil ^{[2
]}

Salvati, Daniele ^{[3
]}

Mahfoudia, Osama ^{[4
]}

机构：

[1] Ecole Mil Polytech, Lab Traitement Signal, BP 17 Bordj El Bahri, Algiers 16046, Algeria

[2] Ecole Mil Polytech, Lab Antennes & Disposit Microondes, BP 17 Bordj El Bahri, Algiers 16046, Algeria

[3] Univ Udine, Dept Math Comp Sci & Phys, I-33100 Udine, Italy

[4] Ecole Mil Polytech, Lab RADAR, BP 17 Bordj El Bahri, Algiers 16046, Algeria

来源：

APPLIED ACOUSTICS | 2025年 / 231卷

关键词：

Acoustic source localization; Acoustic source detection; Spherical microphone array; Spherical harmonic domain; Generalized likelihood ratio test; OF-ARRIVAL ESTIMATION; SOUND EVENT LOCALIZATION; MICROPHONE ARRAY; NEURAL-NETWORKS;

D O I：

10.1016/j.apacoust.2024.110434

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In recent years, spherical microphone arrays have gained significant attention for analyzing acoustic signals due to their consistent spatial resolution in both elevation and azimuth, added to the ability to transform signals into the spherical harmonics domain. This paper deals with the localization of active sound sources in the presence of noisy and silent recordings. In the literature, the aspects of detection and localization of acoustic signals are often addressed separately. However, the present paper proposes the spherical harmonic generalized likelihood ratio test (SH-GLRT) algorithm which concurrently addresses both aspects while maintaining a fixed false-alarm probability (Pfa). In this context, the acoustic source localization is considered as a detection problem, and the necessary mathematical development is provided. Precisely, the resulting detection thresholds for four spherical harmonics orders are considered, and the detection performance is assessed and compared to the state-of-the-art methods. In addition, the LOCATA challenge dataset is employed for validation, which emphasizes the efficiency of the proposed method.

引用

页数：12

共 52 条

[1] Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
Adavanne, Sharath
Politis, Archontis
Nikunen, Joonas
Virtanen, Tuomas
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) : 34 - 48
[2] GLRT Based on Support Estimation for Multiple Scatterers Detection in SAR Tomography
Budillon, Alessandra
Schirinzi, Gilda
[J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2016, 9 (03) : 1086 - 1094
[3] HIGH-RESOLUTION FREQUENCY-WAVENUMBER SPECTRUM ANALYSIS
CAPON, J
[J]. PROCEEDINGS OF THE IEEE, 1969, 57 (08) : 1408 - &
[4] Environmental Sound Recognition With Time-Frequency Audio Features
Chu, Selina
Narayanan, Shrikanth
Kuo, C. -C. Jay
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06): : 1142 - 1158
[5] Cobos M, 2023, ICASSP 2023, P1
[6] A Modified SRP-PHAT Functional for Robust Real-Time Sound Source Localization With Scalable Spatial Sampling
Cobos, Maximo
Marti, Amparo
Lopez, Jose J.
[J]. IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (01) : 71 - 74
[7] Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement
Coteli, Mert Burkay
Olgun, Orhun
Hacihabiboglu, Huseyin
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (11) : 2215 - 2229
[8] Audio Surveillance: A Systematic Review
Crocco, Marco
Cristani, Marco
Trucco, Andrea
Murino, Vittorio
[J]. ACM COMPUTING SURVEYS, 2016, 48 (04)
[9] Octant Spherical Harmonics Features for Source Localization Using Artificial Intelligence Based on Unified Learning Framework
Dwivedi P.
Routray G.
Hegde R.M.
[J]. IEEE Transactions on Artificial Intelligence, 2024, 5 (08): : 3845 - 3857
[10] A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation
Gannot, Sharon
Vincent, Emmanuel
Markovich-Golan, Shmulik
Ozerov, Alexey
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 692 - 730

← 1 2 3 4 5 6 →