Evaluating machine learning architectures for sound event detection for signals with variable signal-to-noise-ratios in the Beaufort Sea

被引:0
作者
Ibrahim, Malek [1 ]
Sagers, Jason D. [2 ]
Ballard, Megan S. [2 ]
Le, Minh [2 ]
Koutsomitopoulos, Vasilis [3 ]
机构
[1] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
[2] Univ Texas Austin, Appl Res Labs, Austin, TX 78758 USA
[3] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
关键词
NEURAL-NETWORKS; YEARLONG RECORD; CANADA BASIN; CLASSIFICATION; SHELF; LOCALIZATION; PROPAGATION; MODELS; SPEED; ICE;
D O I
10.1121/10.0021974
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper explores the challenging polyphonic sound event detection problem using machine learning architectures applied to data recorded in the Beaufort Sea during the Canada Basin Acoustic Propagation Experiment. Four candidate architectures were investigated and evaluated on nine classes of signals broadcast from moored sources that were recorded on a vertical line array of hydrophones over the course of the yearlong experiment. These signals represent a high degree of variability with respect to time-frequency characteristics, changes in signal-to-noise ratio (SNR) associated with varying signal levels as well as fluctuating ambient sound levels, and variable distributions, which resulted in class imbalances. Within this context, binary relevance, which decomposes the multi-label learning task into a number of independent binary learning tasks, was examined as an alternative to the conventional multi-label classification (MLC) approach. Binary relevance has several advantages, including flexible, lightweight model configurations that support faster model inference. In the experiments presented, binary relevance outperformed conventional MLC approach on classes with the most imbalance and lowest SNR. A deeper investigation of model performance as a function of SNR showed that binary relevance significantly improved recall within the low SNR range for all classes studied. (c) 2023 Acoustical Society of America.
引用
收藏
页码:2689 / 2707
页数:19
相关论文
共 86 条
[1]   A Review of Deep Learning Based Methods for Acoustic Scene Classification [J].
Abesser, Jakob .
APPLIED SCIENCES-BASEL, 2020, 10 (06)
[2]   Machine learning based approach for the interpretation of engineering geophysical sounding logs [J].
Abordan, Armand ;
Szabo, Norbert Peter .
ACTA GEODAETICA ET GEOPHYSICA, 2021, 56 (04) :681-696
[3]   Automated classification of bird and amphibian calls using machine learning: A comparison of methods [J].
Acevedo, Miguel A. ;
Corrada-Bravo, Carlos J. ;
Corrada-Bravo, Hector ;
Villanueva-Rivera, Luis J. ;
Aide, T. Mitchell .
ECOLOGICAL INFORMATICS, 2009, 4 (04) :206-214
[4]   Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks [J].
Adavanne, Sharath ;
Politis, Archontis ;
Nikunen, Joonas ;
Virtanen, Tuomas .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (01) :34-48
[5]  
Adavanne S, 2017, INT CONF ACOUST SPEE, P771, DOI 10.1109/ICASSP.2017.7952260
[6]   Real-time bioacoustics monitoring and automated species identification [J].
Aide, T. Mitchell ;
Corrada-Bravo, Carlos ;
Campos-Cerqueira, Marconi ;
Milan, Carlos ;
Vega, Giovany ;
Alvarez, Rafael .
PEERJ, 2013, 1
[7]   A Deep-Learning Model for Subject-Independent Human Emotion Recognition Using Electrodermal Activity Sensors [J].
Al Machot, Fadi ;
Elmachot, Ali ;
Ali, Mouhannad ;
Al Machot, Elyan ;
Kyamakya, Kyandoghere .
SENSORS, 2019, 19 (07)
[8]   A Framework for Designing the Architectures of Deep Convolutional Neural Networks [J].
Albelwi, Saleh ;
Mahmood, Ausif .
ENTROPY, 2017, 19 (06)
[9]   WASN-Based Day-Night Characterization of Urban Anomalous Noise Events in Narrow and Wide Streets [J].
Alias, Francesc ;
Claudi Socoro, Joan ;
Alsina-Pages, Rosa Ma .
SENSORS, 2020, 20 (17) :1-26
[10]   Azimuthal and temporal sound fluctuations on the Chukchi continental shelf during the Canada Basin Acoustic Propagation Experiment 2017 [J].
Badiey, Mohsen ;
Wan, Lin ;
Pecknold, Sean ;
Turgut, Altan .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (06) :EL530-EL536