Data-Centric Methods for Environmental Sound Classification With Limited Labels

被引:0
作者
Syed, Ali Raza [1 ]
Coban, Enis Berk [1 ]
Pir, Dara [2 ]
Mandel, Michael [3 ,4 ]
机构
[1] CUNY, Grad Ctr, Dept Comp Sci, New York, NY 10016 USA
[2] CUNY, Guttman Community Coll, Informat Technol Program, New York, NY 10018 USA
[3] Brooklyn Coll, Dept Comp & Informat Sci, Brooklyn, NY 11210 USA
[4] Grad Ctr, Dept Comp Sci, New York, NY 10016 USA
基金
美国国家科学基金会;
关键词
Training; Data models; Cost accounting; Biological system modeling; Data augmentation; Speech processing; Machine learning; Ecoacoustics; environmental sound classification; data-centric machine learning; data augmentation; data valuation; Shapley values; limited labels; data curation;
D O I
10.1109/TASLP.2024.3414332
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Arctic boreal forests are warming at a rate 2-3 times faster than the global average. It is important to understand the effects of this warming on activities of animals that migrate to and within these environments annually to reproduce. Acoustic sensors can monitor a wide area relatively cheaply, producing large amounts of data. Yet, only a small proportion of the recorded data can be labeled by hand making it challenging to train high performing sound classifiers for ecoacoustic research. In this work, we explore data-centric methods for improving model performance by utilizing labels more efficiently. We show that indeed data augmentation for a DNN-based multi-label sound classifier yields a relative improvement (37%) in AUC performance. We are able to boost this further by 56% with a novel data valuation method. Our method estimates Shapley values for a multi-label DNN classifier enabling curation of a high quality training set and identification of data quality issues. We demonstrate that with our novel method, we can achieve these gains using as little as 40% of the labeled training data.
引用
收藏
页码:4288 / 4297
页数:10
相关论文
共 52 条
  • [1] MEAN SQUARE ERROR OF PREDICTION AS A CRITERION FOR SELECTING VARIABLES
    ALLEN, DM
    [J]. TECHNOMETRICS, 1971, 13 (03) : 469 - &
  • [2] A review of instance selection methods
    Arturo Olvera-Lopez, J.
    Ariel Carrasco-Ochoa, J.
    Francisco Martinez-Trinidad, J.
    Kittler, Josef
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2010, 34 (02) : 133 - 143
  • [3] Deep Representation Learning for Orca Call Type Classification
    Bergler, Christian
    Schmitt, Manuel
    Cheng, Rachael Xi
    Schroeter, Hendrik
    Maier, Andreas
    Barth, Volker
    Weber, Michael
    Noeth, Elmar
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2019), 2019, 11697 : 274 - 286
  • [4] Cartwright Mark, 2017, Proceedings of the ACM on Human-Computer Interaction, V1, DOI 10.1145/3134664
  • [5] Polynomial calculation of the Shapley value based on sampling
    Castro, Javier
    Gomez, Daniel
    Tejada, Juan
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (05) : 1726 - 1730
  • [6] Choi K, 2018, INT SOC MUSIC INF RE, P805
  • [7] Applications for deep learning in ecology
    Christin, Sylvain
    Hervet, Eric
    Lecomte, Nicolas
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2019, 10 (10): : 1632 - 1644
  • [8] Coban E. B., 2022, P WORKSH DET CLASS A, P16
  • [9] TOWARDS LARGE SCALE ECOACOUSTIC MONITORING WITH SMALL AMOUNTS OF LABELED DATA
    Coban, Enis Berk
    Syed, Ali Raza
    Pir, Dara
    Mandel, Michael, I
    [J]. 2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 181 - 185
  • [10] Analysis of label noise in graph-based semi-supervised learning
    de Aquino Afonso, Bruno Klaus
    Berton, Lilian
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 1127 - 1134