Leveraging transfer learning and active learning for data annotation in passive acoustic monitoring of wildlife

被引:12
作者
Kath, Hannes [1 ,2 ]
Serafini, Patricia P. [3 ,4 ]
Campos, Ivan B. [3 ,5 ]
Gouvea, Thiago S. [1 ,2 ]
Sonntag, Daniel [1 ,2 ]
机构
[1] German Res Ctr Artificial Intelligence DFKI, Interact Machine Leraning, Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Appl Artificial Intelligence, Oldenburg, Germany
[3] Chico Mendes Inst Biodivers Conservat ICMBio, Natl Ctr Wild Bird Conservat & Res CEMAVE, Brasilia, Brazil
[4] Univ Fed Santa Catarina UFSC, Florianopolis, Brazil
[5] Univ Fed Minas Gerais UFMG, Dept Biol Geral, Belo Horizonte, Brazil
关键词
Active learning; Transfer learning; Passive acoustic monitoring; BirdNet; CLASSIFICATION; INDEXES;
D O I
10.1016/j.ecoinf.2024.102710
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Passive Acoustic Monitoring (PAM) has emerged as a pivotal technology for wildlife monitoring, generating vast amounts of acoustic data. However, the successful application of machine learning methods for sound event detection in PAM datasets heavily relies on the availability of annotated data, which can be laborious to acquire. In this study, we investigate the effectiveness of transfer learning and active learning techniques to address the data annotation challenge in PAM. Transfer learning allows us to use pre-trained models from related tasks or datasets to bootstrap the learning process for sound event detection. Furthermore, active learning promises strategic selection of the most informative samples for annotation, effectively reducing the annotation cost and improving model performance. We evaluate an approach that combines transfer learning and active learning to efficiently exploit existing annotated data and optimize the annotation process for PAM datasets. Our transfer learning observations show that embeddings produced by BirdNet, a model trained on high signal-to-noise recordings of bird vocalisations, can be effectively used for predicting anurans in PAM data: a linear classifier constructed using these embeddings outperforms the benchmark by 21.7%. Our results indicate that active learning is superior to random sampling, although no clear winner emerges among the strategies employed. The proposed method holds promise for facilitating broader adoption of machine learning techniques in PAM and advancing our understanding of biodiversity dynamics through acoustic data analysis.
引用
收藏
页数:9
相关论文
共 45 条
[1]   A Convolutional Neural Network for Automated Detection of Humpback Whale Song in a Diverse, Long-Term Passive Acoustic Dataset [J].
Allen, Ann N. ;
Harvey, Matt ;
Harrell, Lauren ;
Jansen, Aren ;
Merkens, Karlina P. ;
Wall, Carrie C. ;
Cattiau, Julie ;
Oleson, Erin M. .
FRONTIERS IN MARINE SCIENCE, 2021, 08
[2]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[3]   Poor performance of acoustic indices as proxies for bird diversity in a fragmented Amazonian landscape [J].
Bicudo, Thiago ;
Llusia, Diego ;
Anciaes, Marina ;
Gil, Diego .
ECOLOGICAL INFORMATICS, 2023, 77
[4]   Assessing the potential of acoustic indices for protected area monitoring in the Serra do Cipo National Park, Brazil [J].
Campos, Ivan Braga ;
Fewster, Rachel ;
Truskinger, Anthony ;
Towsey, Michael ;
Roe, Paul ;
Vasques Filho, Demival ;
Lee, William ;
Gaskett, Anne .
ECOLOGICAL INDICATORS, 2021, 120
[5]   A dataset for benchmarking Neotropical anuran calls identification in passive acoustic monitoring [J].
Canas, Juan Sebastian ;
Toro-Gomez, Maria Paula ;
Sugai, Larissa Sayuri Moreira ;
Restrepo, Hernan Dario Benitez ;
Rudas, Jorge ;
Bautista, Breyner Posso ;
Toledo, Luis Felipe ;
Dena, Simone ;
Domingos, Adao Henrique Rosa ;
de Souza, Franco Leandro ;
Neckel-Oliveira, Selvino ;
da Rosa, Anderson ;
Carvalho-Rocha, Vitor ;
Bernardy, Jose Vinicius ;
Sugai, Jose Luiz Massao Moreira ;
dos Santos, Carolina Emilia ;
Bastos, Rogerio Pereira ;
Llusia, Diego ;
Ulloa, Juan Sebastian .
SCIENTIFIC DATA, 2023, 10 (01)
[6]  
Çoban EB, 2020, INT CONF ACOUST SPEE, P726, DOI [10.1109/ICASSP40776.2020.9053338, 10.1109/icassp40776.2020.9053338]
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]   Passive acoustic monitoring of animal populations with transfer learning [J].
Dufourq, Emmanuel ;
Batist, Carly ;
Foquet, Ruben ;
Durbach, Ian .
ECOLOGICAL INFORMATICS, 2022, 70
[9]   Detection and identification of European woodpeckers with deep convolutional neural networks [J].
Florentin, Juliette ;
Dutoit, Thierry ;
Verlinden, Olivier .
ECOLOGICAL INFORMATICS, 2020, 55
[10]  
Howard AG, 2017, Arxiv, DOI [arXiv:1704.04861, 10.48550/arXiv.1704.04861]