Bird Whisperer: Leveraging Large Pre-trained Acoustic Model for Bird Call Classification

被引:0
作者
Sheikh, Muhammad Umer [1 ]
Abid, Hassan [1 ]
Shafique, Bhuiyan Sanjid [1 ]
Hanif, Asif [1 ]
Haris, Muhammad [1 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
来源
INTERSPEECH 2024 | 2024年
关键词
bird call classification; birdclef-2023; fine-tuning; whisper; FUSION;
D O I
10.21437/Interspeech.2024-1623
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adapting large pre-trained acoustic models across diverse domains poses a significant challenge in speech processing, particularly when shifting from human to non-human contexts. This study aims to bridge this gap by utilizing the pre-trained Whisper model, initially intended for human speech recognition, for classifying bird calls. Our study reveals that when employed solely as a feature extractor, the Whisper encoder fails to yield meaningful features from bird calls, possibly due to categorizing them as background noise. We propose a simple but effective technique to enhance Whisper's ability to extract distinctive features from avian vocalizations, resulting in a remarkable 15% increase in F1-score over the baseline. Furthermore, we mitigate the issue of class imbalance within the dataset by introducing a series of data augmentations. Our findings underscore the potential of adapting large pre-trained acoustic models to tackle broader bioacoustic classification tasks. The code is available at https://github.com/umer-sheikh/bird-whisperer
引用
收藏
页码:5028 / 5032
页数:5
相关论文
共 16 条
[1]  
Ansar W., 2024, SN Computer Science, V5, P265
[2]  
Baevski A, 2021, ADV NEUR IN
[3]  
"Birdclef, 2023, competition
[4]   Using birds as biodindicators of forest restoration progress: A preliminary study [J].
Chowfin, S. M. ;
Leslie, A. J. .
TREES FORESTS AND PEOPLE, 2021, 3
[5]  
Gregory RD, 2004, TECH ECOL CONSERVAT, P17
[6]  
Radford A., 2023, PMLR, V202, p28 492
[7]  
Tabur M. A., 2010, 2 INT S SUSTAINABL, P560
[8]  
TAN MX, 2019, PMLR, P6105, DOI DOI 10.48550/ARXIV.1905.11946
[9]  
U.S. Geological Survey, 2024, "Birds as indicators of ecosystem health
[10]  
Vaswani A, 2017, ADV NEUR IN, V30