Deep Learning-based Automatic Bird Species Identification from Isolated Recordings

被引:9
作者
Noumida, A. [1 ]
Rajan, Rajeev [1 ]
机构
[1] Coll Engn, Dept Elect & Commun Engg, Trivandrum, Kerala, India
来源
2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC) | 2021年
关键词
single-label; transfer learning; convolutional neural network; RECOGNITION;
D O I
10.1109/ICSCC51209.2021.9528234
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Birds play an extremely important role in an ecosystem, identifying bird species in audio recordings is challenging and has high research value. This paper aims to develop an effective bird call classification for isolated recordings (singlelabel) approach using various deep learning architectures, namely convolutional neural networks (CNN), deep neural networks (DNN), and transfer learning schemes. Transfer learning models have been widely used in a variety of deep learning applications. The performance of transfer learning models such as ResNet50, VGG-16, and InceptionResNetV2 has been compared to the acoustic MFCC-DNN methodology. On the Xeno-canto (XC) online bird audio dataset, the presented methods are tested. The dataset comprises ten species with 1078 audio tracks. The classification accuracies of 96.3%, 93.7%, and 91.9% are reported for ResNet50, CNN, and VGG-16, respectively, and outperform with the acoustic signal-based MFCC-DNN methodology.
引用
收藏
页码:252 / 256
页数:5
相关论文
共 19 条
[1]   Template-based automatic recognition of birdsong syllables from continuous recordings [J].
Anderson, SE ;
Dave, AS ;
Margoliash, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02) :1209-1219
[2]  
Efremova D. B., 2019, 2019 DIG IM COMP, P1
[3]  
Gelling D., 2001, THESIS U SHEFFIELD
[4]  
Ghosal D, 2018, INTERSPEECH, P2087
[5]  
Härmä A, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS, P545
[6]  
Jancovic Peter, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P8252, DOI 10.1109/ICASSP.2014.6855210
[7]  
Jancovic P, 2016, INT CONF ACOUST SPEE, P559, DOI 10.1109/ICASSP.2016.7471737
[8]   Recommendations for acoustic recognizer performance assessment with application to five common automated signal recognition programs [J].
Knight, Elly C. ;
Hannah, Kevin C. ;
Foley, Gabriel J. ;
Scott, Chris D. ;
Brigham, R. Mark ;
Bayne, Erin .
AVIAN CONSERVATION AND ECOLOGY, 2017, 12 (02)
[9]   Automatic bird sound detection in long real-field recordings: Applications and tools [J].
Potamitis, Ilyas ;
Ntalampiras, Stavros ;
Jahn, Olaf ;
Riede, Klaus .
APPLIED ACOUSTICS, 2014, 80 :1-9
[10]   Deep learning [J].
Rusk, Nicole .
NATURE METHODS, 2016, 13 (01) :35-35