FUSING SHALLOW AND DEEP LEARNING FOR BIOACOUSTIC BIRD SPECIES CLASSIFICATION

Cited by: 0
Authors
Salamon, Justin [1, 2]
Bello, Juan Pablo [1]
Farnsworth, Andrew [3 ]
Kelling, Steve [3 ]
Affiliations
[1] NYU, Mus & Audio Res Lab, New York, NY 10003 USA
[2] NYU, Ctr Urban Sci & Progress, New York, NY 10003 USA
[3] Cornell Univ, Cornell Lab Ornithol, Ithaca, NY USA
Source
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2017
Funding
National Science Foundation (USA);
Keywords
Convolutional neural networks; bioacoustics; flight calls; deep learning; data augmentation;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Automated classification of organisms to species based on their vocalizations would contribute tremendously to abilities to monitor biodiversity, with a wide range of applications in the field of ecology. In particular, automated classification of migrating birds' flight calls could yield new biological insights and conservation applications for birds that vocalize during migration. In this paper we explore state-of-the-art classification techniques for large-vocabulary bird species classification from flight calls. In particular, we contrast a "shallow learning" approach based on unsupervised dictionary learning with a deep convolutional neural network combined with data augmentation. We show that the two models perform comparably on a dataset of 5428 flight calls spanning 43 different species, with both significantly outperforming an MFCC baseline. Finally, we show that by combining the models using a simple late-fusion approach we can further improve the results, obtaining a state-of-the-art classification accuracy of 0.96.
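The abstract does not say which late-fusion rule is used, so the following is only a minimal sketch of one common variant: averaging the per-class probabilities produced by two independently trained models and taking the highest-scoring class. The function name `late_fusion`, the `weight_a` parameter, and the toy probability values are assumptions made for illustration and do not come from the paper.

```python
import numpy as np


def late_fusion(probs_a, probs_b, weight_a=0.5):
    """Fuse two models' class probabilities by weighted averaging.

    probs_a, probs_b: arrays of shape (n_clips, n_classes).
    weight_a: hypothetical mixing weight for the first model (0.5 = plain mean).
    Returns the predicted class index per clip and the fused probabilities.
    """
    probs_a = np.asarray(probs_a, dtype=float)
    probs_b = np.asarray(probs_b, dtype=float)
    if probs_a.shape != probs_b.shape:
        raise ValueError("both models must score the same clips and classes")
    fused = weight_a * probs_a + (1.0 - weight_a) * probs_b
    return fused.argmax(axis=1), fused


# Toy example: 2 clips, 3 species (made-up numbers, not results from the paper).
shallow = np.array([[0.6, 0.3, 0.1],
                    [0.2, 0.5, 0.3]])
deep = np.array([[0.5, 0.4, 0.1],
                 [0.1, 0.2, 0.7]])

labels, fused = late_fusion(shallow, deep)
print(labels)
```

In this toy case the two models disagree on the second clip, and the averaged probabilities decide between them. Other late-fusion rules, such as multiplying probabilities or training a small classifier on the concatenated outputs, would fit the same interface.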
Pages: 141-145
Page count: 5