Convolutional Neural Networks for Scops Owl Sound Classification

Cited by: 25
Authors
Hidayat, Alam Ahmad [1]
Cenggoro, Tjeng Wawan [1, 2]
Pardamean, Bens [1, 3]
Affiliations
[1] Bina Nusantara Univ, Bioinformat & Data Sci Res Ctr, Jakarta 11480, Indonesia
[2] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, Jakarta 11480, Indonesia
[3] Bina Nusantara Univ, Comp Sci Dept, BINUS Grad Program Master Comp Sci, Jakarta 11480, Indonesia
Keywords
acoustic features; bird sound classification; convolutional neural network; mean average precision; scops owl
DOI
10.1016/j.procs.2021.12.010
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Adopting deep learning models for bird sound classification has become common practice for building robust automated bird sound detection systems. In this paper, we employ a four-layer Convolutional Neural Network (CNN) to classify different species of Indonesian scops owls based on their vocalizations. Two widely used representations of an acoustic signal, the log-scaled mel-spectrogram and Mel Frequency Cepstral Coefficients (MFCC), are extracted from each sound file and fed into the network separately to compare model performance across input types. A more complex CNN that processes the two extracted acoustic representations simultaneously is proposed to provide a direct comparison with the baseline model. The dual-input network is the best-performing model in our experiment, achieving a Mean Average Precision (MAP) of 97.55%. Meanwhile, the baseline model achieves a MAP of 94.36% with the mel-spectrogram input and 96.08% with the MFCC input. (C) 2021 The Authors. Published by Elsevier B.V.
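The abstract reports results as Mean Average Precision but does not say how it is computed. The following is a minimal sketch of one common definition, assuming MAP is the unweighted mean over classes of the per-class average precision obtained by ranking recordings by predicted score. The function names and the toy data are hypothetical and are not taken from the paper.

```python
def average_precision(labels, scores):
    """Average precision for one class.

    labels: 0/1 flags, 1 if the recording belongs to the class.
    scores: model confidences for that class, same order as labels.
    """
    # Rank recordings from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    precision_sum = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            hits += 1
            # Precision measured at the rank of each positive recording.
            precision_sum += hits / rank
    # Assumption: a class with no positive recordings contributes 0.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(y_true, y_score):
    """Unweighted mean of per-class average precision.

    y_true: list of true class indices, one per recording.
    y_score: list of per-class score lists, one row per recording.
    """
    n_classes = len(y_score[0])
    aps = []
    for c in range(n_classes):
        labels = [1 if t == c else 0 for t in y_true]
        scores = [row[c] for row in y_score]
        aps.append(average_precision(labels, scores))
    return sum(aps) / n_classes


# Toy example with two hypothetical classes and four recordings.
toy_true = [0, 1, 1, 0]
toy_score = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4], [0.3, 0.7]]
toy_map = mean_average_precision(toy_true, toy_score)
```

Under this definition a class is scored only by how well its positive recordings are ranked, so a score of 1.0 requires every recording of each class to outrank all recordings of other classes for that class.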
Pages: 81-87
Number of pages: 7