Workflow and convolutional neural network for automated identification of animal sounds

被引:44
作者
Ruff, Zachary J. [1 ,2 ]
Lesmeister, Damon B. [1 ,3 ]
Appel, Cara L. [1 ,3 ]
Sullivan, Christopher M. [4 ]
机构
[1] USDA Forest Serv, Pacific Northwest Res Stn, Corvallis, OR USA
[2] Oak Ridge Inst Sci & Educ, Oak Ridge, TN USA
[3] Oregon State Univ, Dept Fisheries & Wildlife, Corvallis, OR 97331 USA
[4] Oregon State Univ, Ctr Genome Res & Biocomp, Corvallis, OR 97331 USA
关键词
Bioacoustics; Machine learning; Wildlife; Ecology; Passive acoustic monitoring; Artificial intelligence; VOCALIZATIONS; POPULATIONS; HABITAT;
D O I
10.1016/j.ecolind.2021.107419
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
The use of passive acoustic monitoring in wildlife ecology has increased dramatically in recent years as researchers take advantage of improvements in autonomous recording units and analytical methods. These technologies have allowed researchers to collect large quantities of acoustic data which must then be processed to extract meaningful information, e.g. target species detections. A persistent issue in acoustic monitoring is the challenge of efficiently automating the detection of species of interest, and deep learning has emerged as a powerful approach to accomplish this task. Here we report on the development and application of a deep convolutional neural network for the automated detection of 14 forest-adapted birds and mammals by classifying spectrogram images generated from short audio clips. The neural network performed well for most species, with precision exceeding 90% and recall exceeding 50% at high score thresholds, indicating high power to detect these species when they were present and vocally active, combined with a low proportion of false positives. We describe a multi-step workflow that integrates this neural network to efficiently process large volumes of audio data with a combination of automated detection and human review. This workflow reduces the necessary human effort by > 99% compared to full manual review of the data. As an optional component of this workflow, we developed a graphical interface for the neural network that can be run through RStudio using the Shiny package, creating a portable and user-friendly way for field biologists and managers to efficiently process audio data and detect these target species close to the point of collection and with minimal delays using consumer-grade computers.
引用
收藏
页数:12
相关论文
共 49 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 107375, DOI [10.1016/j.apacoust.2020.107375., DOI 10.1016/J.APACOUST.2020.107375]
  • [3] [Anonymous], 2015, ICLR
  • [4] Artuso C., 2013, The Birds of North America, DOI DOI 10.2173/BNA.372
  • [5] Boarman W.I., 1999, BIRDS N AM
  • [6] VOCAL REPERTOIRE OF CHIPMUNKS (GENUS-EUTAMIAS) IN CALIFORNIA
    BRAND, LR
    [J]. ANIMAL BEHAVIOUR, 1976, 24 (MAY) : 319 - 335
  • [7] Uncovering Ecological Patterns with Convolutional Neural Networks
    Brodrick, Philip G.
    Davies, Andrew B.
    Asner, Gregory P.
    [J]. TRENDS IN ECOLOGY & EVOLUTION, 2019, 34 (08) : 734 - 745
  • [8] Bull E.L., 1995, BIRDS N AM, DOI [DOI 10.2173/BNA.148, 10.2173/bna.148.]
  • [9] Cannings R.J., 2017, BIRDS N AM, DOI [10.2173/bna.wesowl1.03, DOI 10.2173/BNA.WESOWL1.03]
  • [10] Chollet F., 2015, KERAS 20 COMPUTER SO