ORCA-SPOT: An Automatic Killer Whale Sound Detection Toolkit Using Deep Learning

被引:66
作者
Bergler, Christian [1 ]
Schroeter, Hendrik [1 ]
Cheng, Rachael Xi [2 ]
Barth, Volker [3 ]
Weber, Michael [3 ]
Noeth, Elmar [1 ]
Hofer, Heribert [2 ,4 ,5 ]
Maier, Andreas [1 ]
机构
[1] Friedrich Alexander Univ Erlangen Nuremberg, Dept Comp Sci, Pattern Recognit Lab, Martensstr 3, D-91058 Erlangen, Germany
[2] Leibniz Inst Zoo & Wildlife Res IZW Forschungsver, Dept Ecol Dynam, Alfred Kowalke Str 17, D-10315 Berlin, Germany
[3] Anthromedia, Nansenstr 19, D-12047 Berlin, Germany
[4] Free Univ Berlin, Dept Biol, Pharm, Chem, Takustr 3, D-14195 Berlin, Germany
[5] Free Univ Berlin, Dept Vet Med, Oertzenweg 19b, D-14195 Berlin, Germany
关键词
BOTTLE-NOSED-DOLPHIN; SINGING HUMPBACK WHALES; ORCINUS-ORCA; MEGAPTERA-NOVAEANGLIAE; ECHOLOCATION CALLS; TURSIOPS-TRUNCATUS; SIGNATURE WHISTLES; BRITISH-COLUMBIA; NEURAL-NETWORKS; VOCAL BEHAVIOR;
D O I
10.1038/s41598-019-47335-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Large bioacoustic archives of wild animals are an important source to identify reappearing communication patterns, which can then be related to recurring behavioral patterns to advance the current understanding of intra-specific communication of non-human animals. A main challenge remains that most large-scale bioacoustic archives contain only a small percentage of animal vocalizations and a large amount of environmental noise, which makes it extremely difficult to manually retrieve sufficient vocalizations for further analysis - particularly important for species with advanced social systems and complex vocalizations. In this study deep neural networks were trained on 11,509 killer whale (Orcinus orca) signals and 34,848 noise segments. The resulting toolkit ORCA-SPOT was tested on a large-scale bioacoustic repository - the Orchive - comprising roughly 19,000 hours of killer whale underwater recordings. An automated segmentation of the entire Orchive recordings (about 2.2 years) took approximately 8 days. It achieved a time-based precision or positive-predictive-value (PPV) of 93.2% and an area-under-the-curve (AUC) of 0.9523. This approach enables an automated annotation procedure of large bioacoustics databases to extract killer whale sounds, which are essential for subsequent identification of significant communication patterns. The code will be publicly available in October 2019 to support the application of deep learning to bioaoucstic research. ORCA-SPOT can be adapted to other animal species.
引用
收藏
页数:17
相关论文
共 101 条
  • [1] A piezoelectric bone-conduction bending hearing actuator
    Adamson, R. B. A.
    Bance, M.
    Brown, J. A.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (04) : 2003 - 2008
  • [2] Amodei D, 2016, PR MACH LEARN RES, V48
  • [3] Template-based automatic recognition of birdsong syllables from continuous recordings
    Anderson, SE
    Dave, AS
    Margoliash, D
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 100 (02) : 1209 - 1219
  • [4] [Anonymous], 2017, THESIS
  • [5] [Anonymous], 2017 25 EUR SIGN PRO
  • [6] [Anonymous], 2000, KILLER WHALES NATURA
  • [7] Baird RW, 2001, CAN FIELD NAT, V115, P676
  • [8] Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring
    Bardeli, R.
    Wolff, D.
    Kurth, F.
    Koch, M.
    Tauchert, K. -H.
    Frommolt, K. -H.
    [J]. PATTERN RECOGNITION LETTERS, 2010, 31 (12) : 1524 - 1534
  • [9] The mixed blessing of echolocation: Differences in sonar use by fish-eating and mammal-eating killer whales
    BarrettLennard, LG
    Ford, JKB
    Heise, KA
    [J]. ANIMAL BEHAVIOUR, 1996, 51 : 553 - 565
  • [10] Bergler C, 2019, DEEPAL FIELDWORK DAT