Automated cetacean detection in UAV imagery using AI models: a case study on Delphinid species

被引:0
作者
Canelas, Joao [1 ,3 ]
Clementino, Luana [2 ]
Cid, Andre [3 ]
Castro, Joana [3 ,4 ]
Machado, Ines [2 ,5 ]
Vieira, Susana [1 ]
机构
[1] Inst Super Tecn, IDMEC, Ave Rovisco Pais, P-1049001 Lisbon, Portugal
[2] WavEC Offshore Renewables, Edificio Diogo Cao, Doca Alcantara Norte, P-1350352 Lisbon, Portugal
[3] AIMM Assoc Invest Meio Marinho, Rua Maestro Fred Freitas N15-1, P-1500399 Lisbon, Portugal
[4] Univ Lisbon, Fac Ciencias, MARE Marine & Environm Sci Ctr, Lab Maritimo Guia,ARNET Aquat Res Network, Ave Nossa Senhora Cabo 939, P-2750374 Cascais, Portugal
[5] Univ Lisbon, MARE Marine & Environm Sci Ctr, ARNET Aquat Res Network, Fac Ciencias, Campo Grande, P-1749016 Lisbon, Portugal
关键词
Unmanned aerial vehicles; Convolutional neural networks; Long-short-term-memory; Machine learning; Marine mammals detection; Photo identification; MARINE MAMMALS;
D O I
10.1007/s41060-024-00704-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The identification and quantification of marine mammals is crucial for understanding their abundance, ecology and supporting their conservation efforts. Traditional methods for detecting cetaceans, however, are often labor-intensive and limited in their accuracy. To overcome these challenges, this work explores the use of convolutional neural networks (CNNs) as a tool for automating the detection of cetaceans through aerial images from unmanned aerial vehicles (UAVs). Additionally, the study proposes the use of Long-Short-Term-Memory (LSTM)-based models for video detection using a CNN-LSTM architecture. Models were trained on a selected dataset of dolphin examples acquired from 138 online videos with the aim of testing methods that hold potential for practical field monitoring. The approach was effectively validated on field data, suggesting that the method shows potential for further applications for operational settings. The results show that image-based detection methods are effective in the detection of dolphins from aerial UAV images, with the best-performing model, based on a ConvNext architecture, achieving high accuracy and f1-score values of 83.9% and 82.0%, respectively, within field observations conducted. However, video-based methods showed more difficulties in the detection task, as LSTM-based models struggled with generalization beyond their training environments, achieving a top accuracy of 68%. By reducing the labor required for cetacean detection, thus improving monitoring efficiency, this research provides a scalable approach that can support ongoing conservation efforts by enabling more robust data collection on cetacean populations.
引用
收藏
页数:15
相关论文
共 48 条
  • [41] Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
  • [42] Rethinking the Inception Architecture for Computer Vision
    Szegedy, Christian
    Vanhoucke, Vincent
    Ioffe, Sergey
    Shlens, Jon
    Wojna, Zbigniew
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2818 - 2826
  • [43] Tan MX, 2019, PR MACH LEARN RES, V97
  • [44] Tercero J.A., 2018, Wave and tidal energy, DOI [10.1002/9781119014492.ch9, DOI 10.1002/9781119014492.CH9]
  • [45] Drone Up! Quantifying Whale Behavior From a New Perspective Improves Observational Capacity
    Torres, Leigh G.
    Nieukirk, Sharon L.
    Lemos, Leila
    Chandler, Todd E.
    [J]. FRONTIERS IN MARINE SCIENCE, 2018, 5
  • [46] Acoustic detection and satellite-tracking leads to discovery of rare concentration of endangered North Pacific right whales
    Wade, Paul
    Heide-Jorgensen, Mads Peter
    Shelden, Kim
    Barlow, Jay
    Carretta, James
    Durban, John
    LeDuc, Rick
    Munger, Lisa
    Rankin, Shannon
    Sauter, Allan
    Stinchcomb, Charles
    [J]. BIOLOGY LETTERS, 2006, 2 (03) : 417 - 419
  • [47] Winkler C., 2020, IWC B, V68, P66
  • [48] Temporal Relational Reasoning in Videos
    Zhou, Bolei
    Andonian, Alex
    Oliva, Aude
    Torralba, Antonio
    [J]. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 831 - 846