Trajectory-based fish event classification through pre-training with diffusion models

Cited by: 1
Authors
Canovi, Noemi [1]
Ellis, Benjamin A. [2, 3]
Sordalen, Tonje K. [4]
Allken, Vaneeda [3]
Halvorsen, Kim T. [3]
Malde, Ketil [3, 5]
Beyan, Cigdem [6]
Affiliations
[1] Univ Trento, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[2] Univ Plymouth, Sch Biol & Marine Sci, Plymouth PL4 8AA, Devon, England
[3] Inst Marine Res, Ecosyst Acoust Grp, N-5817 Bergen, Norway
[4] Univ Agder, Ctr Coastal Res, Dept Nat Sci, N-4604 Kristiansand, Norway
[5] Univ Bergen, Fac Nat Sci, Dept Informat, N-5008 Bergen, Norway
[6] Univ Verona, Dept Comp Sci, I-37134 Verona, Italy
Keywords
Fish behavior; Underwater videos; Event recognition; Trajectory; Generative models; Autoencoder; Diffusion model; Corkwing wrasse; ACTION RECOGNITION; CORKWING WRASSE; NEURAL-NETWORKS; CLIMATE-CHANGE; BEHAVIOR; IMPACT; VIDEOS; SIZE
DOI
10.1016/j.ecoinf.2024.102733
Chinese Library Classification (CLC) number
Q14 [Ecology (bioecology)]
Discipline classification codes
071012; 0713
Abstract
This study contributes to advancing the field of automatic fish event recognition in natural underwater videos, addressing the current gap in studying fish interaction and competition, including predator-prey relationships and mating behaviors. We used the corkwing wrasse (Symphodus melops) as a model, a marine species of commercial importance that reproduces in seaweed nests built and cared for by a single male. These nests attract a wide range of visitors and are the focal point for behaviors such as spawning, chasing, and maintenance. We propose a deep learning methodology to analyze the movement trajectories of the nesting male and classify the associated events observed in their natural habitat. Our approach leverages unsupervised pre-training based on diffusion models, leading to improved feature learning. Additionally, we introduce a dataset comprising 16,937 trajectories across 12 event classes, making it the largest in terms of event class diversity. Our results demonstrate the superior performance of our method compared to several deep architectures. The code for the proposed method and the trajectories can be found at https://github.com/NoeCanovi/Fish_Behaviors_Generative_Models.
Pages: 16