FishTrack: Multi-object tracking method for fish using spatiotemporal information fusion

被引:21
作者
Liu, Yiran [1 ,2 ,4 ]
Li, Beibei [1 ,4 ]
Zhou, Xinhui [1 ,4 ]
Li, Daoliang [1 ,2 ,3 ,4 ]
Duan, Qingling [1 ,2 ,3 ,4 ]
机构
[1] China Agr Univ, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
[2] China Agr Univ, Key Lab Smart Farming Technol Aquat Anim & Livesto, Minist Agr & Rural Affairs, Beijing 100083, Peoples R China
[3] China Agr Univ, Beijing Engn Technol Res Ctr Internet Things Agr, Beijing 100083, Peoples R China
[4] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
关键词
Spatiotemporal information fusion; Iterative Query; Multi-object tracking; Joint tracking model; Fish; Factory farming;
D O I
10.1016/j.eswa.2023.122194
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tracking the fish is a key step in analyzing fish behavior, evaluating their health levels, and warning of abnormal water quality, so it is of significant importance for intelligent monitoring in fish farming. However, multi-object tracking for fish is a very challenging task due to foam occlusion in the factory tank, high individual similarity, and fast-paced motion of fish. To address the above issues, an online multi-fish tracking model called FishTrack is proposed with 3 branches of target detection, track prediction, and re-identification, which establishes the motion model and appearance model for fish simultaneously, thus achieving the online multi-fish tracking. It is a complete Encoder-Decoder structure and the Pyramid Vision Transformer (PVT) is adopted as the backbone network to extract multi-level features. Then an Encoder is specially designed to encode the historical infor-mation of the positions of fish, and automatically update its spatiotemporal information in an autoregressive fashion, to perform the fusion of the spatiotemporal information of fish targets and avoid manual selection of spatiotemporal features. Finally, a parallel dual-Decoder is used to decode the motion and appearance features of fish to reduce the interference of the two optimization directions during the joint model training. The motion cues are first used in linear assignment to reduce the interference of occlusion and deformation issues, and when the motion model fails to track, the appearance cues can recover the identity to deal with long occlusion. The experiments on the established multiple fish tracking dataset showed that the higher order tracking accuracy (HOTA) reached 71.4%, the multi-object tracking accuracy (MOTA) reached 94.8%, the most tracked ratio (MT) reached 93.3%, and the identification F1 score (IDF1) reached 82.5%. The results show that FishTrack can solve the problem of tracking accuracy decrease caused by foam occlusion and fish deformation, and save the inference time by sharing features, increasing the robustness of online multi-fish tracking in factory farming.
引用
收藏
页数:17
相关论文
共 58 条
[1]   Artificial intelligence for fish behavior recognition may unlock fishing gear selectivity [J].
Abangan, Alexa Sugpatan ;
Kopp, Dorothee ;
Faillettaz, Robin .
FRONTIERS IN MARINE SCIENCE, 2023, 10
[2]  
Aharon N, 2022, Arxiv, DOI arXiv:2206.14651
[3]  
Anas O., 2020, Procedia Computer Science, V175, P141
[4]  
Arvind CS, 2019, TENCON IEEE REGION, P778, DOI [10.1109/TENCON.2019.8929613, 10.1109/tencon.2019.8929613]
[5]   Zebrafish tracking using YOLOv2 and Kalman filter [J].
Barreiros, Marta de Oliveira ;
Dantas, Diego de Oliveira ;
de Oliveira Silva, Luis Claudio ;
Ribeiro, Sidarta ;
Barros, Allan Kardec .
SCIENTIFIC REPORTS, 2021, 11 (01)
[6]   TAU: A framework for video-based traffic analytics leveraging artificial intelligence and unmanned aerial systems [J].
Benjdira, Bilel ;
Koubaa, Anis ;
Azar, Ahmad Taher ;
Khan, Zahid ;
Ammar, Adel ;
Boulila, Wadii .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 114
[7]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[8]   SUZE: A Hybrid Approach for Multi-fish Detection and Tracking [J].
Bhateja, Aditi ;
Lall, Brejesh ;
Kalra, Prem K. ;
Chaudhary, Kompal .
GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
[9]  
Bochinski E, 2018, 2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), P435
[10]   Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking [J].
Cao, Jinkun ;
Pang, Jiangmiao ;
Weng, Xinshuo ;
Khirodkar, Rawal ;
Kitani, Kris .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :9686-9696