Underwater moving target detection and tracking based on enhanced you only look once and deep simple online and realtime tracking strategy

被引:0
作者
Sun, Bing [1 ]
Zhang, Wei [2 ]
Xing, Cheng [1 ]
Li, Yingyao [1 ]
机构
[1] Shanghai Maritime Univ, Shanghai Engn Res Ctr Intelligent Maritime Search, 1550 Haigang Ave, Shanghai 201306, Peoples R China
[2] Shanghai Dianji Univ, Sch Elect Engn, Shanghai 201306, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Target detection; Target tracking; Deep learning; You only look once; Deep simple online and realtime tracking; Data augmentation;
D O I
10.1016/j.engappai.2024.109982
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Addressing the challenges posed by factors like light attenuation, turbid water quality, large target scale changes, and dense distribution in underwater moving target images, we propose an enhanced strategy combining YOLOv5 (You Only Look Once version 5) and Deep SORT (Simple Online and Realtime Tracking) for target detection and tracking. Initially, various image enhancement techniques are employed to improve detection performance, while a Deep Convolutional Generative Adversarial Networks (DCGAN) approach augments the small underwater image dataset. Subsequently, we enhance the YOLOv5 method by integrating convolutional attention module into the detection network to enhance target salience in large-scale scenes. Additionally, a tiny target detection head is introduced to enhance the detector's capability in adapting to scale changes, particularly for small objects. Finally, the Deep SORT algorithm is integrated for tracking detected targets. Experimental comparisons with state-of-the-art methods validate the efficacy of the proposed approach. Our method achieves a detection accuracy of 94.3% on a self-made underwater fish target dataset, with a detection speed of 71FPS and robust tracking performance. Moreover, its applicability extends to various underwater targets, demonstrating superior performance.
引用
收藏
页数:14
相关论文
共 41 条
[1]   The State of the Art in Visual Analysis Approaches for Ocean and Atmospheric Datasets [J].
Afzal, S. ;
Hittawe, M. M. ;
Ghani, S. ;
Jamil, T. ;
Knio, O. ;
Hadwiger, M. ;
Hoteit, I .
COMPUTER GRAPHICS FORUM, 2019, 38 (03) :881-907
[2]   Visualization and Visual Analytics Approaches for Image and Video Datasets: A Survey [J].
Afzal, Shehzad ;
Ghani, Sohaib ;
Hittawe, Mohamad Mazen ;
Rashid, Sheikh Faisal ;
Knio, Omar M. ;
Hadwiger, Markus ;
Hoteit, Ibrahim .
ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2023, 13 (01)
[3]   Underwater image dehazing using global color features [J].
Alenezi, Fayadh ;
Armghan, Ammar ;
Santosh, K. C. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
[4]   Implementation of the Hungarian Algorithm to Account for Ligand Symmetry and Similarity in Structure-Based Design [J].
Allen, William J. ;
Rizzo, Robert C. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (02) :518-529
[5]  
Alsakar YM, 2023, Preprints, DOI 10.20944/preprints202307.0585.v1
[6]   A survey: object detection methods from CNN to transformer [J].
Arkin, Ershat ;
Yadikar, Nurbiya ;
Xu, Xuebin ;
Aysa, Alimjan ;
Ubul, Kurban .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (14) :21353-21383
[7]   Updating the neural network sediment load models using different sensitivity analysis methods: a regional application [J].
Asheghi, Reza ;
Hosseini, Seyed Abbas ;
Saneie, Mojtaba ;
Shahri, Abbas Abbaszadeh .
JOURNAL OF HYDROINFORMATICS, 2020, 22 (03) :562-577
[8]  
Bewley A, 2016, IEEE IMAGE PROC, P3464, DOI 10.1109/ICIP.2016.7533003
[9]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[10]  
Bouindour Samir, 2017, 8th International Conference on Imaging for Crime Detection and Prevention (ICDP 2017), P1