A lightweight YOLOv8 integrating FasterNet for real-time underwater object detection

被引:57
作者
Guo, An [1 ]
Sun, Kaiqiong [1 ]
Zhang, Ziyi [1 ]
机构
[1] Wuhan Polytech Univ, Sch Math & Comp Sci, Wuhan 430023, Peoples R China
关键词
Lightweight model; Real-time detection; Underwater object detection; YOLOv8;
D O I
10.1007/s11554-024-01431-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a underwater target detection method that optimizes YOLOv8s to make it more suitable for real-time and underwater environments. First, a lightweight FasterNet module replaces the original backbone of YOLOv8s to reduce the computation and improve the performance of the network. Second, we modify current bi-directional feature pyramid network into a fast one by reducing unnecessary feature layers and changing the fusion method. Finally, we propose a lightweight-C2f structure by replacing the last standard convolution, bottleneck module of C2f with a GSConv and a partial convolution, respectively, to obtain a lighter and faster block. Experiments on three underwater datasets, RUOD, UTDAC2020 and URPC2022 show that the proposed method has mAP50\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_{50}$$\end{document} of 86.8%, 84.3% and 84.7% for the three datasets, respectively, with a speed of 156 FPS on NVIDIA A30 GPUs, which meets the requirement of real-time detection. Compared to the YOLOv8s model, the model volume is reduced on average by 24%, and the mAP accuracy is enhanced on all three datasets.
引用
收藏
页数:15
相关论文
共 45 条
[1]   HTDet: A Hybrid Transformer-Based Approach for Underwater Small Object Detection [J].
Chen, Gangqi ;
Mao, Zhaoyong ;
Wang, Kai ;
Shen, Junge .
REMOTE SENSING, 2023, 15 (04)
[2]   Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks [J].
Chen, Jierun ;
Kao, Shiu-Hong ;
He, Hao ;
Zhuo, Weipeng ;
Wen, Song ;
Lee, Chul-Ho ;
Chan, S. -H. Gary .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :12021-12031
[3]   Underwater-YCC: Underwater Target Detection Optimization Algorithm Based on YOLOv7 [J].
Chen, Xiao ;
Yuan, Mujiahui ;
Yang, Qi ;
Yao, Haiyang ;
Wang, Haiyan .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (05)
[4]  
Chen Yu, 2020, Advances in Neural Information Processing Systems, V33
[5]   Lightweight Transformers make strong encoders for underwater object detection [J].
Cui, Jinrong ;
Liu, Hailong ;
Zhong, Haowei ;
Huang, Cheng ;
Zhang, Weifeng .
SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (05) :1889-1896
[6]   CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows [J].
Dong, Xiaoyi ;
Bao, Jianmin ;
Chen, Dongdong ;
Zhang, Weiming ;
Yu, Nenghai ;
Yuan, Lu ;
Chen, Dong ;
Guo, Baining .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12114-12124
[7]  
Fayaz S, 2023, IEEe Trans. Veh. Technol.
[8]  
Flyai, 2020, Underwater object detection dataset
[9]   Rethinking general underwater object detection: Datasets, challenges, and solutions [J].
Fu, Chenping ;
Liu, Risheng ;
Fan, Xin ;
Chen, Puyang ;
Fu, Hao ;
Yuan, Wanqi ;
Zhu, Ming ;
Luo, Zhongxuan .
NEUROCOMPUTING, 2023, 517 :243-256
[10]   Augmented weighted bidirectional feature pyramid network for marine object detection [J].
Gao, Jinxiong ;
Geng, Xu ;
Zhang, Yonghui ;
Wang, Rong ;
Shao, Kaixuan .
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237