RCT-YOLOv8: A Tuna Detection Model for Distant-Water Fisheries Based on Improved YOLOv8

被引:0
作者
Zhou, Qingyi [1 ]
Liu, Yuqing [1 ]
机构
[1] Shanghai Ocean Univ, Coll Food Sci & Technol, 999 Hucheng Ring Rd, Shanghai 201306, Peoples R China
关键词
YOLOv8; deep learning; object detection; tuna detection;
D O I
10.20965/jaciii.2024.p1273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of distant-water fisheries, ship fishing and fish catch detection are now vital to modern fishing. Existing manual detection methods are prone to issues such as missed detections and false detections. Deep learning has enabled the deployment of detection models on shipboard devices, offering a new solution. However, many existing models are hindered by large parameters and computational complexity, making them unsuitable for shipboard use due to limited resources and costs onboard ships. To address these challenges, we propose the RCT-YOLOv8 model for tuna catch detection in this paper. Specifically, we adopt YOLOv8 as the base model and replace the network backbone with RepVGG network, which employs re-parameterized convolutions to enhance detection accuracy. Additionally, we incorporate coordinate attention at the end of the backbone to better aggregate channel-wise information. In the neck part, we introduce the contextual transformer (CoT) attention and propose the C2F-CoT model, which combines convolutional neural network with Transformer to capture global features, thereby improving detection accuracy and the effectiveness of feature propagation. We test multiple loss functions and select efficient intersection over union, which is more suitable for our algorithm. Furthermore, to adapt to devices with limited computational resources, we utilize the dependency-graph-based pruning method to compress the network model. Compared to the base network, the pruned model achieves a 9.8% increase in detection accuracy while reducing parameters and computational complexity by 40% and 35.8%, respectively. Compared to various algorithms, the pruned model demonstrates the highest detection accuracy, lowest parameter count, and lowest computational complexity, achieving optimal results at all fronts.
引用
收藏
页码:1273 / 1283
页数:11
相关论文
共 36 条
[1]   A modified YOLOv3 model for fish detection based on MobileNetv1 as backbone [J].
Cai, Kewei ;
Miao, Xinying ;
Wang, Wei ;
Pang, Hongshuai ;
Liu, Ying ;
Song, Jinyan .
AQUACULTURAL ENGINEERING, 2020, 91
[2]   AP-Loss for Accurate One-Stage Object Detection [J].
Chen, Kean ;
Lin, Weiyao ;
Li, Jianguo ;
See, John ;
Wang, Ji ;
Zou, Junni .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (11) :3782-3798
[3]   RepVGG: Making VGG-style ConvNets Great Again [J].
Ding, Xiaohan ;
Zhang, Xiangyu ;
Ma, Ningning ;
Han, Jungong ;
Ding, Guiguang ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13728-13737
[4]   Scale-Sensitive IOU Loss: An Improved Regression Loss Function in Remote Sensing Object Detection [J].
Du, Shuangjiang ;
Zhang, Baofu ;
Zhang, Pin .
IEEE ACCESS, 2021, 9 :141258-141272
[5]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[6]   GhostNet: More Features from Cheap Operations [J].
Han, Kai ;
Wang, Yunhe ;
Tian, Qi ;
Guo, Jianyuan ;
Xu, Chunjing ;
Xu, Chang .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1577-1586
[7]   DenseNet Convolutional Neural Networks Application for Predicting COVID-19 Using CT Image [J].
Hasan N. ;
Bao Y. ;
Shawon A. ;
Huang Y. .
SN Computer Science, 2021, 2 (5)
[8]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
[9]   Coordinate Attention for Efficient Mobile Network Design [J].
Hou, Qibin ;
Zhou, Daquan ;
Feng, Jiashi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13708-13717
[10]   Searching for MobileNetV3 [J].
Howard, Andrew ;
Sandler, Mark ;
Chu, Grace ;
Chen, Liang-Chieh ;
Chen, Bo ;
Tan, Mingxing ;
Wang, Weijun ;
Zhu, Yukun ;
Pang, Ruoming ;
Vasudevan, Vijay ;
Le, Quoc V. ;
Adam, Hartwig .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1314-1324