EF-UODA: Underwater Object Detection Based on Enhanced Feature

被引:1
作者
Zu, Yunqin [1 ]
Zhang, Lixun [1 ]
Li, Siqi [2 ]
Fan, Yuhe [1 ]
Liu, Qijia [1 ]
机构
[1] Harbin Engn Univ, Coll Mech & Elect Engn, Harbin 150001, Peoples R China
[2] Harbin Engn Univ, Coll Shipbldg Engn, Harbin 150001, Peoples R China
关键词
underwater object detection; feature extraction; feature fusion; YOLOv8; NEURAL-NETWORKS;
D O I
10.3390/jmse12050729
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
The ability to detect underwater objects accurately is important in marine environmental engineering. Although many kinds of underwater object detection algorithms with relatively high accuracy have been proposed, they involve a large number of parameters and floating point operations (FLOPs), and often fail to yield satisfactory results in complex underwater environments. In light of the demand for an algorithm with the capability to extract high-quality features in complex underwater environments, we proposed a one-stage object detection algorithm called the enhanced feature-based underwater object detection algorithm (EF-UODA), which was based on the architecture of Next-ViT, the loss function of YOLOv8, and Ultralytics. First, we developed a highly efficient module for convolutions, called efficient multi-scale pointwise convolution (EMPC). Second, we proposed a feature pyramid architecture called the multipath fast fusion-feature pyramid network (M2F-FPN) based on different modes of feature fusion. Finally, we integrated the Next-ViT and the minimum point distance intersection over union loss functions in our proposed algorithm. Specifically, on the URPC2020 dataset, EF-UODA surpasses the state-of-the-art (SOTA) convolution-based object detection algorithm YOLOv8X by 2.9% mean average precision (mAP), and surpasses the SOTA ViT-based object detection algorithm real-time detection transformer (RT-DETR) by 2.1%. Meanwhile, it achieves the lowest FLOPs and parameters. The results of extensive experiments showed that EF-UODA had excellent feature extraction capability, and was adequately balanced in terms of the number of FLOPs and parameters.
引用
收藏
页数:20
相关论文
共 58 条
  • [1] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
  • [2] Underwater object detection using collaborative weakly supervision
    Cai, Sixian
    Li, Guocheng
    Shan, Yuan
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
  • [3] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
    Chen, Jierun
    Kao, Shiu-Hong
    He, Hao
    Zhuo, Weipeng
    Wen, Song
    Lee, Chul-Ho
    Chan, S. -H. Gary
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
  • [4] Underwater Target Recognition Based on Improved YOLOv4 Neural Network
    Chen, Lingyu
    Zheng, Meicheng
    Duan, Shunqiang
    Luo, Weilin
    Yao, Ligang
    [J]. ELECTRONICS, 2021, 10 (14)
  • [5] SWIPENET: Object detection in noisy underwater scenes
    Chen, Long
    Zhou, Feixiang
    Wang, Shengke
    Dong, Junyu
    Li, Ning
    Ma, Haiping
    Wang, Xin
    Zhou, Huiyu
    [J]. PATTERN RECOGNITION, 2022, 132
  • [6] Underwater-YCC: Underwater Target Detection Optimization Algorithm Based on YOLOv7
    Chen, Xiao
    Yuan, Mujiahui
    Yang, Qi
    Yao, Haiyang
    Wang, Haiyan
    [J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (05)
  • [7] Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
    Chen, Yunpeng
    Fan, Haoqi
    Xu, Bing
    Yan, Zhicheng
    Kalantidis, Yannis
    Rohrbach, Marcus
    Yan, Shuicheng
    Feng, Jiashi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3434 - 3443
  • [8] Underwater salient object detection by combining 2D and 3D visual features
    Chen, Zhe
    Gao, Hongmin
    Zhang, Zhen
    Zhou, Helen
    Wang, Xun
    Tian, Yan
    [J]. NEUROCOMPUTING, 2020, 391 (391) : 249 - 259
  • [9] Real-Time Underwater Object Detection Based on DC Resistivity Method
    Cho, Sung-Ho
    Jung, Hyun-Key
    Lee, Hyosun
    Rim, Hyoungrea
    Lee, Seong Kon
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (11): : 6833 - 6842
  • [10] Diverse Branch Block: Building a Convolution as an Inception-like Unit
    Ding, Xiaohan
    Zhang, Xiangyu
    Han, Jungong
    Ding, Guiguang
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10881 - 10890