EF-UODA: Underwater Object Detection Based on Enhanced Feature

被引：1

作者：

Zu, Yunqin ^{[1
]}

Zhang, Lixun ^{[1
]}

Li, Siqi ^{[2
]}

Fan, Yuhe ^{[1
]}

Liu, Qijia ^{[1
]}

机构：

[1] Harbin Engn Univ, Coll Mech & Elect Engn, Harbin 150001, Peoples R China

[2] Harbin Engn Univ, Coll Shipbldg Engn, Harbin 150001, Peoples R China

来源：

JOURNAL OF MARINE SCIENCE AND ENGINEERING | 2024年 / 12卷 / 05期

关键词：

underwater object detection; feature extraction; feature fusion; YOLOv8; NEURAL-NETWORKS;

D O I：

10.3390/jmse12050729

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

The ability to detect underwater objects accurately is important in marine environmental engineering. Although many kinds of underwater object detection algorithms with relatively high accuracy have been proposed, they involve a large number of parameters and floating point operations (FLOPs), and often fail to yield satisfactory results in complex underwater environments. In light of the demand for an algorithm with the capability to extract high-quality features in complex underwater environments, we proposed a one-stage object detection algorithm called the enhanced feature-based underwater object detection algorithm (EF-UODA), which was based on the architecture of Next-ViT, the loss function of YOLOv8, and Ultralytics. First, we developed a highly efficient module for convolutions, called efficient multi-scale pointwise convolution (EMPC). Second, we proposed a feature pyramid architecture called the multipath fast fusion-feature pyramid network (M2F-FPN) based on different modes of feature fusion. Finally, we integrated the Next-ViT and the minimum point distance intersection over union loss functions in our proposed algorithm. Specifically, on the URPC2020 dataset, EF-UODA surpasses the state-of-the-art (SOTA) convolution-based object detection algorithm YOLOv8X by 2.9% mean average precision (mAP), and surpasses the SOTA ViT-based object detection algorithm real-time detection transformer (RT-DETR) by 2.1%. Meanwhile, it achieves the lowest FLOPs and parameters. The results of extensive experiments showed that EF-UODA had excellent feature extraction capability, and was adequately balanced in terms of the number of FLOPs and parameters.

引用

页数：20

共 58 条

[1] Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, DOI 10.48550/ARXIV.2004.10934]
[2] Underwater object detection using collaborative weakly supervision
Cai, Sixian
Li, Guocheng
Shan, Yuan
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 2022, 102
[3] Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks
Chen, Jierun
Kao, Shiu-Hong
He, Hao
Zhuo, Weipeng
Wen, Song
Lee, Chul-Ho
Chan, S. -H. Gary
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 12021 - 12031
[4] Underwater Target Recognition Based on Improved YOLOv4 Neural Network
Chen, Lingyu
Zheng, Meicheng
Duan, Shunqiang
Luo, Weilin
Yao, Ligang
[J]. ELECTRONICS, 2021, 10 (14)
[5] SWIPENET: Object detection in noisy underwater scenes
Chen, Long
Zhou, Feixiang
Wang, Shengke
Dong, Junyu
Li, Ning
Ma, Haiping
Wang, Xin
Zhou, Huiyu
[J]. PATTERN RECOGNITION, 2022, 132
[6] Underwater-YCC: Underwater Target Detection Optimization Algorithm Based on YOLOv7
Chen, Xiao
Yuan, Mujiahui
Yang, Qi
Yao, Haiyang
Wang, Haiyan
[J]. JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (05)
[7] Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution
Chen, Yunpeng
Fan, Haoqi
Xu, Bing
Yan, Zhicheng
Kalantidis, Yannis
Rohrbach, Marcus
Yan, Shuicheng
Feng, Jiashi
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3434 - 3443
[8] Underwater salient object detection by combining 2D and 3D visual features
Chen, Zhe
Gao, Hongmin
Zhang, Zhen
Zhou, Helen
Wang, Xun
Tian, Yan
[J]. NEUROCOMPUTING, 2020, 391 (391) : 249 - 259
[9] Real-Time Underwater Object Detection Based on DC Resistivity Method
Cho, Sung-Ho
Jung, Hyun-Key
Lee, Hyosun
Rim, Hyoungrea
Lee, Seong Kon
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (11): : 6833 - 6842
[10] Diverse Branch Block: Building a Convolution as an Inception-like Unit
Ding, Xiaohan
Zhang, Xiangyu
Han, Jungong
Ding, Guiguang
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10881 - 10890

← 1 2 3 4 5 6 →