Multi-branch attention mechanism and path enhancement for underwater object detection

Citations: 0
Authors
Wang, Haibo [1]
Zhou, Zhiyu [1]
Affiliations
[1] Zhejiang Sci Tech Univ, Sch Comp Sci & Technol, Hangzhou 310018, Peoples R China
Source
ENGINEERING RESEARCH EXPRESS | 2025, Vol. 7, No. 2
Funding
National Key Research and Development Program of China
Keywords
underwater object detection; self-attention mechanism; small-scale object; convolutional neural network
DOI
10.1088/2631-8695/adc5c5
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Underwater object detection is an important research area with applications ranging from underwater exploration to ecological monitoring. However, the field faces multiple challenges, particularly the significant degradation of underwater image quality and variations in target scale. Traditional object detection algorithms struggle to accurately extract key features of underwater targets, leading to poor detection performance. This study aims to improve underwater object detection, especially for small-scale targets, so that it adapts to complex underwater environments. In this paper, we propose a novel underwater object detector called MPEDet, based on a multi-branch attention mechanism and path enhancement. Specifically, to improve the model's ability to extract key features in complex underwater environments, we propose a multi-branch attention mechanism called MBAM, which fully utilizes the dependency information between input features and input keys to strengthen semantic representation during the encoding phase. In addition, we use the designed path enhancement module to facilitate information interaction between high-level and low-level features and to reduce the loss of detailed information as high-level features propagate through the network. Finally, after training for only 24 epochs, the proposed MPEDet detector achieved AP50 values of 84.4% and 74.8% on the RUOD and UTDAC underwater test sets, respectively. The results demonstrate that the proposed MPEDet detector can effectively handle the task of underwater object detection.
Pages: 14