Improved YOLOv7 Underwater Object Detection Based on Attention Mechanism

被引：0

作者：

Fu, Junshang ^{[1
]}

Tian, Ying ^{[1
]}

机构：

[1] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Liaoning, Peoples R China

来源：

ENGINEERING LETTERS | 2024年 / 32卷 / 07期

关键词：

U nderwater Target Detection; Marine Resources; YOLOv7; Attention Mechanism;

D O I：

暂无

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The task of detecting marine target organisms has always been a challenging issue, despite the numerous machine learning detection methods proposed to improve precision. The underwater image blurriness caused by irregular light absorption and water quality remains a major obstacle to achieving accurate detection. This results in high misalignment rates and poor underwater scene recognition capabilities for detecting underwater targets. To address this, we put forward a YOLOv7-RNCA underwater target detection technology based on improvements to YOLOv7. This model adds residual modules and coordinate attention mechanisms (CA) at the end of the backbone network, as well as incorporating partial convolution (PConv) modules. The combination of these three components makes the model more precise during the detection process while reducing unnecessary computation and memory access. This allows for better optimization during deep network training and preserves more feature information. Additionally, we reconstructed the SPPCSPC structure and incorporated a global attention mechanism (GAM) to form the SPPCSPC-GAM module in the neck network, which improves the performance of the convolutional neural network (CNN) and ensures good data capabilities and robustness during training, thereby enhancing the target detection ability. We also improved the neck ELAN module by introducing PConv convolution modules, which continuously enhance network learning abilities without disrupting the original gradient path. The introduction of the PConv module reduces redundant computation and memory access, making the ELAN-PConv module more effective at extracting spatial features. Our outcomes of experimentation indicate YOLOv7-RNCA network an average precision of 86.6% on the URPC dataset, outperforming existing methods in accuracy detection and demonstrating great potential as a promising solution for marine target monitoring tasks.

引用

页码：1377 / 1384

页数：8

共 24 条

[1] Allen-Zhu Z, 2019, ADV NEUR IN, V32
[2] Chen JR, 2023, Arxiv, DOI [arXiv:2303.03667, DOI 10.48550/ARXIV.2303.03667]
[3] The future of food from the sea
Costello, Christopher
Cao, Ling
Gelcich, Stefan
Cisneros-Mata, Miguel A.
Free, Christopher M.
Froehlich, Halley E.
Golden, Christopher D.
Ishimura, Gakushi
Maier, Jason
Macadam-Somer, Ilan
Mangin, Tracey
Melnychuk, Michael C.
Miyahara, Masanori
de Moor, Carryn L.
Naylor, Rosamond
Nostbakken, Linda
Ojea, Elena
O'Reilly, Erin
Parma, Ana M.
Plantinga, Andrew J.
Thilsted, Shakuntala H.
Lubchenco, Jane
[J]. NATURE, 2020, 588 (7836) : 95 - +
[4] Research Challenges, Recent Advances, and Popular Datasets in Deep Learning-Based Underwater Marine Object Detection: A Review
Er, Meng Joo
Chen, Jie
Zhang, Yani
Gao, Wenxiao
[J]. SENSORS, 2023, 23 (04)
[5] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[6] Rich feature hierarchies for accurate object detection and semantic segmentation
Girshick, Ross
Donahue, Jeff
Darrell, Trevor
Malik, Jitendra
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
[7] Coordinate Attention for Efficient Mobile Network Design
Hou, Qibin
Zhou, Daquan
Feng, Jiashi
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13708 - 13717
[8] Kinsey J C., 2006, IFAC C MANOEUVERING, V88, ppp 1, DOI DOI 10.1007/978-3-319-47766-418
[9] Leonard JJ, 2016, SPRINGER HANDBOOK OF OCEAN ENGINEERING, P341
[10] SSD: Single Shot MultiBox Detector
Liu, Wei
Anguelov, Dragomir
Erhan, Dumitru
Szegedy, Christian
Reed, Scott
Fu, Cheng-Yang
Berg, Alexander C.
[J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37

← 1 2 3 →