Improved YOLOv7 Underwater Object Detection Based on Attention Mechanism

被引:0
作者
Fu, Junshang [1 ]
Tian, Ying [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Liaoning, Peoples R China
关键词
U nderwater Target Detection; Marine Resources; YOLOv7; Attention Mechanism;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The task of detecting marine target organisms has always been a challenging issue, despite the numerous machine learning detection methods proposed to improve precision. The underwater image blurriness caused by irregular light absorption and water quality remains a major obstacle to achieving accurate detection. This results in high misalignment rates and poor underwater scene recognition capabilities for detecting underwater targets. To address this, we put forward a YOLOv7-RNCA underwater target detection technology based on improvements to YOLOv7. This model adds residual modules and coordinate attention mechanisms (CA) at the end of the backbone network, as well as incorporating partial convolution (PConv) modules. The combination of these three components makes the model more precise during the detection process while reducing unnecessary computation and memory access. This allows for better optimization during deep network training and preserves more feature information. Additionally, we reconstructed the SPPCSPC structure and incorporated a global attention mechanism (GAM) to form the SPPCSPC-GAM module in the neck network, which improves the performance of the convolutional neural network (CNN) and ensures good data capabilities and robustness during training, thereby enhancing the target detection ability. We also improved the neck ELAN module by introducing PConv convolution modules, which continuously enhance network learning abilities without disrupting the original gradient path. The introduction of the PConv module reduces redundant computation and memory access, making the ELAN-PConv module more effective at extracting spatial features. Our outcomes of experimentation indicate YOLOv7-RNCA network an average precision of 86.6% on the URPC dataset, outperforming existing methods in accuracy detection and demonstrating great potential as a promising solution for marine target monitoring tasks.
引用
收藏
页码:1377 / 1384
页数:8
相关论文
共 24 条
  • [1] Allen-Zhu Z, 2019, ADV NEUR IN, V32
  • [2] Chen JR, 2023, Arxiv, DOI [arXiv:2303.03667, DOI 10.48550/ARXIV.2303.03667]
  • [3] The future of food from the sea
    Costello, Christopher
    Cao, Ling
    Gelcich, Stefan
    Cisneros-Mata, Miguel A.
    Free, Christopher M.
    Froehlich, Halley E.
    Golden, Christopher D.
    Ishimura, Gakushi
    Maier, Jason
    Macadam-Somer, Ilan
    Mangin, Tracey
    Melnychuk, Michael C.
    Miyahara, Masanori
    de Moor, Carryn L.
    Naylor, Rosamond
    Nostbakken, Linda
    Ojea, Elena
    O'Reilly, Erin
    Parma, Ana M.
    Plantinga, Andrew J.
    Thilsted, Shakuntala H.
    Lubchenco, Jane
    [J]. NATURE, 2020, 588 (7836) : 95 - +
  • [4] Research Challenges, Recent Advances, and Popular Datasets in Deep Learning-Based Underwater Marine Object Detection: A Review
    Er, Meng Joo
    Chen, Jie
    Zhang, Yani
    Gao, Wenxiao
    [J]. SENSORS, 2023, 23 (04)
  • [5] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [6] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [7] Coordinate Attention for Efficient Mobile Network Design
    Hou, Qibin
    Zhou, Daquan
    Feng, Jiashi
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13708 - 13717
  • [8] Kinsey J C., 2006, IFAC C MANOEUVERING, V88, ppp 1, DOI DOI 10.1007/978-3-319-47766-418
  • [9] Leonard JJ, 2016, SPRINGER HANDBOOK OF OCEAN ENGINEERING, P341
  • [10] SSD: Single Shot MultiBox Detector
    Liu, Wei
    Anguelov, Dragomir
    Erhan, Dumitru
    Szegedy, Christian
    Reed, Scott
    Fu, Cheng-Yang
    Berg, Alexander C.
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37