Underwater-YCC: Underwater Target Detection Optimization Algorithm Based on YOLOv7

被引:38
作者
Chen, Xiao [1 ]
Yuan, Mujiahui [1 ]
Yang, Qi [1 ]
Yao, Haiyang [1 ]
Wang, Haiyan [1 ,2 ]
机构
[1] Shaanxi Univ Sci & Technol, Sch Elect Informat & Artificial Intelligence, Xian 710021, Peoples R China
[2] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
underwater target detection; attention mechanism; YOLOv7; deep learning; model optimization;
D O I
10.3390/jmse11050995
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Underwater target detection using optical images is a challenging yet promising area that has witnessed significant progress. However, fuzzy distortions and irregular light absorption in the underwater environment often lead to image blur and color bias, particularly for small targets. Consequently, existing methods have yet to yield satisfactory results. To address this issue, we propose the Underwater-YCC optimization algorithm based on You Only Look Once (YOLO) v7 to enhance the accuracy of detecting small targets underwater. Our algorithm utilizes the Convolutional Block Attention Module (CBAM) to obtain fine-grained semantic information by selecting an optimal position through multiple experiments. Furthermore, we employ the Conv2Former as the Neck component of the network for underwater blurred images. Finally, we apply the Wise-IoU, which is effective in improving detection accuracy by assigning multiple weights between high- and low-quality images. Our experiments on the URPC2020 dataset demonstrate that the Underwater-YCC algorithm achieves a mean Average Precision (mAP) of up to 87.16% in complex underwater environments.
引用
收藏
页数:17
相关论文
共 29 条
[1]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[2]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[3]  
Deng J., 2020, J PHYS C SERIES, V1684, P012028, DOI 10.1088/1742-6596/1684/1/012028
[4]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[5]   Attention mechanisms in computer vision: A survey [J].
Guo, Meng-Hao ;
Xu, Tian-Xing ;
Liu, Jiang-Jiang ;
Liu, Zheng-Ning ;
Jiang, Peng-Tao ;
Mu, Tai-Jiang ;
Zhang, Song-Hai ;
Martin, Ralph R. ;
Cheng, Ming-Ming ;
Hu, Shi-Min .
COMPUTATIONAL VISUAL MEDIA, 2022, 8 (03) :331-368
[6]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[7]  
Hou Q., 2022, arXiv, DOI DOI 10.1109/TPAMI.2024.3401450
[8]   Underwater image processing and analysis: A review [J].
Jian, Muwei ;
Liu, Xiangyu ;
Luo, Hanjiang ;
Lu, Xiangwei ;
Yu, Hui ;
Dong, Junyu .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 91
[9]  
Li CY, 2022, Arxiv, DOI [arXiv:2209.02976, 10.48550/arXiv.2209.02976]
[10]   An Improved YOLOV5 Based on Triplet Attention and Prediction Head Optimization for Marine Organism Detection on Underwater Mobile Platforms [J].
Li, Yan ;
Bai, Xinying ;
Xia, Chunlei .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (09)