A dual-task algorithm for ship target detection and semantic segmentation based on improved YOLOv5

被引:1
作者
Fu, Huixuan [1 ]
Wang, Xiangyu [1 ]
Peng, Chao [2 ]
Che, Ziwei [1 ]
Wang, Yuchao [1 ]
机构
[1] Harbin Engn Univ, Coll Intelligent Syst Sci & Engn, Harbin, Peoples R China
[2] Taihu Lab Deepsea Technol Sci, Wuxi, Peoples R China
来源
OCEANS 2023 - LIMERICK | 2023年
基金
中国国家自然科学基金;
关键词
ship target detection; ship semantic segmentation; YOLOv5; dual-task;
D O I
10.1109/OCEANSLimerick52467.2023.10244376
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Marine target detection technology plays an important role in sea surface monitoring, sea rescue and other fields. Aiming at the problem of low detection accuracy rate of ship targets, this paper proposed a dual-task network model which integrates target detection and semantic segmentation for ship objects. The semantic segmentation branch composed of Encoder and Decoder is added on the basis of YOLOv5 network to obtain multi-scale context information. Encoder and Decoder are connected through ASPP module, finally the low-level features and high-level features are further fused to generate semantic segmentation results. CE Loss is used as the semantic segmentation loss function to perform gradient optimization. Finally, it is superimposed with the target detection results to generate an image with both the anchor frame information of the target position and the contour information of the target boundary as the final result. The two tasks share the backbone extraction network and feature fusion network of YOLOv5. Different tasks have different positions of local minima in the shared layer. The interaction of unrelated parts between multiple tasks helps to escape from local minima. The detection accuracy is improved through the features of information sharing and mutual complement of multi-task models, and the semantic segmentation function is implemented. The experimental results show that, on the self-built dataset, the mAP of target detection is improved by 4.3 percentage points compared with YOLOv5, and the MIoU of semantic segmentation can reach 93.6%.
引用
收藏
页数:7
相关论文
共 24 条
[1]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[2]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[3]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[4]   Underwater Target Detection Algorithm Based on YOLO and Swin Transformer for Sonar Images [J].
Chen, Ruoyu ;
Zhan, Shuyue ;
Chen, Ying .
2022 OCEANS HAMPTON ROADS, 2022,
[5]  
Chen Yan, 2022, Ship Science and Technology (China), V44, P177
[6]  
Dai JF, 2017, Arxiv, DOI [arXiv:1703.06211, 10.48550/arXiv.1703.06211, DOI 10.48550/ARXIV.1703.06211]
[7]  
Fang Luping, 2018, Computer Engineering and Applications, V54, P11, DOI 10.3778/j.issn.1002-8331.1804-0167
[8]  
[金梅 Jin Mei], 2022, [船舶工程, Ship Engineering], V44, P100
[9]  
Khalfaoui-Hassani I, 2022, Arxiv, DOI [arXiv:2112.03740, DOI 10.48550/ARXIV.2112.03740]
[10]  
Li H., 2018, ARXIV180510180, P285