Improved SAR Ship Detection Algorithm for YOLOv7

被引:1
作者
Xiao, Zhenjiu [1 ]
Lin, Bohan [1 ]
Qu, Haicheng [1 ]
机构
[1] School of Software, Liaoning University of Technology, Liaoning, Huludao
关键词
ConvNeXt Block; global attention mechanism; NWD metric; partial convolution; SAR images; ship detection; YOLOv7;
D O I
10.3778/j.issn.1002-8331.2304-0109
中图分类号
学科分类号
摘要
In order to solve the problem of low accuracy of ship detection for small target ships and complex backgrounds in synthetic aperture radar(SAR)images, while making the model more lightweight, a SAR ship detection algorithm with improved YOLOv7 is proposed. The REP-PConv-ELAN module is built in the YOLOv7 backbone network to replace the original ELAN, which reduces the computation and memory consumption of the network, speeds up the inference speed, and enhances the extraction capability of the network for target features; the ConvNeXt Block is incorporated in the feature fusion part to accelerate the network to extract and fuse the feature information of complex targets. Then the global attention mechanism(GAM)is added to the down-sampling stage to build a sampling module for capturing global features(MP-GAM), which performs feature capture and feature fusion in channel dimension and spatial dimension to realize the interaction of multidimensional information and improve the ability of network to capture key features of ships in complex backgrounds. A new metric NWD is introduced at the regression loss function of the detection head to replace IoU to enhance the detection capability of small targets. Experimental comparisons are conducted on the HRSID dataset, and the improved method improves the AP value by 10.04 percentage points, the accuracy by 3.61 percentage points, and the recall by 15.15 percentage points compared to YOLOv7. The accuracy is significantly improved compared with the current mainstream algorithm. The experimental results show that the improved algorithm can effectively improve the false and missed detections of ships. © 2023 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:243 / 252
页数:9
相关论文
共 31 条
[1]  
KEEL B M., Constant false alarm rate detectors, IET Digital Library, (2010)
[2]  
REN S,, HE K, GIRSHICK R, Et al., Faster R-CNN:towards real-time object detection with region proposal networks[C], Advances in Neural Information Processing Systems, (2015)
[3]  
HE K, GKIOXARI G, Et al., Mask R-CNN[C], Proceedings of the IEEE International Conference on Computer Vision, pp. 2961-2969, (2017)
[4]  
CARION N, MASSA F, SYNNAEVE G, Et al., End-to-end object detection with transformers[C], 16th European Conference on Computer Vision, pp. 213-229, (2020)
[5]  
LIU W, ERHAN D, Et al., SSD:single shot multibox detector[C], 14th European Conference on Computer Vision, pp. 21-37, (2016)
[6]  
LIN T Y, GOYAL P, GIRSHICK R, Et al., Focal loss for dense object detection[C], Proceedings of the IEEE International Conference on Computer Vision, pp. 2980-2988, (2017)
[7]  
DUAN K, BAI S, XIE L, Et al., Centernet:keypoint triplets for object detection[C], Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6569-6578, (2019)
[8]  
TAN M, PANG R, LE Q V., Efficientdet:scalable and efficient object detection[C], Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781-10790, (2020)
[9]  
TIAN Z,, SHEN C,, CHEN H, Et al., Fcos:fully convolutional one-stage object detection[C], Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9627-9636, (2019)
[10]  
REDMON J, DIVVALA S, Et al., You only look once:unified,real-time object detection[C], Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)