Multi-Scale Ship Detection From SAR and Optical Imagery Via A More Accurate YOLOv3

Cited by: 87
Authors
Hong, Zhonghua [1, 2]
Yang, Ting [1, 2]
Tong, Xiaohua [4]
Zhang, Yun [1, 2]
Jiang, Shenlu [3]
Zhou, Ruyan [1, 2]
Han, Yanling [1, 2]
Wang, Jing [1, 2]
Yang, Shuhu [1, 2]
Liu, Sichong [4]
Affiliations
[1] Shanghai Ocean Univ, Coll Informat Technol, Shanghai 201306, Peoples R China
[2] Shanghai Ocean Univ, Key Lab Fisheries Informat, Minist Agr, Shanghai 201306, Peoples R China
[3] Chinese Univ Hong Kong, Space & Earth Informat Sci, Hong Kong, Peoples R China
[4] Tongji Univ, Coll Surveying & Geoinformat, Shanghai 200092, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Marine vehicles; Optical imaging; Synthetic aperture radar; Optical sensors; Optical reflection; Adaptive optics; Object detection; Deep learning-based object detection; synthetic aperture radar (SAR) and optical imagery; ship detection; "you only look once" version 3 (YOLOv3);
DOI
10.1109/JSTARS.2021.3087555
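Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification code
0808; 0809;
Abstract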
Using deep learning detection methods for ship detection remains a challenge, owing to the small scale of the objects and interference from complex sea surfaces. In addition, existing ship detection methods rarely verify the robustness of their algorithms on multisensor images. Thus, we propose an improvement to the "you only look once" version 3 (YOLOv3) framework for ship detection in marine surveillance, based on synthetic aperture radar (SAR) and optical imagery. First, improved anchor boxes are obtained by applying linear scaling to the results of the k-means++ algorithm. This addresses a difficulty in exploiting YOLOv3's multiscale detection: for a single target type, the anchor boxes assigned to different detection scales differ only slightly. Second, we add uncertainty estimators for the positioning of the bounding boxes by introducing a Gaussian parameter for ship detection into the YOLOv3 framework. Finally, four anchor boxes are allocated to each detection scale in the Gaussian-YOLO layer instead of three as in the default YOLOv3 settings, as there are wide disparities in an object's size and direction in remote sensing images with different resolutions. Applying the proposed strategy to "YOLOv3-spp" and "YOLOv3-tiny" enhances their results by 2%-3%. Compared with other models, the improved-YOLOv3 has the highest average precision on both the optical (93.56%) and SAR (95.52%) datasets. The improved-YOLOv3 is robust, even on a mixed dataset of SAR and optical images comprising images from different satellites and with different scales.
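The anchor-selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`iou_wh`, `kmeanspp_anchors`, `linear_scale`), the 1 - IoU distance, the stretch factors `alpha` and `beta`, and the particular linear-scaling formula are all assumptions made for illustration. The only details taken from the abstract are the use of k-means++, a linear scaling of the resulting anchors, and four anchors per detection scale (twelve in total for three scales).

```python
import numpy as np

def iou_wh(boxes, anchors):
    """IoU between (N, 2) boxes and (K, 2) anchors given as (width, height),
    with every box and anchor aligned at a shared corner."""
    inter = (np.minimum(boxes[:, None, 0], anchors[None, :, 0])
             * np.minimum(boxes[:, None, 1], anchors[None, :, 1]))
    area_b = boxes[:, 0] * boxes[:, 1]
    area_a = anchors[:, 0] * anchors[:, 1]
    return inter / (area_b[:, None] + area_a[None, :] - inter)

def kmeanspp_anchors(boxes, k=12, iters=50, seed=0):
    """Cluster (width, height) pairs into k anchors, sorted by area.
    Uses k-means++ seeding with distance 1 - IoU (an assumed choice)."""
    rng = np.random.default_rng(seed)
    centres = boxes[rng.integers(len(boxes))][None, :].copy()
    while len(centres) < k:
        # Sample the next centre with probability proportional to the
        # squared distance to the nearest centre chosen so far.
        d = (1.0 - iou_wh(boxes, centres)).min(axis=1)
        p = d ** 2 / (d ** 2).sum()
        centres = np.vstack([centres, boxes[rng.choice(len(boxes), p=p)]])
    for _ in range(iters):
        assign = iou_wh(boxes, centres).argmax(axis=1)
        for j in range(k):
            members = boxes[assign == j]
            if len(members):  # leave an empty cluster's centre unchanged
                centres[j] = members.mean(axis=0)
    return centres[np.argsort(centres.prod(axis=1))]

def linear_scale(anchors, alpha=0.5, beta=1.5):
    """Hypothetical linear stretch: the smallest width is multiplied by alpha,
    the largest by beta, widths in between are interpolated linearly, and each
    anchor keeps its aspect ratio. Assumes the widths are not all equal."""
    w, h = anchors[:, 0], anchors[:, 1]
    w_lo, w_hi = w.min(), w.max()
    new_w = (w - w_lo) / (w_hi - w_lo) * (beta * w_hi - alpha * w_lo) + alpha * w_lo
    return np.stack([new_w, new_w * h / w], axis=1)

# Synthetic (width, height) pairs stand in for labelled ship boxes.
boxes = np.random.default_rng(1).uniform(5, 200, size=(500, 2))
anchors = kmeanspp_anchors(boxes, k=12)
scaled = linear_scale(anchors)
per_scale = scaled.reshape(3, 4, 2)  # four anchors for each of three scales
```

Stretching the anchors widens the gap between the smallest and largest boxes, which is one way to make the anchors at different detection scales differ more than the raw cluster centres do.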
Pages: 6083-6101
Number of pages: 19