Single Shot Anchor Refinement Network for Oriented Object Detection in Optical Remote Sensing Imagery

被引：39

作者：

Bao, Songze ^{[1
,2
]}

Zhong, Xing ^{[1
,2
,3
]}

Zhu, Ruifei ^{[1
,2
,3
]}

Zhang, Xiaonan ^{[1
,2
]}

Li, Zhuqiang ^{[3
]}

Li, Mengyang ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Jilin, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[3] Chang Guang Satellite Technol Co Ltd, Key Lab Satellite Remote Sensing Applicat Technol, Changchun 130000, Jilin, Peoples R China

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

美国国家科学基金会;

关键词：

Convolutional neural network (CNN); remote sensing; oriented object detection; anchor refinement; SHIP DETECTION;

D O I：

10.1109/ACCESS.2019.2924643

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Object detection is a challenging task in the field of remote sensing applications due to the complex backgrounds and uncertain orientation of targets. Compared with the horizontal bounding box, the oriented bounding box can provide orientation information while retaining the true size. Most existing oriented object detection methods are based on Faster-RCNN and the other one-stage methods that can achieve real-time speed but have shortcomings in localization and detection accuracy. To further enhance the performance of one-stage methods, we propose an oriented object detection framework that is based on the single shot detector, namely, single shot anchor refinement network (S(2)ARN). The S(2)ARN obtains the accurate detection results by performing two consecutive regressions. More precisely, the multilevel features of the backbone are used to regress the coordinate offsets between the predefined rotated anchors and the ground-truth boxes to generate the refined anchors. The classification and regression subnetworks assigned to the output features are used to perform the second regression to determine the class labels and further adjust the location of the refined anchors. In addition, receptive field amplification modules (RFAMs) are inserted to enlarge the receptive field and extract more discriminative features. Furthermore, in the anchor matching step, angle-related Intersection over Union (ArIoU) is used to calculate the Intersection over Union (IoU) score instead of the traditional method. Benefiting from the multiple regressions and the insensitivity of the ArIoU score to the angle deviation, the angle sampling interval of the rotated anchor can be reduced. The experimental results for the two public datasets, HRSC2016 and UCAS-AOD, demonstrate the effectiveness of the proposed network.

引用

页码：87150 / 87161

页数：12

共 45 条

[41] Net-FLICS: fast quantitative wide-field fluorescence lifetime imaging with compressed sensing - a deep learning approach [J].

Yao, Ruoyang ;

Ochoa, Marien ;

Yan, Pingkun ;

Intes, Xavier .

LIGHT-SCIENCE & APPLICATIONS, 2019, 8 (1)

[42] S3FD: Single Shot Scale-invariant Face Detector [J].

Zhang, Shifeng ;

Zhu, Xiangyu ;

Lei, Zhen ;

Shi, Hailin ;

Wang, Xiaobo ;

Li, Stan Z. .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :192-201

[43] Toward Arbitrary-Oriented Ship Detection With Rotated Region Proposal and Discrimination Networks [J].

Zhang, Zenghui ;

Guo, Weiwei ;

Zhu, Shengnan ;

Yu, Wenxian .

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (11) :1745-1749

[44] Cascade R-CNN: Delving into High Quality Object Detection [J].

Cai, Zhaowei ;

Vasconcelos, Nuno .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162

[45]

Zhu HG, 2015, IEEE IMAGE PROC, P3735, DOI 10.1109/ICIP.2015.7351502

← 1 2 3 4 5 →