RAOD: refined oriented detector with augmented feature in remote sensing images object detection

被引：6

作者：

Shi, Qin ^{[1
]}

Zhu, Yu ^{[1
]}

Fang, Chuantao ^{[1
]}

Wang, Nan ^{[1
]}

Lin, Jiajun ^{[1
]}

机构：

[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China

来源：

APPLIED INTELLIGENCE | 2022年 / 52卷 / 13期

关键词：

Remote sensing image; Oriented object detection; Augmented feature pyramid; Deformable RoI pooling; Rotated RoI align;

D O I：

10.1007/s10489-022-03393-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object detection is a challenging task in remote sensing. Aerial images are distinguished by complex backgrounds, arbitrary orientations, and dense distributions. Considering those difficulties, this paper proposes a two-stage refined oriented detector with augmented features named RAOD. First, a novel Augmented Feature Pyramid Network (A-FPN) is built to enhance fusion both in spatial and channel dimensions. Specifically, it mainly consists of three modules: Scale Transfer Module (STM), Feature Aggregate Module (FAM) and Feature Refinement Module (FRM). STM reduces information loss when fusing features in the top-down pathway. FAM aggregates features from different scales. FRM aims to refine the integrated features using a lightweight attention module. Then, we adopt a two-step processing, which consists of a coarse stage and a refinement stage. In the coarse stage, deformable RoI pooling is adopted to improve the network's ability of modeling spatial transformations and then horizontal proposals are transformed into oriented ones. In the refinement stage, Rotated RoI align (RRoI align) is used to extract rotation-invariant features from rotated RoIs and further optimize the localization. To enhance stability and robustness during training, smooth Ln is chosen as regression loss as it has better ability in terms of robustness and stability than smooth L-1 loss. Extensive experiments on several rotation detection datasets demonstrate the effectiveness of our method. Results show that our method is able to achieve 79.78%, 74.7% and 94.82% on DOTA-v1.0, DOTA-v1.5 and HRSC2016, respectively.

引用

页码：15278 / 15294

页数：17

共 59 条

[11] Beyond Bounding-Box: Convex-hull Feature Adaptation for Oriented and Densely Packed Object Detection [J].

Guo, Zonghao ;

Liu, Chang ;

Zhang, Xiaosong ;

Jiao, Jianbin ;

Ji, Xiangyang ;

Ye, Qixiang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :8788-8797

[12] Align Deep Features for Oriented Object Detection [J].

Han, Jiaming ;

Ding, Jian ;

Li, Jie ;

Xia, Gui-Song .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60

[13] ReDet: A Rotation-equivariant Detector for Aerial Object Detection [J].

Han, Jiaming ;

Ding, Jian ;

Xue, Nan ;

Xia, Gui-Song .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2785-2794

[14]

He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]

[15] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[16] MEAD: a Mask-guidEd Anchor-free Detector for oriented aerial object detection [J].

He, Zewen ;

Ren, Zhida ;

Yang, Xuebing ;

Yang, Yang ;

Zhang, Wensheng .

APPLIED INTELLIGENCE, 2022, 52 (04) :4382-4397

[17]

Jaderberg M, 2015, ADV NEUR IN, V28

[18]

Li C., 2019, P IEEE CVF C COMP VI, P20

[19]

Li W., 2021, ARXIV210511111, V2021

[20] Focal Loss for Dense Object Detection [J].

Lin, Tsung-Yi ;

Goyal, Priya ;

Girshick, Ross ;

He, Kaiming ;

Dollar, Piotr .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) :318-327

← 1 2 3 4 5 6 →