Adaptive multi-level feature fusion and attention-based network for arbitrary-oriented object detection in remote sensing imagery

Cited by: 41
Authors
Chen, Luchang [1]
Liu, Chunsheng [1]
Chang, Faliang [1]
Li, Shuang [1]
Nie, Zhaoying [1]
Affiliations
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Object detection in aerial images; Deep neural network; Attention network; Training strategy;
DOI
10.1016/j.neucom.2021.04.011
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Compared with the classic object detection problem, detecting objects in aerial images poses some special challenges, including huge orientation variations, complicated and large backgrounds, and a wide multi-scale distribution. Considering these three challenges together, we propose a novel arbitrary-oriented object detection framework consisting of three main parts. Firstly, the Cascading Attention Network (CA-Net), composed of a patching self-attention module and a supervised spatial attention module, is proposed for enhancing the feature representations of objects of interest and suppressing the background noise in the Feature Pyramid Network (FPN) from coarse to fine. Then, the Adaptive Feature Concatenate Network (AFC-Net) is proposed to adaptively stack the feature maps pooled from all FPN levels as well as the global semantic features, for dealing with the multi-scale change of objects. Lastly, the OBB Multi-Definition and Selection Strategy (OBB-MDS-Strategy) is proposed to regress rotated bounding boxes more smoothly and detect oriented objects more accurately in the training process. Our experiments are conducted on two common and challenging aerial datasets, i.e., DOTA and HRSC2016. Experimental results show that the proposed method achieves superior performance in multi-oriented object detection compared with representative methods. © 2021 Elsevier B.V. All rights reserved.
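The abstract does not spell out how the OBB-MDS-Strategy works, so the following is only a minimal illustrative sketch of the general idea its name suggests: a rotated box has several equivalent parameterizations, and picking the one closest to the current prediction avoids large, discontinuous regression targets near angle boundaries. The box format `(cx, cy, w, h, theta_deg)`, the L1 distance, and the function names `equivalent_definitions`, `l1_distance`, `select_target`, and `corners` are all assumptions made for illustration and are not taken from the paper.

```python
import math


def equivalent_definitions(box):
    """Return parameterizations that describe the same rectangle as `box`.

    `box` is (cx, cy, w, h, theta_deg); this format is an assumption.
    Rotating by 180 degrees, or swapping w and h while rotating by
    90 degrees, leaves the rectangle itself unchanged.
    """
    cx, cy, w, h, theta = box
    return [
        (cx, cy, w, h, theta),
        (cx, cy, w, h, theta - 180.0),
        (cx, cy, w, h, theta + 180.0),
        (cx, cy, h, w, theta - 90.0),
        (cx, cy, h, w, theta + 90.0),
    ]


def l1_distance(a, b):
    """Sum of absolute parameter differences (a stand-in regression cost)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def select_target(pred, gt):
    """Pick the ground-truth definition that is closest to the prediction."""
    return min(equivalent_definitions(gt), key=lambda cand: l1_distance(pred, cand))


def corners(box):
    """Corner points of the box, sorted, for checking geometric equality."""
    cx, cy, w, h, theta = box
    c, s = math.cos(math.radians(theta)), math.sin(math.radians(theta))
    offsets = ((w / 2, h / 2), (-w / 2, h / 2), (-w / 2, -h / 2), (w / 2, -h / 2))
    return sorted(
        (round(cx + dx * c - dy * s, 6), round(cy + dx * s + dy * c, 6))
        for dx, dy in offsets
    )


if __name__ == "__main__":
    gt = (50.0, 50.0, 40.0, 10.0, -85.0)   # hypothetical ground-truth box
    pred = (50.0, 50.0, 10.0, 40.0, 2.0)   # hypothetical prediction
    target = select_target(pred, gt)
    print("naive cost:   ", l1_distance(pred, gt))
    print("selected:     ", target)
    print("selected cost:", l1_distance(pred, target))
```

In this hypothetical case the prediction is nearly correct geometrically, yet the naive target differs from it by 30 in width, 30 in height, and 87 degrees in angle. The selected equivalent definition `(50, 50, 10, 40, 5)` describes the same rectangle and differs from the prediction by only 3 degrees.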
Pages: 67-80
Number of pages: 14