MEAD: a Mask-guidEd Anchor-free Detector for oriented aerial object detection

被引:9
作者
He, Zewen [1 ,2 ]
Ren, Zhida [1 ,2 ]
Yang, Xuebing [1 ]
Yang, Yang [1 ]
Zhang, Wensheng [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Oriented aerial object detection; Anchor-free detector; Mask-guided mechanism; Cascade structure;
D O I
10.1007/s10489-021-02570-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection in aerial images is a challenging task due to various orientations of objects and the lack of discriminative features. Existing methods are usually in a dilemma between accuracy and speed. While one-stage anchor-free detectors inference more quickly than two-stage frameworks, their predictions are not as accurate as that of the opposite. This paper proposes a quick and accurate detector, Mask-guidEd Anchor-free Detector (MEAD). It can rapidly locate oriented objects in aerial images by means of per-pixel prediction. Furthermore, it embeds a cascade architecture to locate targets more precisely. To enhance feature discrimination, the mask-guided branch is employed to force features to attend the foreground regions. Comparative experiments are conducted on DOTA and HRSC2016 datasets. The results show that MEAD is better than current state-of-the-art anchor-free detectors, that is, mAP 74.33 on DOTA and 89.83 on HRSC2016.
引用
收藏
页码:4382 / 4397
页数:16
相关论文
共 37 条
[1]   Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery [J].
Azimi, Seyed Majid ;
Vig, Eleonora ;
Bahmanyar, Reza ;
Koerner, Marco ;
Reinartz, Peter .
COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 :150-165
[2]   D2Det: Towards High Quality Object Detection and Instance Segmentation [J].
Cao, Jiale ;
Cholakkal, Hisham ;
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
Pang, Yanwei ;
Shao, Ling .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11482-11491
[3]  
Carion Nicolas, 2020, End-to-End Object Detection with Transformers, P213
[4]   Hybrid Task Cascade for Instance Segmentation [J].
Chen, Kai ;
Pang, Jiangmiao ;
Wang, Jiaqi ;
Xiong, Yu ;
Li, Xiaoxiao ;
Sun, Shuyang ;
Feng, Wansen ;
Liu, Ziwei ;
Shi, Jianping ;
Ouyang, Wanli ;
Loy, Chen Change ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978
[5]  
Chen Kai, 2019, arXiv:1906.07155
[6]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[7]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[8]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
He T., 2019, R3Det: refined single-stage detector with feature refinement for rotating object