A Framework of Maximum Feature Exploration Oriented Remote Sensing Object Detection

被引:2
作者
Li, Yuelong [1 ,2 ]
Xing, Yue [1 ,2 ]
Wang, Zhiwei [1 ,2 ]
Xiao, Tengfei [1 ,2 ]
Song, Qingzeng [1 ,2 ]
Li, Weiwei [1 ,2 ]
Wang, Jianming [1 ,2 ]
机构
[1] Tiangong Univ, Sch Comp Sci & Technol, Tianjin Key Lab Autonomous Intelligence Technol &, Tianjin 300387, Peoples R China
[2] Tiangong Univ, Sch Artificial Intelligence, Tianjin 300387, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Remote sensing; Proposals; Object detection; Correlation; Location awareness; Image segmentation; Aerial images; deep learning; maximum feature exploration; object detection; remote sensing;
D O I
10.1109/LGRS.2022.3228689
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object detection from remote sensing images is an interesting research topic full of theoretical and application values. Compared with generic images, those of remote sensing contain much less distinguishing representation information due to far-away capturing position and limited camera lens resolution, the perpendicular to ground capturing view, almost unacquirable 3-D object structure and serious roof occlusion, and the inherent unimportance of top roof view. Focusing on this intrinsic disadvantage, in this letter, we propose a general framework of accurate remote sensing object localization constructed with the core idea of maximum image feature exploration. This integrated structure is mainly comprised of cascaded tiny patch correlation (CTPC) based feature digging, averaging local patch regression (ALPR) centered coarse location acquisition, and instance segmentation oriented further refinement (ISOFR). All those components are designed to maximize feature digging and usage. We believe this is the first to implement the idea of maximum feature exploration to each corner of object detection architecture. Extensive experiments on challenging aerial image datasets DOTA and NWPU VHR-10 show state-of-the-art performance.
引用
收藏
页数:5
相关论文
共 40 条
[1]   Riggable 3D Face Reconstruction via In-Network Optimization [J].
Bai, Ziqian ;
Cui, Zhaopeng ;
Liu, Xiaoming ;
Tan, Ping .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6212-6221
[2]   D2Det: Towards High Quality Object Detection and Instance Segmentation [J].
Cao, Jiale ;
Cholakkal, Hisham ;
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
Pang, Yanwei ;
Shao, Ling .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11482-11491
[3]   Triply Supervised Decoder Networks for Joint Detection and Segmentation [J].
Cao, Jiale ;
Pang, Yanwei ;
Li, Xuelong .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7384-7393
[4]   Hybrid Task Cascade for Instance Segmentation [J].
Chen, Kai ;
Pang, Jiangmiao ;
Wang, Jiaqi ;
Xiong, Yu ;
Li, Xiaoxiao ;
Sun, Shuyang ;
Feng, Wansen ;
Liu, Ziwei ;
Shi, Jianping ;
Ouyang, Wanli ;
Loy, Chen Change ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4969-4978
[5]   Revisiting RCNN: On Awakening the Classification Power of Faster RCNN [J].
Cheng, Bowen ;
Wei, Yunchao ;
Shi, Honghui ;
Feris, Rogerio ;
Xiong, Jinjun ;
Huang, Thomas .
COMPUTER VISION - ECCV 2018, PT 15, 2018, 11219 :473-490
[6]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[7]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[8]  
Dosovitskiy A., 2021, P INT C LEARN REPR, P437
[9]   Point-Based Estimator for Arbitrary-Oriented Object Detection in Aerial Images [J].
Fu, Kun ;
Chang, Zhonghan ;
Zhang, Yue ;
Sun, Xian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (05) :4370-4387
[10]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448