SAENet: Self-Supervised Adversarial and Equivariant Network for Weakly Supervised Object Detection in Remote Sensing Images

被引:20
作者
Feng, Xiaoxu [1 ]
Yao, Xiwen [1 ]
Cheng, Gong [1 ]
Han, Jungong [2 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
[2] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3FL, Dyfed, Wales
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
Detectors; Proposals; Object detection; Remote sensing; Task analysis; Annotations; Transforms; Multiple instance learning (MIL); remote sensing images (RSIs); self-supervised learning; weakly supervised object detection (WSOD); TARGET DETECTION;
D O I
10.1109/TGRS.2021.3105575
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Weakly supervised object detection (WSOD) in remote sensing images (RSIs) remains a challenge when learning a subtle object detection model with only image-level annotations. Most works tend to optimize the detection model via exploiting the most contributed region, thereby to be dominated by the most discriminative part of an object. Meanwhile, these methods ignore the consistency across different spatial transformations of the same image and always label them with different classes, which introduces potential ambiguities. To tackle these challenges, we propose a unique self-supervised adversarial and equivariant network (SAENet) and aim at learning complementary and consistent visual patterns for WSOD in RSIs. To this end, an adversarial dropout-activation block is first designed to facilitate the entire object detector via adaptively hiding the discriminative parts and highlighting the instance-related regions. Besides, we further introduce a flexible self-supervised transformation equivariance mechanism on each potential instance from multiple spatial transformations to obtain spatially consistent self-supervisions. Accordingly, the obtained supervisions can be leveraged to pursue a more robust and spatially consistent object detector. Comprehensive experiments on the challenging LEarning, VIsion and Remote sensing Laboratory (LEVIR), NorthWestern Polytechnical University (NWPU) VHR-10.v2, and detection in optical RSIs (DIOR) datasets validate that SAENet outperforms the previous state-of-the-art works and achieves 46.2%, 60.7%, and 27.1% mAP, respectively.
引用
收藏
页数:11
相关论文
共 49 条
[1]  
[Anonymous], 2014, COMPUT RES REPOSITOR
[2]   Weakly Supervised Deep Detection Networks [J].
Bilen, Hakan ;
Vedaldi, Andrea .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2846-2854
[3]   High-Quality Proposals for Weakly Supervised Object Detection [J].
Cheng, Gong ;
Yang, Junyu ;
Gao, Decheng ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) :5794-5804
[4]   Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Xu, Dong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) :265-278
[5]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[6]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[7]  
Cheng G, 2015, PROC CVPR IEEE, P1173, DOI 10.1109/CVPR.2015.7298721
[8]   TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images [J].
Feng, Xiaoxu ;
Han, Junwei ;
Yao, Xiwen ;
Cheng, Gong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) :6946-6955
[9]   Progressive Contextual Instance Refinement for Weakly Supervised Object Detection in Remote Sensing Images [J].
Feng, Xiaoxu ;
Han, Junwei ;
Yao, Xiwen ;
Cheng, Gong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (11) :8002-8012
[10]  
Ghiasi G, 2018, ADV NEUR IN, V31