Adaptive Composite Feature Generation for Object Detection in Remote Sensing Images

被引:10
作者
Zhang, Ziye [1 ]
Mei, Shaohui [1 ]
Ma, Mingyang [2 ]
Han, Zonghao [1 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710129, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Informat & Commun Engn, Xian 710049, Shaanxi, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Knowledge transfer; Knowledge distillation (KD); object detection; remote sensing image; self-attention; KNOWLEDGE DISTILLATION;
D O I
10.1109/TGRS.2024.3424295
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object detection in remote sensing images identifies and extracts the acquired Earth surface information, providing data support and research basis for multiple fields. Remote sensing image object detection based on knowledge distillation (KD) can transfer the knowledge of a large teacher model to a smaller student model, achieving the effect of low parameter volume and high accuracy. Mainstream methods directly imitate teacher features to improve student performance, ignoring the generation of high-ranking features through teacher features instructing student feature maps in this knowledge transfer process. In this article, an adaptive composite feature generation (ACFG) strategy is proposed to achieve end-to-end trainable KD for object detection in remote sensing images, in which the robustness of feature points under composite masks is improved through adaptive feature mapping. In particular, a composite mask generator (CMG) module is proposed to select student instance-related features and point background features. Furthermore, a global and local projection layer (GLPL) module is proposed to connect the local information and global information of the feature map under the mask generator to adaptively realize the global recovery mapping of the feature map with partial feature points. Finally, balanced decoupling loss (BDL) is improved to handle foreground and background loss separately, so that the two decoupled features can better enable the student model to learn instance-related information. Note that the proposed ACFG is capable of conducting KD for both single-stage and two-stage object detectors. Experimental results using both anchor-based and anchor-free detectors on the DIOR dataset and DOTA dataset demonstrate that the proposed ACFG clearly achieved better performance than several state-of-the-art (SOTA) algorithms for KD.
引用
收藏
页数:16
相关论文
共 57 条
[1]   Variational Information Distillation for Knowledge Transfer [J].
Ahn, Sungsoo ;
Hu, Shell Xu ;
Damianou, Andreas ;
Lawrence, Neil D. ;
Dai, Zhenwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9155-9163
[2]  
Alvarez JM, 2016, ADV NEUR IN, V29
[3]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[4]   Data-free Knowledge Distillation for Object Detection [J].
Chawla, Akshay ;
Yin, Hongxu ;
Molchanov, Pavlo ;
Alvarez, Jose .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :3288-3297
[5]  
Chen GB, 2017, ADV NEUR IN, V30
[6]  
Chen K, 2019, Arxiv, DOI arXiv:1906.07155
[7]  
Cheng G., 2021, IEEE Trans. Geosci. Remote Sens., V60, P111
[8]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[9]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[10]   Multiscale Deformable Attention and Multilevel Features Aggregation for Remote Sensing Object Detection [J].
Dong, Xiaohu ;
Qin, Yao ;
Fu, Ruigang ;
Gao, Yinghui ;
Liu, Songlin ;
Ye, Yuanxin ;
Li, Biao .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19