Variational Self-Distillation for Remote Sensing Scene Classification

被引:30
作者
Hu, Yutao [1 ]
Huang, Xin [1 ]
Luo, Xiaoyan [2 ]
Han, Jungong [3 ]
Cao, Xianbin [1 ,4 ]
Zhang, Jun [5 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Astronaut, Beijing 100191, Peoples R China
[3] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3FL, Dyfed, Wales
[4] Minist Ind & Informat Technol China, Key Lab Adv Technol Near Space Informat Syst, Beijing 100804, Peoples R China
[5] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国国家自然科学基金;
关键词
Remote sensing; Training; Representation learning; Uncertainty; Perturbation methods; Knowledge transfer; Computational modeling; Class entanglement information; hierarchical knowledge transfer; remote sensing scene classification; self-distillation; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.1109/TGRS.2022.3194549
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Supported by deep learning techniques, remote sensing scene classification, a fundamental task in remote image analysis, has recently obtained remarkable progress. However, due to the severe uncertainty and perturbation within an image, it is still a challenging task and remains many unsolved problems. In this article, we note that regular one-hot labels cannot precisely describe remote sensing images, and they fail to provide enough information for supervision and limiting the discriminative feature learning of the network. To solve this problem, we propose a variational self-distillation network (VSDNet), in which the class entanglement information from the prediction vector acts as the supplement to the category information. Then, the exploited information is hierarchically distilled from the deep layers into the shallow parts via a variational knowledge transfer (VKT) module. Notably, the VKT module performs knowledge distillation in a probabilistic way through variational estimation, which enables end-to-end optimization for mutual information and promotes robustness to uncertainty within the image. Extensive experiments on four challenging remote sensing datasets demonstrate that, with a negligible parameter increase, the proposed VSDNet brings a significant performance improvement over different backbone networks and delivers state-of-the-art results.
引用
收藏
页数:13
相关论文
共 57 条
[1]  
Alemi AA, 2019, Arxiv, DOI arXiv:1612.00410
[2]   Variational Information Distillation for Knowledge Transfer [J].
Ahn, Sungsoo ;
Hu, Shell Xu ;
Damianou, Andreas ;
Lawrence, Neil D. ;
Dai, Zhenwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9155-9163
[3]  
Belghazi MI, 2018, PR MACH LEARN RES, V80
[4]   Compact Cloud Detection with Bidirectional Self-Attention Knowledge Distillation [J].
Chai, Yajie ;
Fu, Kun ;
Sun, Xian ;
Diao, Wenhui ;
Yan, Zhiyuan ;
Feng, Yingchao ;
Wang, Lei .
REMOTE SENSING, 2020, 12 (17)
[5]   Contextual Information-Preserved Architecture Learning for Remote-Sensing Scene Classification [J].
Chen, Jie ;
Huang, Haozhe ;
Peng, Jian ;
Zhu, Jiawei ;
Chen, Li ;
Tao, Chao ;
Li, Haifeng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[6]   Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities [J].
Cheng, Gong ;
Xie, Xingxing ;
Han, Junwei ;
Guo, Lei ;
Xia, Gui-Song .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :3735-3756
[7]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[8]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]  
Hjelm RD, 2019, Arxiv, DOI arXiv:1808.06670