Few-Shot Scene Classification of Optical Remote Sensing Images Leveraging Calibrated Pretext Tasks

被引:30
作者
Ji, Hong [1 ]
Gao, Zhi [1 ]
Zhang, Yongjun [1 ]
Wan, Yu [1 ]
Li, Can [1 ]
Mei, Tiancan [2 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国国家自然科学基金;
关键词
Task analysis; Training; Remote sensing; Image analysis; Optical sensors; Optical imaging; Data models; Adversarial model perturbation (AMP); few-shot scene classification; multitask learning; optical remote sensing image; pretext task;
D O I
10.1109/TGRS.2022.3184080
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Small data hold big artificial intelligence (AI) potential. As one of the promising small data AI approaches, few-shot learning has the goal to learn a model efficiently that can recognize novel classes with extremely limited training samples. Therefore, it is critical to accumulate useful prior knowledge obtained from large-scale base class dataset. To realize few-shot scene classification of optical remote sensing images, we start from a baseline model that trains all base classes using a standard cross-entropy loss leveraging two auxiliary objectives to capture intrinsical characteristics across the semantic classes. Specifically, rotation prediction learns to recognize the 2-D rotation of an input to guide the learning of class-transferable knowledge, and contrastive learning aims to pull together the positive pairs while pushing apart the negative pairs to promote intraclass consistency and interclass inconsistency. We jointly optimize two such pretext tasks and semantic class prediction task in an end-to-end manner. To further overcome the overfitting issue, we introduce a regularization technique, adversarial model perturbation, to calibrate the pretext tasks so as to enhance the generalization ability. Extensive experiments on public remote sensing benchmarks including Northwestern Polytechnical University (NWPU)-RESISC45, aerial image dataset (AID), and Wuhan University (WHU)-remote sensing (RS)-19 demonstrate that our method works effectively and achieves best performance that significantly outperforms many state-of-the-art approaches.
引用
收藏
页数:13
相关论文
共 65 条
[1]  
[Anonymous], 1999, An Overview of Statistical Learning Theory
[2]  
[Anonymous], 2019, P INT C MACH LEARN
[3]  
Chen T, 2020, PR MACH LEARN RES, V119
[4]  
Chen W.Y., 2019, INT C LEARNING REPRE
[5]  
Chen Y., 2021, P IEEE CVF INT C COM, V2021, P9062, DOI [DOI 10.1109/ICCV48922.2021.00893, 10.1109/iccv48922.2021.00893]
[6]   SPNet: Siamese-Prototype Network for Few-Shot Remote Sensing Image Scene Classification [J].
Cheng, Gong ;
Cai, Liming ;
Lang, Chunbo ;
Yao, Xiwen ;
Chen, Jinyong ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[8]   Satellite Image Classification via Two-Layer Sparse Coding With Biased Image Representation [J].
Dai, Dengxin ;
Yang, Wen .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2011, 8 (01) :173-176
[9]  
Deng WJ, 2021, PR MACH LEARN RES, V139
[10]  
Finn C, 2017, PR MACH LEARN RES, V70