Remote Sensing Image Scene Classification with Self-Supervised Learning Based on Partially Unlabeled Datasets

被引:12
作者
Chen, Xiliang [1 ]
Zhu, Guobin [1 ]
Liu, Mingqing [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
self-supervised learning; vision transformer; random mask; remote sensing image scene classification; unlabeled datasets; CONVOLUTIONAL NEURAL-NETWORKS; OBJECT DETECTION; SCALE; REPRESENTATION; ATTENTION;
D O I
10.3390/rs14225838
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In recent years, supervised learning, represented by deep learning, has shown good performance in remote sensing image scene classification with its powerful feature learning ability. However, this method requires large-scale and high-quality handcrafted labeled datasets, which leads to a high cost of obtaining annotated samples. Self-supervised learning can alleviate this problem by using unlabeled data to learn the image's feature representation and then migrate to the downstream task. In this study, we use an encoder-decoder structure to construct a self-supervised learning architecture. In the encoding stage, the image mask is used to discard some of the image patches randomly, and the image's feature representation can be learned from the remaining image patches. In the decoding stage, the lightweight decoder is used to recover the pixels of the original image patches according to the features learned in the encoding stage. We constructed a large-scale unlabeled training set using several public scene classification datasets and Gaofen-2 satellite data to train the self-supervised learning model. In the downstream task, we use the encoder structure with the masked image patches that have been removed as the backbone network of the scene classification task. Then, we fine-tune the pre-trained weights of self-supervised learning in the encoding stage on two open datasets with complex scene categories. The datasets include NWPU-RESISC45 and AID. Compared with other mainstream supervised learning methods and self-supervised learning methods, our proposed method has better performance than the most state-of-the-art methods in the task of remote sensing image scene classification.
引用
收藏
页数:23
相关论文
共 76 条
[1]  
Aral RA, 2018, IEEE INT CONF BIG DA, P2058, DOI 10.1109/BigData.2018.8622212
[2]   Simple Yet Effective Fine-Tuning of Deep CNNs Using an Auxiliary Classification Loss for Remote Sensing Scene Classification [J].
Bazi, Yakoub ;
Al Rahhal, Mohamad M. ;
Alhichri, Haikel ;
Alajlan, Naif .
REMOTE SENSING, 2019, 11 (24)
[3]   Which is the best way to organize/classify images by content? [J].
Bosch, Anna ;
Munoz, Xavier ;
Marti, Robert .
IMAGE AND VISION COMPUTING, 2007, 25 (06) :778-791
[4]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[5]  
Chen J., 2018, SCI CHINA EARTH SCI, V48, P1391
[6]   Bag-of-Visual-Words Scene Classifier for Remote Sensing Image Based on Region Covariance [J].
Chen, Xiliang ;
Zhu, Guobin ;
Liu, Mingqing .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[7]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[8]   Remote Sensing Image Scene Classification Using Bag of Convolutional Features [J].
Cheng, Gong ;
Li, Zhenpeng ;
Yao, Xiwen ;
Guo, Lei ;
Wei, Zhongliang .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (10) :1735-1739
[9]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[10]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28