A self-supervised pre-training scheme for multi-source heterogeneous remote sensing image land cover classification

被引:0
|
作者
Xue Z. [1 ,2 ]
Yu X. [3 ]
Liu J. [4 ]
Yang G. [2 ]
Liu B. [1 ]
Yu A. [1 ]
Zhou J. [5 ]
Jin S. [5 ]
机构
[1] Information Engineering University, Zhengzhou
[2] Beijing Aviation Meteorological Institute, Beijing
[3] North China University of Water Resources and Electric Povjer, Zhengzhou
[4] Troops 93110,, Beijing
[5] Troops 93116,, Shenyang
来源
Cehui Xuebao/Acta Geodaetica et Cartographica Sinica | 2024年 / 53卷 / 03期
关键词
land cover classification; multi-source heterogeneous data; pre-training; remote sensing; self-supervised learning;
D O I
10.11947/j.AGCS.2024.20220553
中图分类号
学科分类号
摘要
Deep learning has revolutionized the remote sensing image processing techniques over the past few years. Nevertheless , it is laborious to annotate high quality samples, thus limiting the performance of deep networks because of insufficient supervision information. To resolve this contradiction, we investigate the self-supervised pre-training and fine-tuning paradigm for multi-source heterogeneous remote sensing image land cover classification, aiming to relieve the urgent need for manually annotated data. Specifically, the proposed generative feature learning model consists of asymmetric encoder-decoder structure, in which the deep encoder extracts high-level key characteristics contained in multi-source data and task-specific lightweight decoders are developed to reconstruct original data. To further improve the feature representation capability, the cross-attention layers are utilized to exchange information contained in heterogeneous characteristics, thus learning more complementary information from multi-source remote sensing data. In fine-tuning stage, the trained encoder is employed as unsupervised feature extractor, and learned features are utilized for land cover classification through the designed lightweight Transformer based classifier. This self-supervised pre-training architecture is capable of learning high-level key features from multi-source hetero-genous remote sensing images, and this process does not require any labeled information, thus relieving the urgent need for labeled samples. Compared with existing classification paradigms, the proposed multimodal self-supervised pre-training and fine-tuning scheme achieves superior performance for remote sensing image classification. © 2024 SinoMaps Press. All rights reserved.
引用
收藏
页码:512 / 525
页数:13
相关论文
共 39 条
  • [1] HONG Danfeng, HU Jingliang, YAO Jing, Et al., Multimodal remote sensing benchmark datasets for land cover classification with a shared and specific feature learning model[J], ISPRS Journal of Photogrammetry and Remote Sensing:Official Publication of the International Society for Photogrammetry and Remote Sensing (ISPRS), 178, pp. 68-80, (2021)
  • [2] LEI Lei, WANG Xinyu, ZHONG Yanfei, Et al., DOCC:deep one-class crop classification via positive and unlabeled learning for multi-modal satellite imagery[J], International Journal of Applied Earth Observation and Geoinformation, 105, (2021)
  • [3] ZHANG Liangpei, HE Jiang, YANG Qianqian, Et al., Data-driven multi-source remote sensing data fusion:progress and challenges[J], Acta Geodaetica et Cartographica Sinica, 51, 7, pp. 1317-1337, (2022)
  • [4] SHI Beiqi, LIU Chun, SUN Weiwei, Et al., Sparse nonnegative matrix factorization for hyperspectral optimal band selection[J], Acta Geodaetica et Cartographica Sinica, 42, 3, pp. 351-358, (2013)
  • [5] YU Anzhu, LIU Bing, XING Zhipeng, Et al., Salient feature extraction method for hyperspectral image classification[J], Acta Geodaetica et Cartographica Sinica, 48, 8, pp. 985-995, (2019)
  • [6] ZHANG Liangpei, LI Jiayi, Development and prospect of sparse representation-based hyperspectral image processing and analysis[J], Journal of Remote Sensing, 20, 5, pp. 1091-1101, (2016)
  • [7] CHEN Yushi, JIANG Hanlu, LI Chunyang, Et al., Deep feature extraction and classification of hyperspectral images based on convolutional neural networks[J], IEEE Transactions on Geoscience and Remote Sensing, 54, 10, pp. 6232-6251, (2016)
  • [8] LEE H, KWON H., Going deeper with contextual CNN for hyperspectral image classification[J], IEEE Transactions on Image Processing:a Publication of the IEEE Signal Processing Society, 26, 10, pp. 4843-4855, (2017)
  • [9] WANG Kexian, ZHENG Shunyi, LI Rui, Et al., A deep double-channel dense network for hyperspectral image classification[J], Journal of Geodesy and Geoinformation Science, 4, 4, pp. 46-62, (2021)
  • [10] Lichao MOU, GHAMISIP, ZHU Xiaoxiang, Deep recurrent neural networks for hyperspectral image classification[J], IEEE Transactions on Geoscience and Remote Sensing, 55, 7, pp. 3639-3655, (2017)