A self-supervised pre-training scheme for multi-source heterogeneous remote sensing image land cover classification

被引：0

作者：

Xue Z. ^{[1
,2
]}

Yu X. ^{[3
]}

Liu J. ^{[4
]}

Yang G. ^{[2
]}

Liu B. ^{[1
]}

Yu A. ^{[1
]}

Zhou J. ^{[5
]}

Jin S. ^{[5
]}

机构：

[1] Information Engineering University, Zhengzhou

[2] Beijing Aviation Meteorological Institute, Beijing

[3] North China University of Water Resources and Electric Povjer, Zhengzhou

[4] Troops 93110,, Beijing

[5] Troops 93116,, Shenyang

来源：

Cehui Xuebao/Acta Geodaetica et Cartographica Sinica | 2024年 / 53卷 / 03期

关键词：

land cover classification; multi-source heterogeneous data; pre-training; remote sensing; self-supervised learning;

D O I：

10.11947/j.AGCS.2024.20220553

中图分类号：

学科分类号：

摘要：

Deep learning has revolutionized the remote sensing image processing techniques over the past few years. Nevertheless , it is laborious to annotate high quality samples, thus limiting the performance of deep networks because of insufficient supervision information. To resolve this contradiction, we investigate the self-supervised pre-training and fine-tuning paradigm for multi-source heterogeneous remote sensing image land cover classification, aiming to relieve the urgent need for manually annotated data. Specifically, the proposed generative feature learning model consists of asymmetric encoder-decoder structure, in which the deep encoder extracts high-level key characteristics contained in multi-source data and task-specific lightweight decoders are developed to reconstruct original data. To further improve the feature representation capability, the cross-attention layers are utilized to exchange information contained in heterogeneous characteristics, thus learning more complementary information from multi-source remote sensing data. In fine-tuning stage, the trained encoder is employed as unsupervised feature extractor, and learned features are utilized for land cover classification through the designed lightweight Transformer based classifier. This self-supervised pre-training architecture is capable of learning high-level key features from multi-source hetero-genous remote sensing images, and this process does not require any labeled information, thus relieving the urgent need for labeled samples. Compared with existing classification paradigms, the proposed multimodal self-supervised pre-training and fine-tuning scheme achieves superior performance for remote sensing image classification. © 2024 SinoMaps Press. All rights reserved.

引用

页码：512 / 525

页数：13

共 39 条

[1]

HONG Danfeng, HU Jingliang, YAO Jing, Et al., Multimodal remote sensing benchmark datasets for land cover classification with a shared and specific feature learning model[J], ISPRS Journal of Photogrammetry and Remote Sensing:Official Publication of the International Society for Photogrammetry and Remote Sensing (ISPRS), 178, pp. 68-80, (2021)

[2]

LEI Lei, WANG Xinyu, ZHONG Yanfei, Et al., DOCC:deep one-class crop classification via positive and unlabeled learning for multi-modal satellite imagery[J], International Journal of Applied Earth Observation and Geoinformation, 105, (2021)

[3]

ZHANG Liangpei, HE Jiang, YANG Qianqian, Et al., Data-driven multi-source remote sensing data fusion:progress and challenges[J], Acta Geodaetica et Cartographica Sinica, 51, 7, pp. 1317-1337, (2022)

[4]

SHI Beiqi, LIU Chun, SUN Weiwei, Et al., Sparse nonnegative matrix factorization for hyperspectral optimal band selection[J], Acta Geodaetica et Cartographica Sinica, 42, 3, pp. 351-358, (2013)

[5]

YU Anzhu, LIU Bing, XING Zhipeng, Et al., Salient feature extraction method for hyperspectral image classification[J], Acta Geodaetica et Cartographica Sinica, 48, 8, pp. 985-995, (2019)

[6]

ZHANG Liangpei, LI Jiayi, Development and prospect of sparse representation-based hyperspectral image processing and analysis[J], Journal of Remote Sensing, 20, 5, pp. 1091-1101, (2016)

[7]

CHEN Yushi, JIANG Hanlu, LI Chunyang, Et al., Deep feature extraction and classification of hyperspectral images based on convolutional neural networks[J], IEEE Transactions on Geoscience and Remote Sensing, 54, 10, pp. 6232-6251, (2016)

[8]

LEE H, KWON H., Going deeper with contextual CNN for hyperspectral image classification[J], IEEE Transactions on Image Processing:a Publication of the IEEE Signal Processing Society, 26, 10, pp. 4843-4855, (2017)

[9]

WANG Kexian, ZHENG Shunyi, LI Rui, Et al., A deep double-channel dense network for hyperspectral image classification[J], Journal of Geodesy and Geoinformation Science, 4, 4, pp. 46-62, (2021)

[10]

Lichao MOU, GHAMISIP, ZHU Xiaoxiang, Deep recurrent neural networks for hyperspectral image classification[J], IEEE Transactions on Geoscience and Remote Sensing, 55, 7, pp. 3639-3655, (2017)

← 1 2 3 4 →