SS-MAE: Spatial–Spectral Masked Autoencoder for Multisource Remote Sensing Image Classification

被引：31

作者：

Lin, Junyan ^{[1
]}

Gao, Feng ^{[1
]}

Shi, Xiaochen ^{[1
]}

Dong, Junyu ^{[1
]}

Du, Qian ^{[2
]}

机构：

[1] Ocean Univ China, Sch Comp Sci & Technol, Qingdao 266100, Peoples R China

[2] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷

关键词：

Image reconstruction; Feature extraction; Transformers; Image classification; Training; Decoding; Self-supervised learning; Deep learning; hyperspectral image (HSI); masked autoencoder (MAE); multisource data; DECISION FUSION;

D O I：

10.1109/TGRS.2023.3331717

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Masked image modeling (MIM) is a highly popular and effective self-supervised learning method for image understanding. The existing MIM-based methods mostly focus on spatial feature modeling, neglecting spectral feature modeling. Meanwhile, the existing MIM-based methods use Transformer for feature extraction, and some local or high-frequency information may get lost. To this end, we propose a spatial-spectral masked autoencoder (SS-MAE) for hyperspectral image (HSI) and light detection and ranging (LiDAR)/synthetic aperture radar (SAR) data joint classification. Specifically, SS-MAE consists of a spatialwise branch and a spectralwise branch. The spatialwise branch masks random patches and reconstructs missing pixels, while the spectralwise branch masks random spectral channels and reconstructs missing channels. Our SS-MAE fully exploits the spatial and spectral representations of the input data. Furthermore, to complement local features in the training stage, we add two lightweight convolutional nerual networks (CNNs) for feature extraction. Both global and local features are taken into account for feature modeling. To demonstrate the effectiveness of the proposed SS-MAE, we conduct extensive experiments on three publicly available datasets. Extensive experiments on three multisource datasets verify the superiority of our SS-MAE compared with several state-of-the-art baselines. The source codes are available at https://github.com/summitgao/SS-MAE.

引用

页码：1 / 14

页数：14

共 50 条

[21] DAE-GSP: Discriminative Autoencoder With Gaussian Selective Patch for Multimodal Remote Sensing Image Classification
Li, Mengchang
Feng, Zhixi
Yang, Shuyuan
Ma, Yue
Song, Liangliang
Chen, Shuai
Jiao, Licheng
Zhang, Junkai
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
[22] Distribution-Independent Domain Generalization for Multisource Remote Sensing Classification
Gao, Yunhao
Zhang, Mengmeng
Li, Wei
Tao, Ran
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[23] Adaptive Masked Autoencoder Transformer for image classification
Chen, Xiangru
Liu, Chenjing
Hu, Peng
Lin, Jie
Gong, Yunhong
Chen, Yingke
Peng, Dezhong
Geng, Xue
APPLIED SOFT COMPUTING, 2024, 164
[24] Spectral and Multi-spatial-feature based deep learning for hyperspectral remote sensing image classification
Chen, Chen
Zhang, JingJing
Li, Teng
Yan, Qing
Xun, LiNa
PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), 2018, : 421 - 426
[25] Hyperspectral Image Classification Based on Stacked Contractive Autoencoder Combined With Adaptive Spectral-Spatial Information
Guo, Pengyue
Liu, Zhenbing
Lu, Haoxiang
Wang, Zimin
IEEE ACCESS, 2021, 9 : 96404 - 96415
[26] Multihead Global Attention and Spatial Spectral Information Fusion for Remote Sensing Image Compression
Shi, Cuiping
Shi, Kaijie
Zhu, Fei
Zeng, Zexin
Wang, Liguo
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 999 - 1015
[27] A Multimodal Unified Representation Learning Framework With Masked Image Modeling for Remote Sensing Images
Du, Dakuan
Liu, Tianzhu
Gu, Yanfeng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[28] A Spatial-Spectral Prototypical Network for Hyperspectral Remote Sensing Image
Tang, Haojin
Li, Yanshan
Han, Xiao
Huang, Qinghua
Xie, Weixin
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (01) : 167 - 171
[29] Exploring Transformer and Multilabel Classification for Remote Sensing Image Captioning
Kandala, Hitesh
Saha, Sudipan
Banerjee, Biplab
Zhu, Xiao Xiang
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[30] A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification
Li, Heng-Chao
Hu, Wen-Shuai
Li, Wei
Li, Jun
Du, Qian
Plaza, Antonio
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) : 747 - 761

← 1 2 3 4 5 →