SS-MAE: Spatial–Spectral Masked Autoencoder for Multisource Remote Sensing Image Classification

被引:31
|
作者
Lin, Junyan [1 ]
Gao, Feng [1 ]
Shi, Xiaochen [1 ]
Dong, Junyu [1 ]
Du, Qian [2 ]
机构
[1] Ocean Univ China, Sch Comp Sci & Technol, Qingdao 266100, Peoples R China
[2] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
关键词
Image reconstruction; Feature extraction; Transformers; Image classification; Training; Decoding; Self-supervised learning; Deep learning; hyperspectral image (HSI); masked autoencoder (MAE); multisource data; DECISION FUSION;
D O I
10.1109/TGRS.2023.3331717
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Masked image modeling (MIM) is a highly popular and effective self-supervised learning method for image understanding. The existing MIM-based methods mostly focus on spatial feature modeling, neglecting spectral feature modeling. Meanwhile, the existing MIM-based methods use Transformer for feature extraction, and some local or high-frequency information may get lost. To this end, we propose a spatial-spectral masked autoencoder (SS-MAE) for hyperspectral image (HSI) and light detection and ranging (LiDAR)/synthetic aperture radar (SAR) data joint classification. Specifically, SS-MAE consists of a spatialwise branch and a spectralwise branch. The spatialwise branch masks random patches and reconstructs missing pixels, while the spectralwise branch masks random spectral channels and reconstructs missing channels. Our SS-MAE fully exploits the spatial and spectral representations of the input data. Furthermore, to complement local features in the training stage, we add two lightweight convolutional nerual networks (CNNs) for feature extraction. Both global and local features are taken into account for feature modeling. To demonstrate the effectiveness of the proposed SS-MAE, we conduct extensive experiments on three publicly available datasets. Extensive experiments on three multisource datasets verify the superiority of our SS-MAE compared with several state-of-the-art baselines. The source codes are available at https://github.com/summitgao/SS-MAE.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 50 条
  • [21] DAE-GSP: Discriminative Autoencoder With Gaussian Selective Patch for Multimodal Remote Sensing Image Classification
    Li, Mengchang
    Feng, Zhixi
    Yang, Shuyuan
    Ma, Yue
    Song, Liangliang
    Chen, Shuai
    Jiao, Licheng
    Zhang, Junkai
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [22] Distribution-Independent Domain Generalization for Multisource Remote Sensing Classification
    Gao, Yunhao
    Zhang, Mengmeng
    Li, Wei
    Tao, Ran
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [23] Adaptive Masked Autoencoder Transformer for image classification
    Chen, Xiangru
    Liu, Chenjing
    Hu, Peng
    Lin, Jie
    Gong, Yunhong
    Chen, Yingke
    Peng, Dezhong
    Geng, Xue
    APPLIED SOFT COMPUTING, 2024, 164
  • [24] Spectral and Multi-spatial-feature based deep learning for hyperspectral remote sensing image classification
    Chen, Chen
    Zhang, JingJing
    Li, Teng
    Yan, Qing
    Xun, LiNa
    PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), 2018, : 421 - 426
  • [25] Hyperspectral Image Classification Based on Stacked Contractive Autoencoder Combined With Adaptive Spectral-Spatial Information
    Guo, Pengyue
    Liu, Zhenbing
    Lu, Haoxiang
    Wang, Zimin
    IEEE ACCESS, 2021, 9 : 96404 - 96415
  • [26] Multihead Global Attention and Spatial Spectral Information Fusion for Remote Sensing Image Compression
    Shi, Cuiping
    Shi, Kaijie
    Zhu, Fei
    Zeng, Zexin
    Wang, Liguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 999 - 1015
  • [27] A Multimodal Unified Representation Learning Framework With Masked Image Modeling for Remote Sensing Images
    Du, Dakuan
    Liu, Tianzhu
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [28] A Spatial-Spectral Prototypical Network for Hyperspectral Remote Sensing Image
    Tang, Haojin
    Li, Yanshan
    Han, Xiao
    Huang, Qinghua
    Xie, Weixin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (01) : 167 - 171
  • [29] Exploring Transformer and Multilabel Classification for Remote Sensing Image Captioning
    Kandala, Hitesh
    Saha, Sudipan
    Banerjee, Biplab
    Zhu, Xiao Xiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [30] A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification
    Li, Heng-Chao
    Hu, Wen-Shuai
    Li, Wei
    Li, Jun
    Du, Qian
    Plaza, Antonio
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) : 747 - 761