MAPM:PolSAR Image Classification with Masked Autoencoder Based on Position Prediction and Memory Tokens

被引:0
作者
Wang, Jianlong [1 ]
Li, Yingying [1 ]
Quan, Dou [2 ]
Hou, Beibei [1 ]
Wang, Zhensong [1 ]
Sima, Haifeng [3 ]
Sun, Junding [1 ]
机构
[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo 454003, Peoples R China
[2] Xidian Univ, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[3] Henan Polytech Univ, Sch Software, Jiaozuo 454003, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
polarimetric SAR; masked autoencoder; position prediction; <italic>L</italic>1 loss; memory tokens; ABSOLUTE ERROR MAE; COVER; MODEL; RMSE;
D O I
10.3390/rs16224280
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Deep learning methods have shown significant advantages in polarimetric synthetic aperture radar (PolSAR) image classification. However, their performances rely on a large number of labeled data. To alleviate this problem, this paper proposes a PolSAR image classification method with a Masked Autoencoder based on Position prediction and Memory tokens (MAPM). First, MAPM designs a Masked Autoencoder (MAE) based on the transformer for pre-training, which can boost feature learning and improve classification results based on the number of labeled samples. Secondly, since the transformer is relatively insensitive to the order of the input tokens, a position prediction strategy is introduced in the encoder part of the MAE. It can effectively capture subtle differences and discriminate complex, blurry boundaries in PolSAR images. In the fine-tuning stage, the addition of learnable memory tokens can improve classification performance. In addition, L1 loss is used for MAE optimization to enhance the robustness of the model to outliers in PolSAR data. Experimental results show the effectiveness and advantages of the proposed MAPM in PolSAR image classification. Specifically, MAPM achieves performance gains of about 1% in classification accuracy compared with existing methods.
引用
收藏
页数:28
相关论文
共 40 条
  • [31] A Dual-Tree Complex Wavelet Transform Based Complex-Valued Convolutional Neural Network for PolSAR Image Classification
    Liu, Lu
    2024 5TH INTERNATIONAL CONFERENCE ON GEOLOGY, MAPPING AND REMOTE SENSING, ICGMRS 2024, 2024, : 15 - 18
  • [32] UNSUPERVISED POLSAR IMAGE CLASSIFICATION USING BOUNDARY-PRESERVING REGION DIVISION AND REGION-BASED AFFINITY PROPAGATION CLUSTERING
    Hou, Biao
    Jiang, Yuheng
    Ren, Bo
    Wen, Zaidao
    Wang, Shuang
    Jiao, Licheng
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 5103 - 5106
  • [33] Reg-Superpixel Guided Convolutional Neural Network of PolSAR Image Classification Based on Feature Selection and Receptive Field Reconstruction
    Shang, Ronghua
    Zhu, Keyao
    Feng, Jie
    Wang, Chao
    Jiao, Licheng
    Xu, Songhua
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 4312 - 4327
  • [34] Real-Time Interpretation Method for Shooting-Range Image Based on Position Prediction
    Zhong, Lijun
    Yu, Qifeng
    Zhou, Jiexin
    Zhang, Xiaohu
    Lu, Yani
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 68 - 80
  • [35] MAE-EEG-Transformer: A transformer-based approach combining masked autoencoder and cross-individual data augmentation pre-training for EEG classification
    Cai, Miao
    Zeng, Yu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94
  • [36] VAE-TALSTM: a temporal attention and variational autoencoder-based long short-term memory framework for dam displacement prediction
    Shu, Xiaosong
    Bao, Tengfei
    Li, Yangtao
    Gong, Jian
    Zhang, Kang
    ENGINEERING WITH COMPUTERS, 2022, 38 (04) : 3497 - 3512
  • [37] Position Prediction Based on Empirical Mode Decomposition and Long Short-term Memory Under Global Navigation Satellite System Outages
    Min H.-G.
    Fang Y.-K.
    Wu X.
    Xu Z.-G.
    Zhao X.-M.
    Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2021, 34 (07): : 128 - 139
  • [38] Dynamic classification and attention mechanism-based bidirectional long short-term memory network for daily runoff prediction in Aksu River basin, Northwest China
    Wei, Qing
    Yang, Ju
    Fu, Fangbing
    Xue, Lianqing
    APPLIED MATHEMATICS AND COMPUTATION, 2025, 494
  • [39] Dynamic classification and attention mechanism-based bidirectional long short-term memory network for daily runoff prediction in Aksu River basin, Northwest China
    Wei, Qing
    Yang, Ju
    Fu, Fangbing
    Xue, Lianqing
    APPLIED MATHEMATICS AND COMPUTATION, 2025, 494
  • [40] Dynamic classification and attention mechanism-based bidirectional long short-term memory network for daily runoff prediction in Aksu River basin, Northwest China
    Wei, Qing
    Yang, Ju
    Fu, Fangbing
    Xue, Lianqing
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2025, 374