MAPM:PolSAR Image Classification with Masked Autoencoder Based on Position Prediction and Memory Tokens

被引:0
作者
Wang, Jianlong [1 ]
Li, Yingying [1 ]
Quan, Dou [2 ]
Hou, Beibei [1 ]
Wang, Zhensong [1 ]
Sima, Haifeng [3 ]
Sun, Junding [1 ]
机构
[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo 454003, Peoples R China
[2] Xidian Univ, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[3] Henan Polytech Univ, Sch Software, Jiaozuo 454003, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
polarimetric SAR; masked autoencoder; position prediction; <italic>L</italic>1 loss; memory tokens; ABSOLUTE ERROR MAE; COVER; MODEL; RMSE;
D O I
10.3390/rs16224280
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Deep learning methods have shown significant advantages in polarimetric synthetic aperture radar (PolSAR) image classification. However, their performances rely on a large number of labeled data. To alleviate this problem, this paper proposes a PolSAR image classification method with a Masked Autoencoder based on Position prediction and Memory tokens (MAPM). First, MAPM designs a Masked Autoencoder (MAE) based on the transformer for pre-training, which can boost feature learning and improve classification results based on the number of labeled samples. Secondly, since the transformer is relatively insensitive to the order of the input tokens, a position prediction strategy is introduced in the encoder part of the MAE. It can effectively capture subtle differences and discriminate complex, blurry boundaries in PolSAR images. In the fine-tuning stage, the addition of learnable memory tokens can improve classification performance. In addition, L1 loss is used for MAE optimization to enhance the robustness of the model to outliers in PolSAR data. Experimental results show the effectiveness and advantages of the proposed MAPM in PolSAR image classification. Specifically, MAPM achieves performance gains of about 1% in classification accuracy compared with existing methods.
引用
收藏
页数:28
相关论文
共 40 条
  • [21] PolSAR Image Classification Using a Superpixel-Based Composite Kernel and Elastic Net
    Cao, Yice
    Wu, Yan
    Li, Ming
    Liang, Wenkai
    Zhang, Peng
    REMOTE SENSING, 2021, 13 (03) : 1 - 24
  • [22] A self-supervised learning framework based on masked autoencoder for complex wafer bin map classification
    Wang, Yi
    Ni, Dong
    Huang, Zhenyu
    Chen, Puyang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [23] Interpretable POLSAR Image Classification Based on Adaptive-Dimension Feature Space Decision Tree
    Yin, Qiang
    Cheng, Jianda
    Zhang, Fan
    Zhou, Yongsheng
    Shao, Luyi
    Hong, Wen
    IEEE ACCESS, 2020, 8 : 173826 - 173837
  • [24] A Three-Component Fisher-Based Feature Weighting Method for Supervised PolSAR Image Classification
    Chen, Bo
    Wang, Shuang
    Jiao, Licheng
    Stolkin, Rustam
    Liu, Hongying
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (04) : 731 - 735
  • [25] GAF-MAE: A Self-Supervised Automatic Modulation Classification Method Based on Gramian Angular Field and Masked Autoencoder
    Shi, Yunhao
    Xu, Hua
    Zhang, Yue
    Qi, Zisen
    Wang, Dan
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (01) : 94 - 106
  • [26] POL-SAR Image Classification Based on Modified Stacked Autoencoder Network and Data Distribution
    Wang, Jianlong
    Hou, Biao
    Jiao, Licheng
    Wang, Shuang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (03): : 1678 - 1695
  • [28] Multimodal Imputation-Based Multimodal Autoencoder Framework for AQI Classification and Prediction of Indian Cities
    Rao, Routhu Srinivasa
    Kalabarige, Lakshmana Rao
    Holla, M. Raviraja
    Sahu, Aditya Kumar
    IEEE ACCESS, 2024, 12 : 108350 - 108363
  • [29] PolSAR Image Semantic Segmentation Based on Deep Transfer Learning-Realizing Smooth Classification With Small Training Sets
    Wu, Weijin
    Li, Hailei
    Li, Xinwu
    Guo, Huadong
    Zhang, Lu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (06) : 977 - 981
  • [30] Real-time interpretation method for range image based on position prediction
    Zhong L.
    Yu Q.
    Zhou J.
    Guo P.
    Huang W.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2020, 42 (02): : 85 - 91