MAPM:PolSAR Image Classification with Masked Autoencoder Based on Position Prediction and Memory Tokens

被引：0

作者：

Wang, Jianlong ^{[1
]}

Li, Yingying ^{[1
]}

Quan, Dou ^{[2
]}

Hou, Beibei ^{[1
]}

Wang, Zhensong ^{[1
]}

Sima, Haifeng ^{[3
]}

Sun, Junding ^{[1
]}

机构：

[1] Henan Polytech Univ, Sch Comp Sci & Technol, Jiaozuo 454003, Peoples R China

[2] Xidian Univ, Sch Artificial Intelligence, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China

[3] Henan Polytech Univ, Sch Software, Jiaozuo 454003, Peoples R China

来源：

REMOTE SENSING | 2024年 / 16卷 / 22期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

polarimetric SAR; masked autoencoder; position prediction; <italic>L</italic>1 loss; memory tokens; ABSOLUTE ERROR MAE; COVER; MODEL; RMSE;

D O I：

10.3390/rs16224280

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Deep learning methods have shown significant advantages in polarimetric synthetic aperture radar (PolSAR) image classification. However, their performances rely on a large number of labeled data. To alleviate this problem, this paper proposes a PolSAR image classification method with a Masked Autoencoder based on Position prediction and Memory tokens (MAPM). First, MAPM designs a Masked Autoencoder (MAE) based on the transformer for pre-training, which can boost feature learning and improve classification results based on the number of labeled samples. Secondly, since the transformer is relatively insensitive to the order of the input tokens, a position prediction strategy is introduced in the encoder part of the MAE. It can effectively capture subtle differences and discriminate complex, blurry boundaries in PolSAR images. In the fine-tuning stage, the addition of learnable memory tokens can improve classification performance. In addition, L1 loss is used for MAE optimization to enhance the robustness of the model to outliers in PolSAR data. Experimental results show the effectiveness and advantages of the proposed MAPM in PolSAR image classification. Specifically, MAPM achieves performance gains of about 1% in classification accuracy compared with existing methods.

引用

页数：28

共 40 条

[21] PolSAR Image Classification Using a Superpixel-Based Composite Kernel and Elastic Net
Cao, Yice
Wu, Yan
Li, Ming
Liang, Wenkai
Zhang, Peng
REMOTE SENSING, 2021, 13 (03) : 1 - 24
[22] A self-supervised learning framework based on masked autoencoder for complex wafer bin map classification
Wang, Yi
Ni, Dong
Huang, Zhenyu
Chen, Puyang
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
[23] Interpretable POLSAR Image Classification Based on Adaptive-Dimension Feature Space Decision Tree
Yin, Qiang
Cheng, Jianda
Zhang, Fan
Zhou, Yongsheng
Shao, Luyi
Hong, Wen
IEEE ACCESS, 2020, 8 : 173826 - 173837
[24] A Three-Component Fisher-Based Feature Weighting Method for Supervised PolSAR Image Classification
Chen, Bo
Wang, Shuang
Jiao, Licheng
Stolkin, Rustam
Liu, Hongying
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (04) : 731 - 735
[25] GAF-MAE: A Self-Supervised Automatic Modulation Classification Method Based on Gramian Angular Field and Masked Autoencoder
Shi, Yunhao
Xu, Hua
Zhang, Yue
Qi, Zisen
Wang, Dan
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (01) : 94 - 106
[26] POL-SAR Image Classification Based on Modified Stacked Autoencoder Network and Data Distribution
Wang, Jianlong
Hou, Biao
Jiao, Licheng
Wang, Shuang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (03): : 1678 - 1695
[27] Low frequency and radar's physical based features for improvement of convolutional neural networks for PolSAR image classification
Imani, Maryam
EGYPTIAN JOURNAL OF REMOTE SENSING AND SPACE SCIENCES, 2022, 25 (01) : 55 - 62
[28] Multimodal Imputation-Based Multimodal Autoencoder Framework for AQI Classification and Prediction of Indian Cities
Rao, Routhu Srinivasa
Kalabarige, Lakshmana Rao
Holla, M. Raviraja
Sahu, Aditya Kumar
IEEE ACCESS, 2024, 12 : 108350 - 108363
[29] PolSAR Image Semantic Segmentation Based on Deep Transfer Learning-Realizing Smooth Classification With Small Training Sets
Wu, Weijin
Li, Hailei
Li, Xinwu
Guo, Huadong
Zhang, Lu
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (06) : 977 - 981
[30] Real-time interpretation method for range image based on position prediction
Zhong L.
Yu Q.
Zhou J.
Guo P.
Huang W.
Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2020, 42 (02): : 85 - 91

← 1 2 3 4 →