MSMT-LCL: Multiscale Spatial-Spectral Masked Transformer With Local Contrastive Learning for Hyperspectral Image Classification

被引:2
作者
Zhou, Yunfei [1 ]
Huang, Xiaohui [1 ]
Yang, Xiaofei [2 ]
Peng, Jiangtao [3 ,4 ]
Ban, Yifang [5 ]
Jiang, Nan [1 ]
机构
[1] East China Jiaotong Univ, Sch Informat & Software Engn, Nanchang 330013, Peoples R China
[2] Guangzhou Univ, Sch Elect & Commun Engn, Guangzhou 511370, Peoples R China
[3] Hubei Univ, Hubei Key Lab Appl Math, Key Lab Intelligent Sensing Syst & Secur, Minist Educ, Wuhan 430062, Peoples R China
[4] Hubei Univ, Fac Math & Stat, Wuhan 430062, Peoples R China
[5] KTH Royal Inst Technol, Sch Architecture & Built Environm, Div Geoinformat, S-11428 Stockholm, Sweden
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Contrastive learning (CL); deep learning; hyperspectral image (HSI) classification; masked Transformer; FUSION;
D O I
10.1109/TGRS.2024.3472066
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Deep learning plays a crucial role in hyperspectral image (HSI) classification, with the Transformer being highly favored by researchers due to its exceptional ability to model long-range dependencies. However, the Transformer necessitates a substantial amount of labeled training samples to train its numerous parameters, exacerbating the challenge of training an effective HSI classification Transformer model, particularly given the inherent scarcity of HSI data. Therefore, we propose a novel method for HSI classification, termed multiscale spatial-spectral masked Transformer with local contrastive learning (MSMT-LCL). This method consists of two stages: self-supervised pretraining and supervised fine-tuning. Initially, we utilize the multiscale augmented feature mapping module (MAFM) to project original HSI data into two mixed-scale feature maps, which are then separately fed into two masked Transformer branches for reconstruction. To facilitate the model in learning the dependency relationships between central pixel land-cover information and neighboring land cover, we introduce a novel mask strategy based on center-patch. Furthermore, in the pretraining stage, we integrate local contrastive learning (LCL) to enable the model to focus on local center information at varying scales. Upon completion of pretraining, the network undergoes fine-tuning to obtain feature maps at two different scales. Subsequently, we devise a novel adaptive multiscale feature fusion module (AMFM) to adaptively aggregate these two features and produce the final classification results. Extensive experiments on three real datasets demonstrate the superiority of our proposed MSMT-LCL method over several state-of-the-art HSI classification methods.
引用
收藏
页数:16
相关论文
共 50 条
[1]   Transformer-Based Masked Autoencoder With Contrastive Loss for Hyperspectral Image Classification [J].
Cao, Xianghai ;
Lin, Haifeng ;
Guo, Shuaixu ;
Xiong, Tao ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[2]  
Chen T, 2020, PR MACH LEARN RES, V119
[3]   LFSMIM: A Low-Frequency Spectral Masked Image Modeling Method for Hyperspectral Image Classification [J].
Chen, Yuhan ;
Yan, Qingyun .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 :1-5
[4]   Hyperspectral and LiDAR Data Fusion: Outcome of the 2013 GRSS Data Fusion Contest [J].
Debes, Christian ;
Merentitis, Andreas ;
Heremans, Roel ;
Hahn, Juergen ;
Frangiadakis, Nikolaos ;
van Kasteren, Tim ;
Liao, Wenzhi ;
Bellens, Rik ;
Pizurica, Aleksandra ;
Gautama, Sidharta ;
Philips, Wilfried ;
Prasad, Saurabh ;
Du, Qian ;
Pacifici, Fabio .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (06) :2405-2418
[5]  
Dosovitskiy A., 2021, INT C LEARN REPR, P1
[6]   Multiscale Random-Shape Convolution and Adaptive Graph Convolution Fusion Network for Hyperspectral Image Classification [J].
Gao, Hongmin ;
Sheng, Runhua ;
Chen, Zhonghao ;
Liu, Haiyun ;
Xu, Shufang ;
Zhang, Bing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[7]   AMSSE-Net: Adaptive Multiscale Spatial–Spectral Enhancement Network for Classification of Hyperspectral and LiDAR Data [J].
Gao, Hongmin ;
Feng, Hao ;
Zhang, Yiyan ;
Xu, Shufang ;
Zhang, Bing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 :1-17
[8]   Hyperspectral Target Detection via Spectral Aggregation and Separation Network With Target Band Random Mask [J].
Gao, Hongmin ;
Zhang, Yitong ;
Chen, Zhonghao ;
Xu, Feng ;
Hong, Danfeng ;
Zhang, Bing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[9]   Cross-Domain Contrastive Learning for Hyperspectral Image Classification [J].
Guan, Peiyan ;
Lam, Edmund Y. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   Cascaded Recurrent Neural Networks for Hyperspectral Image Classification [J].
Hang, Renlong ;
Liu, Qingshan ;
Hong, Danfeng ;
Ghamisi, Pedram .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (08) :5384-5394