Semi-supervised hierarchical Transformer for hyperspectral image classification

Cited by: 2
Authors
He, Ziping [1, 6]
Zhu, Qianglin [1]
Xia, Kewen [2]
Ghamisi, Pedram [3, 4]
Zu, Baokai [5]
Affiliations
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha, Peoples R China
[2] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin, Peoples R China
[3] Helmholtz Inst Freiberg Resource Technol, Helmholtz Zentrum Dresden Rossendorf HZDR, Freiberg, Germany
[4] Inst Adv Res Artificial Intelligence IARAI, AI4RS artificial intelligence remote sensing, Vienna, Austria
[5] Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[6] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410114, Peoples R China
Keywords
Hierarchical patch embedding; hyperspectral image classification; semi-supervised learning; transformer; GRAPH CONVOLUTIONAL NETWORKS;
DOI
10.1080/01431161.2023.2290996
Chinese Library Classification (CLC) number
TP7 [Remote sensing technology];
Discipline classification code
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
The Transformer has achieved outstanding performance in many fields, such as computer vision, owing to its powerful and efficient modelling ability and its long-range feature extraction capability, which complements convolution. However, because it lacks the innate inductive biases of CNNs, such as translation invariance and local sensitivity, the Transformer requires more data for learning. At the same time, labelled hyperspectral samples are scarce because annotation is time-consuming and costly. To address this, we propose a semi-supervised hierarchical Transformer model for hyperspectral image (HSI) classification that improves the classification performance of the Transformer with limited labelled samples. To perturb the samples more fully and thereby improve model performance, two different data augmentation methods are applied to the unlabelled samples, yielding two sets of augmented samples. The pseudo-label obtained on each original unlabelled sample simultaneously supervises both augmented samples derived from it, and only pseudo-labels above the threshold are retained. To further improve model stability and classification accuracy, hierarchical patch embedding is proposed to eliminate the mutual interference between pixels. Extensive experiments on three well-known hyperspectral datasets validate the effectiveness of the proposed semi-supervised Transformer model. The experiments show that the model achieves excellent classification accuracy even with only 10 labelled samples per category, effectively improving the classification performance of the Transformer when labelled samples are few.
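The pseudo-labelling step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the use of the maximum softmax probability as the quantity compared against the threshold, the threshold value 0.95, and the cross-entropy form of the loss are all assumptions made for illustration.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-subtraction for stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def pseudo_label_loss(logits_orig, logits_aug_a, logits_aug_b, threshold=0.95):
    """Hypothetical unsupervised loss for one batch of unlabelled samples.

    logits_orig  : model outputs on the original unlabelled samples
    logits_aug_a : model outputs on the first augmented view
    logits_aug_b : model outputs on the second augmented view
    threshold    : assumed confidence cut-off (0.95 is an illustrative value)
    """
    probs = softmax(logits_orig)
    confidence = probs.max(axis=1)      # assumed confidence measure
    pseudo = probs.argmax(axis=1)       # pseudo-label from the original sample
    mask = confidence >= threshold      # keep only confident pseudo-labels

    def masked_cross_entropy(logits):
        log_p = np.log(softmax(logits) + 1e-12)
        ce = -log_p[np.arange(len(pseudo)), pseudo]
        # Rejected samples contribute zero; average over the whole batch.
        return float((ce * mask).sum() / len(pseudo))

    # The same pseudo-label supervises both augmented views.
    loss = masked_cross_entropy(logits_aug_a) + masked_cross_entropy(logits_aug_b)
    return loss, pseudo, mask


if __name__ == "__main__":
    orig = np.array([[10.0, 0.0, 0.0], [0.1, 0.0, 0.0]])
    loss, pseudo, mask = pseudo_label_loss(orig, orig, orig)
    print(loss, pseudo, mask)
```

In this sketch the first sample is confident enough to produce a pseudo-label, while the second is masked out and contributes nothing to the loss.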
Pages: 21-50
Number of pages: 30
Related papers
70 in total
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]  
Ahmad M., 2021, Journal of Selected Topics in Applied Earth Observations and Remote Sensing, V15, P968, DOI 10.1109/JSTARS.2021.3133021
[3]  
Berthelot D., 2020, INT C LEARNING REPRE
[4]  
Berthelot D, 2019, ADV NEUR IN, V32
[5]  
Cascante-Bonilla P, 2021, AAAI CONF ARTIF INTE, V35, P6912
[6]   Semi-Supervised and Unsupervised Deep Visual Learning: A Survey [J].
Chen, Yanbei ;
Mancini, Massimiliano ;
Zhu, Xiatian ;
Akata, Zeynep .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (03) :1327-1347
[7]   Graph-based semi-supervised learning: A review [J].
Chong, Yanwen ;
Ding, Yun ;
Yan, Qing ;
Pan, Shaoming .
NEUROCOMPUTING, 2020, 408 (408) :216-230
[8]   Randaugment: Practical automated data augmentation with a reduced search space [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Shlens, Jonathon ;
Le, Quoc V. .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3008-3017
[9]   AutoAugment: Learning Augmentation Strategies from Data [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Mane, Dandelion ;
Vasudevan, Vijay ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :113-123
[10]   ConViT: improving vision transformers with soft convolutional inductive biases [J].
d'Ascoli, Stephane ;
Touvron, Hugo ;
Leavitt, Matthew L. ;
Morcos, Ari S. ;
Biroli, Giulio ;
Sagun, Levent .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (11)