Local Transformer With Spatial Partition Restore for Hyperspectral Image Classification

被引:74
作者
Xue, Zhaohui [1 ,2 ]
Xu, Qi [1 ,2 ]
Zhang, Mengxue [3 ]
机构
[1] Hohai Univ, Sch Earth Sci & Engn, Nanjing 211100, Peoples R China
[2] Hohai Univ, Jiangsu Prov Engn Res Ctr Water Resources & Envir, Nanjing 211100, Peoples R China
[3] Univ Valencia, Image Proc Lab, Valencia 46980, Spain
基金
中国国家自然科学基金;
关键词
Transformers; Convolutional neural networks; Image restoration; Feature extraction; Convolution; Licenses; Correlation; Convolutional neural network (CNN); hyperspectral image (HSI) classification; spatial attention; transformer; REPRESENTATION; NETWORK;
D O I
10.1109/JSTARS.2022.3174135
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural network (CNN) has exhibited enormous potentials in hyperspectral image (HSI) classification owing to excellent locally modeling ability. Although excellent performance of CNN-based methods has been witnessed, there still have some limitations of their internal network backbone. On the one hand, modeling long-distance context dependencies is an inborn defect, which leads to receptive field limitation and insufficient feature capture in HSI. On the other hand, CNN-based methods usually need various sample distribution to train and cannot infer dynamically, which may not capture the inherent changes of HSI data well. To overcome the abovementioned issues, we propose a novel local transformer with spatial partition restore network (SPRLT-Net) for HSI classification. First, local transformer is introduced to obtain the spatial attention weights dynamically by measuring the similarity between related pixel pairs. Second, a spatial partition restore (SPR) module is designed to split the input patch into several overlapping continuous subpatches as sequential. With the obtained attention weights at hand, the SPR module restores the sequential to the original patch. Finally, a fully connected layer is used for classification. SPRLT-Net can capture global context dependencies, and the dynamical attention weights can adapt the inherent changes of HSI spatial pixels. Experimental results based on spatially disjoint samples and randomly selected samples of five benchmark datasets demonstrate that SPRLT-Net outperforms the other state-of-the-art methods in terms of classification accuracy, generalization performance, and computational complexity.
引用
收藏
页码:4307 / 4325
页数:19
相关论文
共 55 条
[1]   Deep Learning With Attribute Profiles for Hyperspectral Image Classification [J].
Aptoula, Erchan ;
Ozdemir, Murat Can ;
Yanikoglu, Berrin .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (12) :1970-1974
[2]   Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks [J].
Chen, Yushi ;
Jiang, Hanlu ;
Li, Chunyang ;
Jia, Xiuping ;
Ghamisi, Pedram .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10) :6232-6251
[3]   Deep Learning-Based Classification of Hyperspectral Data [J].
Chen, Yushi ;
Lin, Zhouhan ;
Zhao, Xing ;
Wang, Gang ;
Gu, Yanfeng .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (06) :2094-2107
[4]  
Chu X., ARXIV210210882
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]  
Dosovitskiy A, ARXIV201011929
[7]   TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images [J].
Feng, Xiaoxu ;
Han, Junwei ;
Yao, Xiwen ;
Cheng, Gong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) :6946-6955
[8]   Hyperspectral Image Classification With Pre-Activation Residual Attention Network [J].
Gao, Hongmin ;
Yang, Yao ;
Yao, Dan ;
Li, Chenming .
IEEE ACCESS, 2019, 7 :176587-176599
[9]   New Frontiers in Spectral-Spatial Hyperspectral Image Classification The latest advances based on mathematical morphology, Markov random fields, segmentation, sparse representation, and deep learning [J].
Ghamisi, Pedram ;
Maggiori, Emmanuel ;
Li, Shutao ;
Souza, Roberto ;
Tarabalka, Yuliya ;
Moser, Gabriele ;
De Giorgi, Andrea ;
Fang, Leyuan ;
Chen, Yushi ;
Chi, Mingmin ;
Serpico, Sebastiano B. ;
Benediktsson, Jon Atli .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2018, 6 (03) :10-43
[10]   LeViT: a Vision Transformer in ConvNet's Clothing for Faster Inference [J].
Graham, Ben ;
El-Nouby, Alaaeldin ;
Touvron, Hugo ;
Stock, Pierre ;
Joulin, Armand ;
Jegou, Herve ;
Douze, Matthijs .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :12239-12249