Hyperspectral Image Classification Based on Multibranch Attention Transformer Networks

被引:44
作者
Bai, Jing [1 ]
Wen, Zheng [1 ]
Xiao, Zhu [2 ]
Ye, Fawang [3 ]
Zhu, Yongdong [4 ]
Alazab, Mamoun [5 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
[2] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Peoples R China
[3] Beijing Res Inst Uranium Geol, Natl Key Lab Remote Sensing Informat & Imagery An, Beijing 100029, Peoples R China
[4] Zhejiang Lab, Hangzhou 311121, Peoples R China
[5] Charles Darwin Univ, Coll Engn IT & Environm, Darwin, NT 0810, Australia
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国国家自然科学基金; 湖南省自然科学基金;
关键词
Feature extraction; Transformers; Convolutional neural networks; Convolution; Hyperspectral imaging; Training; Three-dimensional displays; Deep learning (DL); hyperspectral image classification (HSIC); multibranch prediction; self-attention (SA) mechanism; spatial attention; transformer model; REPRESENTATION; CNN;
D O I
10.1109/TGRS.2022.3196661
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Deep learning (DL) has become a mainstream method of hyperspectral image (HSI) classification. Many DL-based methods exploit spatial-spectral features to achieve better classification results. However, due to the complex backgrounds in HSIs, existing methods usually show unsatisfactory performance for the class pixels located on the land-cover category boundary area. In large part, this is because the network is susceptible to interference by the irrelevant information around the target pixel in the training stage, resulting in inaccurate feature extraction. In this article, a new multibranch transformer architecture (spectral spatial transformer (SST)-M) that assembles spatial attention and extracts spectral features is proposed to address this problem. The transformer model has a global receptive field and thus can integrate global spatial position information in the HSI cube. Meanwhile, we design a spatial sequence attention model to enhance the useful spatial location features and weaken invalid information. Considering that HSIs contain considerable spectral information, a spectral feature extraction model is designed to extract discriminative spectral features, replacing the widely used principal component analysis (PCA) method and obtaining better classification results than it. Finally, inspired by semantic segmentation, a mask prediction model is designed to classify all of the pixels in the HSI cube; this guides the neural network to learn precise pixel characteristics and spatial distributions. To verify the effectiveness of our algorithm (SST-M), quantitative experiments were conducted in three well-known datasets, namely, Indian Pines (IP), University of Pavia (PU), and Kennedy Space Center (KSC). The experimental results demonstrate that the proposed model achieves better performance than the other state-of-the-art methods.
引用
收藏
页数:17
相关论文
共 51 条
[1]  
Almahairi A, 2016, INT C MACHINE LEARNI, P2549
[2]   Feature selection and classification of hyperspectral images, with support vector machines [J].
Archibald, Rick ;
Fann, George .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2007, 4 (04) :674-677
[3]   Hyperspectral Image Classification Based on Deep Attention Graph Convolutional Network [J].
Bai, Jing ;
Ding, Bixiu ;
Xiao, Zhu ;
Jiao, Licheng ;
Chen, Hongyang ;
Regan, Amelia C. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   Class Incremental Learning With Few-Shots Based on Linear Programming for Hyperspectral Image Classification [J].
Bai, Jing ;
Yuan, Anran ;
Xiao, Zhu ;
Zhou, Huaji ;
Wang, Dingchen ;
Jiang, Hongbo ;
Jiao, Licheng .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (06) :5474-5485
[5]  
Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[6]   Statistical Detection Theory Approach to Hyperspectral Image Classification [J].
Chang, Chein-, I .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (04) :2057-2074
[7]   Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks [J].
Chen, Yushi ;
Jiang, Hanlu ;
Li, Chunyang ;
Jia, Xiuping ;
Ghamisi, Pedram .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10) :6232-6251
[8]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[9]   Semisupervised Feature Extraction of Hyperspectral Image Using Nonlinear Geodesic Sparse Hypergraphs [J].
Duan, Yule ;
Huang, Hong ;
Wang, Tao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149