A Spectral-Spatial Fusion Transformer Network for Hyperspectral Image Classification

被引:14
作者
Liao, Diling [1 ]
Shi, Cuiping [1 ]
Wang, Liguo [2 ]
机构
[1] Qiqihar Univ, Dept Commun Engn, Qiqihar 161000, Peoples R China
[2] Dalian Nationalities Univ, Coll Informat & Commun Engn, Dalian 116000, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Deep learning (DL); fusion; hyperspectral image (HSI); long-distance dependence; RESIDUAL NETWORK;
D O I
10.1109/TGRS.2023.3286950
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In the past, deep learning (DL) technologies have been widely used in hyperspectral image (HSI) classification tasks. Among them, convolutional neural networks (CNNs) use fixed-size receptive field (RF) to obtain spectral and spatial features of HSIs, showing great feature extraction capabilities, which are one of the most popular DL frameworks. However, the convolution using local extraction and global parameter sharing mechanism pays more attention to spatial content information, which changes the spectral sequence information in the learned features. In addition, CNN is difficult to describe the long-distance correlation between HSI pixels and bands. To solve these problems, a spectral-spatial fusion Transformer network (S2FTNet) is proposed for the classification of HSIs. Specifically, S2FTNet adopts the Transformer framework to build a spatial Transformer module (SpaFormer) and a spectral Transformer module (SpeFormer) to capture image spatial and spectral long-distance dependencies. In addition, an adaptive spectral-spatial fusion mechanism (AS(2)FM) is proposed to effectively fuse the obtained advanced high-level semantic features. Finally, a large number of experiments were carried out on four datasets, Indian Pines, Pavia, Salinas, and WHU-Hi-LongKou, which verified that the proposed S2FTNet can provide better classification performance than other the state-of-the-art networks.
引用
收藏
页数:16
相关论文
共 54 条
[1]   Hyperspectral Image Classification Based on Multibranch Attention Transformer Networks [J].
Bai, Jing ;
Wen, Zheng ;
Xiao, Zhu ;
Ye, Fawang ;
Zhu, Yongdong ;
Alazab, Mamoun ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[2]   3-D Deep Learning Approach for Remote Sensing Image Classification [J].
Ben Hamida, Amina ;
Benoit, Alexandre ;
Lambert, Patrick ;
Ben Amar, Chokri .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (08) :4420-4434
[3]   Composite kernels for hyperspectral image classification [J].
Camps-Valls, G ;
Gomez-Chova, L ;
Muñoz-Marí, J ;
Vila-Francés, J ;
Calpe-Maravilla, J .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2006, 3 (01) :93-97
[4]  
Chen YP, 2017, ADV NEUR IN, V30
[5]   Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks [J].
Chen, Yushi ;
Jiang, Hanlu ;
Li, Chunyang ;
Jia, Xiuping ;
Ghamisi, Pedram .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10) :6232-6251
[6]   LiteDepthwiseNet: A Lightweight Network for Hyperspectral Image Classification [J].
Cui, Benlei ;
Dong, Xue-Mei ;
Zhan, Qiaoqiao ;
Peng, Jiangtao ;
Sun, Weiwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]   Spatial-Spectral Unified Adaptive Probability Graph Convolutional Networks for Hyperspectral Image Classification [J].
Ding, Yun ;
Chong, Yanwen ;
Pan, Shaoming ;
Wang, Yujie ;
Nie, Congchong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) :3650-3664
[8]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9]   On the impact of PCA dimension reduction for hyperspectral detection of difficult targets [J].
Farrell, MD ;
Mersereau, RM .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2005, 2 (02) :192-195
[10]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149