Hyperspectral Image Classification Using Multi-Scale Lightweight Transformer

被引:2
|
作者
Gu, Quan [1 ]
Luan, Hongkang [1 ]
Huang, Kaixuan [1 ]
Sun, Yubao [1 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Minist Educ, Nanjing 210044, Peoples R China
基金
中国国家自然科学基金;
关键词
hyperspectral image classification; multi-scale spectral attention; Transformer; long-range spectral dependence; SPARSE REPRESENTATION;
D O I
10.3390/electronics13050949
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The distinctive feature of hyperspectral images (HSIs) is their large number of spectral bands, which allows us to identify categories of ground objects by capturing discrepancies in spectral information. Convolutional neural networks (CNN) with attention modules effectively improve the classification accuracy of HSI. However, CNNs are not successful in capturing long-range spectral-spatial dependence. In recent years, Vision Transformer (VIT) has received widespread attention due to its excellent performance in acquiring long-range features. However, it requires calculating the pairwise correlation between token embeddings and has the complexity of the square of the number of tokens, which leads to an increase in the computational complexity of the network. In order to cope with this issue, this paper proposes a multi-scale spectral-spatial attention network with frequency-domain lightweight Transformer (MSA-LWFormer) for HSI classification. This method synergistically integrates CNN, attention mechanisms, and Transformer into the spectral-spatial feature extraction module and frequency-domain fused classification module. Specifically, the spectral-spatial feature extraction module employs a multi-scale 2D-CNN with multi-scale spectral attention (MS-SA) to extract the shallow spectral-spatial features and capture the long-range spectral dependence. In addition, The frequency-domain fused classification module designs a frequency-domain lightweight Transformer that employs the Fast Fourier Transform (FFT) to convert features from the spatial domain to the frequency domain, effectively extracting global information and significantly reducing the time complexity of the network. Experiments on three classic hyperspectral datasets show that MSA-LWFormer has excellent performance.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] MULTI-SCALE 3D DEEP CONVOLUTIONAL NEURAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    He, Mingyi
    Li, Bo
    Chen, Huahui
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3904 - 3908
  • [32] Multi-Scale Spatial Perception Attention Network for Few-Shot Hyperspectral Image Classification
    Li, Yang
    Luo, Jian
    Long, Haoyu
    Jin, Qianqian
    IEEE ACCESS, 2024, 12 : 173076 - 173090
  • [33] A Multi-scale Convolutional Neural Network Based on Multilevel Wavelet Decomposition for Hyperspectral Image Classification
    Yang C.
    Song D.
    Wang B.
    Tang Y.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13536 LNCS : 484 - 496
  • [34] Bi-directional LSTM with multi-scale dense attention mechanism for hyperspectral image classification
    Gao, Jinxiong
    Gao, Xiumei
    Wu, Nan
    Yang, Hongye
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24003 - 24020
  • [35] Bi-directional LSTM with multi-scale dense attention mechanism for hyperspectral image classification
    Jinxiong Gao
    Xiumei Gao
    Nan Wu
    Hongye Yang
    Multimedia Tools and Applications, 2022, 81 : 24003 - 24020
  • [36] A Multi-Scale and Multi-Level Spectral-Spatial Feature Fusion Network for Hyperspectral Image Classification
    Mu, Caihong
    Guo, Zhen
    Liu, Yi
    REMOTE SENSING, 2020, 12 (01)
  • [37] Multi-granularity vision transformer via semantic token for hyperspectral image classification
    Li, Bin
    Ouyang, Er
    Hu, Wenjing
    Zhang, Guoyun
    Zhao, Lin
    Wu, Jianhui
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (17) : 6538 - 6560
  • [38] Tensor Transformer for hyperspectral image classification
    Zhang, Wei-Tao
    Bai, Yv
    Zheng, Sheng-Di
    Cui, Jian
    Huang, Zhen-zhen
    PATTERN RECOGNITION, 2025, 163
  • [39] Multi-scale and multi-patch transformer for sandstorm image enhancement
    Liang, Pengwei
    Ding, Wenyu
    Fan, Lu
    Wang, Haoyu
    Li, Zihong
    Yang, Fan
    Wang, Bo
    Li, Chongyi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89
  • [40] Cognitively-Inspired Multi-Scale Spectral-Spatial Transformer for Hyperspectral Image Super-Resolution
    Qin Xu
    Shiji Liu
    Jinpei Liu
    Bin Luo
    Cognitive Computation, 2024, 16 (1) : 377 - 391