Hyperspectral Image Classification Based on Transposed Convolutional Neural Network Transformer

被引:1
作者
Liu, Baisen [1 ,2 ,3 ]
Jia, Zongting [1 ]
Guo, Penggang [1 ]
Kong, Weili [1 ]
机构
[1] Harbin Univ Sci & Technol, Sch Measurement & Commun Engn, Harbin 150080, Peoples R China
[2] Heilongjiang Inst Technol, Coll Elect & Informat Engn, Harbin 150001, Peoples R China
[3] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150001, Peoples R China
关键词
spectral inception; spatial transposed inception; cross-cosine attention mechanism; hyperspectral image; CNN; transformer; REMOTE-SENSING TECHNIQUE; CNN;
D O I
10.3390/electronics12183879
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hyperspectral imaging is a technique that captures images of objects within a wide spectrum range, allowing for the acquisition of additional spectral information to reveal subtle variations and compositional components in the objects. Convolutional neural networks (CNNs) have shown remarkable feature extraction capabilities for HSI classification, but their ability to capture deep semantic features is limited. On the other hand, transformer models based on attention mechanisms excel at handling sequential data and have demonstrated great potential in various applications. Motivated by these two facts, this paper proposes a multiscale spectral-spatial transposed transformer (MSSTT) that captures the high-level semantic features of an HSI while preserving the spectral information as much as possible. The MSSTT consists of a spectral-spatial Inception module that extracts spectral and spatial features using multiscale convolutional kernels, and a spatial transpose Inception module that further enhances and extracts spatial information. A transformer model with a cosine attention mechanism is also included to extract deep semantic features, with the QKV matrix constrained to ensure the output remains within the activation range. Finally, the classification results are obtained by applying a linear layer to the learnable tokens. The experimental results from three public datasets show that the proposed MSSTT outperforms other deep learning methods in HSI classification. On the India Pines, Pavia University, and Salinas datasets, accuracies of 97.19%, 99.47%, and 99.90% were achieved, respectively, with a training set proportion of 5%.
引用
收藏
页数:18
相关论文
共 39 条
  • [11] Cascaded Recurrent Neural Networks for Hyperspectral Image Classification
    Hang, Renlong
    Liu, Qingshan
    Hong, Danfeng
    Ghamisi, Pedram
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (08): : 5384 - 5394
  • [12] He MY, 2017, IEEE IMAGE PROC, P3904, DOI 10.1109/ICIP.2017.8297014
  • [13] Spatial-Spectral Transformer for Hyperspectral Image Classification
    He, Xin
    Chen, Yushi
    Lin, Zhouhan
    [J]. REMOTE SENSING, 2021, 13 (03) : 1 - 22
  • [14] Optimized Input for CNN-Based Hyperspectral Image Classification Using Spatial Transformer Network
    He, Xin
    Chen, Yushi
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (12) : 1884 - 1888
  • [15] Deep Convolutional Neural Networks for Hyperspectral Image Classification
    Hu, Wei
    Huang, Yangyu
    Wei, Li
    Zhang, Fan
    Li, Hengchao
    [J]. JOURNAL OF SENSORS, 2015, 2015
  • [16] Subspace Clustering for Hyperspectral Images via Dictionary Learning With Adaptive Regularization
    Huang, Shaoguang
    Zhang, Hongyan
    Pizurica, Aleksandra
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [17] Hyperspectral image data analysis
    Landgrebe, D
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2002, 19 (01) : 17 - 28
  • [18] Deep Learning for Hyperspectral Image Classification: An Overview
    Li, Shutao
    Song, Weiwei
    Fang, Leyuan
    Chen, Yushi
    Ghamisi, Pedram
    Benediktsson, Jon Atli
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (09): : 6690 - 6709
  • [19] Spectral-Spatial Classification of Hyperspectral Imagery with 3D Convolutional Neural Network
    Li, Ying
    Zhang, Haokui
    Shen, Qiang
    [J]. REMOTE SENSING, 2017, 9 (01)
  • [20] Linzen T., 2016, Transactions of the Association of Computational Linguistics, V4, P521, DOI DOI 10.1162/TACLA00115