Fusing Spatial Attention with Spectral-Channel Attention Mechanism for Hyperspectral Image Classification via Encoder-Decoder Networks

被引:9
作者
Sun, Jun [1 ,2 ]
Zhang, Junbo [3 ]
Gao, Xuesong [1 ,2 ]
Wang, Mantao [3 ]
Ou, Dinghua [1 ,2 ]
Wu, Xiaobo [1 ,2 ]
Zhang, Dejun [4 ]
机构
[1] Sichuan Agr Univ, Coll Resources, Chengdu 611130, Peoples R China
[2] Minist Nat Resources, Key Lab Invest & Monitoring, Protect & Utilizat Cultivated Land Resources, Chengdu 611130, Peoples R China
[3] Sichuan Agr Univ, Coll Informat Engn, Yaan 625000, Peoples R China
[4] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
关键词
hyperspectral image classification; attention mechanism; transformer; RECURRENT NEURAL-NETWORKS; CNN;
D O I
10.3390/rs14091968
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In recent years, convolutional neural networks (CNNs) have been widely used in hyperspectral image (HSI) classification. However, feature extraction on hyperspectral data still faces numerous challenges. Existing methods cannot extract spatial and spectral-channel contextual information in a targeted manner. In this paper, we propose an encoder-decoder network that fuses spatial attention and spectral-channel attention for HSI classification from three public HSI datasets to tackle these issues. In terms of feature information fusion, a multi-source attention mechanism including spatial and spectral-channel attention is proposed to encode the spatial and spectral multi-channels contextual information. Moreover, three fusion strategies are proposed to effectively utilize spatial and spectral-channel attention. They are direct aggregation, aggregation on feature space, and Hadamard product. In terms of network development, an encoder-decoder framework is employed for hyperspectral image classification. The encoder is a hierarchical transformer pipeline that can extract long-range context information. Both shallow local features and rich global semantic information are encoded through hierarchical feature expressions. The decoder consists of suitable upsampling, skip connection, and convolution blocks, which fuse multi-scale features efficiently. Compared with other state-of-the-art methods, our approach has greater performance in hyperspectral image classification.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Predicting and Explaining Hearing Aid Usage Using Encoder-Decoder with Attention Mechanism and SHAP
    Su, Qiqi
    Iliadou, Eleftheria
    2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 308 - 315
  • [42] An encoder-decoder network for crowd counting based on multi-scale attention mechanism
    Chuang H.-H.
    Chen Y.-C.
    Lin C.H.
    Multimedia Tools and Applications, 2025, 84 (03) : 1187 - 1210
  • [43] When Multigranularity Meets Spatial-Spectral Attention: A Hybrid Transformer for Hyperspectral Image Classification
    Ouyang, Er
    Li, Bin
    Hu, Wenjing
    Zhang, Guoyun
    Zhao, Lin
    Wu, Jianhui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [44] Hyperspectral image classification based on three branch network with grouped spatial-spectral attention
    Su H.
    Chen N.
    Peng J.
    Sun W.
    National Remote Sensing Bulletin, 2024, 28 (01) : 247 - 265
  • [45] DISCRIMINATIVE SPECTRAL-SPATIAL ATTENTION-AWARE RESIDUAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Cai, Yaoming
    Dong, Zhimin
    Cai, Zhihua
    Liu, Xiaobo
    Wang, Guangjun
    2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,
  • [46] 3D Lightweight Spatial-Spectral Attention Network for Hyperspectral Image Classification
    Zheng, Ziyou
    Zhang, Shuzhen
    Song, Hailong
    Yan, Qi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 297 - 308
  • [47] AUTOMATIC SINGING TRANSCRIPTION BASED ON ENCODER-DECODER RECURRENT NEURAL NETWORKS WITH A WEAKLY-SUPERVISED ATTENTION MECHANISM
    Nishikimi, Ryo
    Nakamura, Eita
    Fukayama, Satoru
    Goto, Masataka
    Yoshii, Kazuyoshi
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 161 - 165
  • [48] Hyperspectral image classification based on spatial pyramid attention mechanism combined with ResNet
    Liu, He
    Song, Yingluo
    Hu, Longxiang
    Liu, Guohui
    Wang, Kan
    Wang, Aili
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (06) : 833 - 843
  • [49] Tool health monitoring and prediction via attention-based encoder-decoder with a multi-step mechanism
    Baosu Guo
    Qin Zhang
    Qinjing Peng
    Jichao Zhuang
    Fenghe Wu
    Quan Zhang
    The International Journal of Advanced Manufacturing Technology, 2022, 122 : 685 - 695
  • [50] Tool health monitoring and prediction via attention-based encoder-decoder with a multi-step mechanism
    Guo, Baosu
    Zhang, Qin
    Peng, Qinjing
    Zhuang, Jichao
    Wu, Fenghe
    Zhang, Quan
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (02) : 685 - 695