Fusing Spatial Attention with Spectral-Channel Attention Mechanism for Hyperspectral Image Classification via Encoder-Decoder Networks

被引：9

作者：

Sun, Jun ^{[1
,2
]}

Zhang, Junbo ^{[3
]}

Gao, Xuesong ^{[1
,2
]}

Wang, Mantao ^{[3
]}

Ou, Dinghua ^{[1
,2
]}

Wu, Xiaobo ^{[1
,2
]}

Zhang, Dejun ^{[4
]}

机构：

[1] Sichuan Agr Univ, Coll Resources, Chengdu 611130, Peoples R China

[2] Minist Nat Resources, Key Lab Invest & Monitoring, Protect & Utilizat Cultivated Land Resources, Chengdu 611130, Peoples R China

[3] Sichuan Agr Univ, Coll Informat Engn, Yaan 625000, Peoples R China

[4] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China

来源：

REMOTE SENSING | 2022年 / 14卷 / 09期

关键词：

hyperspectral image classification; attention mechanism; transformer; RECURRENT NEURAL-NETWORKS; CNN;

D O I：

10.3390/rs14091968

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

In recent years, convolutional neural networks (CNNs) have been widely used in hyperspectral image (HSI) classification. However, feature extraction on hyperspectral data still faces numerous challenges. Existing methods cannot extract spatial and spectral-channel contextual information in a targeted manner. In this paper, we propose an encoder-decoder network that fuses spatial attention and spectral-channel attention for HSI classification from three public HSI datasets to tackle these issues. In terms of feature information fusion, a multi-source attention mechanism including spatial and spectral-channel attention is proposed to encode the spatial and spectral multi-channels contextual information. Moreover, three fusion strategies are proposed to effectively utilize spatial and spectral-channel attention. They are direct aggregation, aggregation on feature space, and Hadamard product. In terms of network development, an encoder-decoder framework is employed for hyperspectral image classification. The encoder is a hierarchical transformer pipeline that can extract long-range context information. Both shallow local features and rich global semantic information are encoded through hierarchical feature expressions. The decoder consists of suitable upsampling, skip connection, and convolution blocks, which fuse multi-scale features efficiently. Compared with other state-of-the-art methods, our approach has greater performance in hyperspectral image classification.

引用

页数：22

共 50 条

[41] Predicting and Explaining Hearing Aid Usage Using Encoder-Decoder with Attention Mechanism and SHAP
Su, Qiqi
Iliadou, Eleftheria
2022 16TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS, SITIS, 2022, : 308 - 315
[42] An encoder-decoder network for crowd counting based on multi-scale attention mechanism
Chuang H.-H.
Chen Y.-C.
Lin C.H.
Multimedia Tools and Applications, 2025, 84 (03) : 1187 - 1210
[43] When Multigranularity Meets Spatial-Spectral Attention: A Hybrid Transformer for Hyperspectral Image Classification
Ouyang, Er
Li, Bin
Hu, Wenjing
Zhang, Guoyun
Zhao, Lin
Wu, Jianhui
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[44] Hyperspectral image classification based on three branch network with grouped spatial-spectral attention
Su H.
Chen N.
Peng J.
Sun W.
National Remote Sensing Bulletin, 2024, 28 (01) : 247 - 265
[45] DISCRIMINATIVE SPECTRAL-SPATIAL ATTENTION-AWARE RESIDUAL NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
Cai, Yaoming
Dong, Zhimin
Cai, Zhihua
Liu, Xiaobo
Wang, Guangjun
2019 10TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING - EVOLUTION IN REMOTE SENSING (WHISPERS), 2019,
[46] 3D Lightweight Spatial-Spectral Attention Network for Hyperspectral Image Classification
Zheng, Ziyou
Zhang, Shuzhen
Song, Hailong
Yan, Qi
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 297 - 308
[47] AUTOMATIC SINGING TRANSCRIPTION BASED ON ENCODER-DECODER RECURRENT NEURAL NETWORKS WITH A WEAKLY-SUPERVISED ATTENTION MECHANISM
Nishikimi, Ryo
Nakamura, Eita
Fukayama, Satoru
Goto, Masataka
Yoshii, Kazuyoshi
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 161 - 165
[48] Hyperspectral image classification based on spatial pyramid attention mechanism combined with ResNet
Liu, He
Song, Yingluo
Hu, Longxiang
Liu, Guohui
Wang, Kan
Wang, Aili
CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (06) : 833 - 843
[49] Tool health monitoring and prediction via attention-based encoder-decoder with a multi-step mechanism
Baosu Guo
Qin Zhang
Qinjing Peng
Jichao Zhuang
Fenghe Wu
Quan Zhang
The International Journal of Advanced Manufacturing Technology, 2022, 122 : 685 - 695
[50] Tool health monitoring and prediction via attention-based encoder-decoder with a multi-step mechanism
Guo, Baosu
Zhang, Qin
Peng, Qinjing
Zhuang, Jichao
Wu, Fenghe
Zhang, Quan
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (02) : 685 - 695

← 1 2 3 4 5 →