Spectral Superresolution Using Transformer with Convolutional Spectral Self-Attention

被引：4

作者：

Liao, Xiaomei ^{[1
]}

He, Lirong ^{[2
]}

Mao, Jiayou ^{[2
]}

Xu, Meng ^{[2
]}

机构：

[1] Shenzhen Univ, Coll Life Sci & Oceanog, Shenzhen 518060, Peoples R China

[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

来源：

REMOTE SENSING | 2024年 / 16卷 / 10期

基金：

中国国家自然科学基金;

关键词：

hyperspectral image; spectral superresolution; transformer; convolutional neural network; self-attention; NETWORK;

D O I：

10.3390/rs16101688

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Hyperspectral images (HSI) find extensive application across numerous domains of study. Spectral superresolution (SSR) refers to reconstructing HSIs from readily available RGB images using the mapping relationships between RGB images and HSIs. In recent years, convolutional neural networks (CNNs) have become widely adopted in SSR research, primarily because of their exceptional ability to extract features. However, most current CNN-based algorithms are weak in terms of extracting the spectral features of HSIs. While certain algorithms can reconstruct HSIs through the fusion of spectral and spatial data, their practical effectiveness is hindered by their substantial computational complexity. In light of these challenges, we propose a lightweight network, Transformer with convolutional spectral self-attention (TCSSA), for SSR. TCSSA comprises a CNN-Transformer encoder and a CNN-Transformer decoder, in which the convolutional spectral self-attention blocks (CSSABs) are the basic modules. Multiple cascaded encoding and decoding modules within TCSSA facilitate the efficient extraction of spatial and spectral contextual information from HSIs. The convolutional spectral self-attention (CSSA) as the basic unit of CSSAB combines CNN with self-attention in the transformer, effectively extracting both spatial local features and global spectral features from HSIs. Experimental validation of TCSSA's effectiveness is performed on three distinct datasets: GF5 for remote sensing images along with CAVE and NTIRE2022 for natural images. The experimental results demonstrate that the proposed method achieves a harmonious balance between reconstruction performance and computational complexity.

引用

页数：20

共 62 条

[61] Hierarchical Regression Network for Spectral Reconstruction from RGB Images [J].

Zhao, Yuzhi ;

Po, Lai-Man ;

Yan, Qiong ;

Liu, Wei ;

Lin, Tingyu .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :1695-1704

[62] Deep Amended Gradient Descent for Efficient Spectral Reconstruction From Single RGB Images [J].

Zhu, Zhiyu ;

Liu, Hui ;

Hou, Junhui ;

Jia, Sen ;

Zhang, Qingfu .

IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2021, 7 :1176-1188

← 1 2 3 4 5 6 7 →