Spectral Superresolution Using Transformer with Convolutional Spectral Self-Attention

被引：4

作者：

Liao, Xiaomei ^{[1
]}

He, Lirong ^{[2
]}

Mao, Jiayou ^{[2
]}

Xu, Meng ^{[2
]}

机构：

[1] Shenzhen Univ, Coll Life Sci & Oceanog, Shenzhen 518060, Peoples R China

[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

来源：

REMOTE SENSING | 2024年 / 16卷 / 10期

基金：

中国国家自然科学基金;

关键词：

hyperspectral image; spectral superresolution; transformer; convolutional neural network; self-attention; NETWORK;

D O I：

10.3390/rs16101688

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Hyperspectral images (HSI) find extensive application across numerous domains of study. Spectral superresolution (SSR) refers to reconstructing HSIs from readily available RGB images using the mapping relationships between RGB images and HSIs. In recent years, convolutional neural networks (CNNs) have become widely adopted in SSR research, primarily because of their exceptional ability to extract features. However, most current CNN-based algorithms are weak in terms of extracting the spectral features of HSIs. While certain algorithms can reconstruct HSIs through the fusion of spectral and spatial data, their practical effectiveness is hindered by their substantial computational complexity. In light of these challenges, we propose a lightweight network, Transformer with convolutional spectral self-attention (TCSSA), for SSR. TCSSA comprises a CNN-Transformer encoder and a CNN-Transformer decoder, in which the convolutional spectral self-attention blocks (CSSABs) are the basic modules. Multiple cascaded encoding and decoding modules within TCSSA facilitate the efficient extraction of spatial and spectral contextual information from HSIs. The convolutional spectral self-attention (CSSA) as the basic unit of CSSAB combines CNN with self-attention in the transformer, effectively extracting both spatial local features and global spectral features from HSIs. Experimental validation of TCSSA's effectiveness is performed on three distinct datasets: GF5 for remote sensing images along with CAVE and NTIRE2022 for natural images. The experimental results demonstrate that the proposed method achieves a harmonious balance between reconstruction performance and computational complexity.

引用

页数：20

共 62 条

[1] In Defense of Shallow Learned Spectral Reconstruction from RGB Images [J].

Aeschbacher, Jonas ;

Wu, Jiqing ;

Timofte, Radu .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :471-479

[2] Hyperspectral Recovery from RGB Images using Gaussian Processes [J].

Akhtar, Naveed ;

Mian, Ajmal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (01) :100-113

[3] NTIRE 2022 Spectral Recovery Challenge and Data Set [J].

Arad, Boaz ;

Timofte, Radu ;

Yahel, Rony ;

Morag, Nimrod ;

Bernat, Amir ;

Cai, Yuanhao ;

Lin, Jing ;

Lin, Zudi ;

Wang, Haoqian ;

Zhang, Yulun ;

Pfister, Hanspeter ;

Van Gool, Luc ;

Liu, Shuai ;

Li, Yongqiang ;

Feng, Chaoyu ;

Lei, Lei ;

Li, Jiaojiao ;

Du, Songcheng ;

Wu, Chaoxiong ;

Leng, Yihong ;

Song, Rui ;

Zhang, Mingwei ;

Song, Chongxing ;

Zhao, Shuyi ;

Lang, Zhiqiang ;

Wei, Wei ;

Zhang, Lei ;

Dian, Renwei ;

Shan, Tianci ;

Guo, Anjing ;

Feng, Chengguo ;

Liu, Jinyang ;

Agarla, Mirko ;

Bianco, Simone ;

Buzzelli, Marco ;

Celona, Luigi ;

Schettini, Raimondo ;

He, Jiang ;

Xiao, Yi ;

Xiao, Jiajun ;

Yuan, Qiangqiang ;

Li, Jie ;

Zhang, Liangpei ;

Kwon, Taesung ;

Ryu, Dohoon ;

Bae, Hyokyoung ;

Yang, Hao-Hsiang ;

Chang, Hua-En ;

Huang, Zhi-Kai ;

Chen, Wei-Ting .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :862-880

[4] Sparse Recovery of Hyperspectral Signal from Natural RGB Images [J].

Arad, Boaz ;

Ben-Shahar, Ohad .

COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :19-34

[5] Deep semantic segmentation of natural and medical images: a review [J].

Asgari Taghanaki, Saeid ;

Abhishek, Kumar ;

Cohen, Joseph Paul ;

Cohen-Adad, Julien ;

Hamarneh, Ghassan .

ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) :137-178

[6] Hyperspectral Super-Resolution Reconstruction Network Based on Hybrid Convolution and Spectral Symmetry Preservation [J].

Bu, Lijing ;

Dai, Dong ;

Zhang, Zhengpeng ;

Yang, Yin ;

Deng, Mingjun .

REMOTE SENSING, 2023, 15 (13)

[7] Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction [J].

Cai, Yuanhao ;

Lin, Jing ;

Hu, Xiaowan ;

Wang, Haoqian ;

Yuan, Xin ;

Zhang, Yulun ;

Timofte, Radu ;

Van Gool, Luc .

COMPUTER VISION - ECCV 2022, PT XVII, 2022, 13677 :686-704

[8]

Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13

[9]

Chakrabarti A, 2011, PROC CVPR IEEE, P193, DOI 10.1109/CVPR.2011.5995660

[10] ConViT: improving vision transformers with soft convolutional inductive biases [J].

d'Ascoli, Stephane ;

Touvron, Hugo ;

Leavitt, Matthew L. ;

Morcos, Ari S. ;

Biroli, Giulio ;

Sagun, Levent .

JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2022, 2022 (11)

← 1 2 3 4 5 6 7 →