Cognitively-Inspired Multi-Scale Spectral-Spatial Transformer for Hyperspectral Image Super-Resolution

被引：0

作者：

Xu, Qin ^{[1
,2
,3
]}

Liu, Shiji ^{[1
,2
,3
]}

Liu, Jinpei ^{[4
]}

Luo, Bin ^{[1
,2
,3
]}

机构：

[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Peoples R China

[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China

[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China

[4] Anhui Univ, Sch Business, Hefei 230601, Peoples R China

来源：

COGNITIVE COMPUTATION | 2024年 / 16卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Hyperspectral image super-resolution; Transformer; Convolutional neural network; Multi-scale feature extraction; Perception;

D O I：

10.1007/s12559-023-10210-y

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The hyperspectral image (HSI) super-resolution (SR) without auxiliary high-resolution images is a challenging task in computer vision applications. The existing methods almost resort to the deep convolutional neural networks of fixed geometrical kernel, which can not model the long-range dependencies and does not conform to the human visual cognition. To address this issue, we propose the cognitively-inspired multi-scale spectral-spatial transformer for HSI SR. To solve the problem of high storage and computation burden, the overlapped band grouping strategy is adopted in light of high similarity between neighboring spectral bands of HSI. Considering the different textures and details that appear in HSIs, inspired by the human cognitive mechanism, the multi-scale spatial and spectral transformer blocks are developed which can efficiently and effectively learn the spatial and spectral feature representation at different scales and long-range dependencies of features. Finally, to fuse the feature information of neighboring groups, the 2D convolution mixed with 3D separable convolution is designed, which fully explores the complementarity and continuity of spatial and spectral information. Extensive experiments conducted on three benchmark datasets demonstrate that the proposed method yields state-of-the-art results at different scales. The effectiveness of the proposed method is verified through spatial and spectral dimension data visualization and ablation experiments. The code and models are publicly available at https://github.com/liushiji666/MMSSTN. The experimental results prove the effectiveness of our proposed method, which largely overcomes the disadvantage that convolution is ineffective for long-range dependence modeling. The method performs long-range dependence modeling on both spatial and spectral features and efficiently mines complementary information between bands, thereby enhancing the model's high perceptual ability.

引用

页码：377 / 391

页数：15

共 50 条

[41] Multi-scale implicit transformer with re-parameterization for arbitrary-scale super-resolution
Zhu, Jinchen
Zhang, Mingjian
Zheng, Ling
Weng, Shizhuang
PATTERN RECOGNITION, 2025, 162
[42] LESSFormer: Local-Enhanced Spectral-Spatial Transformer for Hyperspectral Image Classification
Zou, Jiaqi
He, Wei
Zhang, Hongyan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[43] Foundation Model-Based Spectral-Spatial Transformer for Hyperspectral Image Classification
Huang, Lingbo
Chen, Yushi
He, Xin
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[44] Image super-resolution using supervised multi-scale feature extraction network
Yemei Sun
Yan Zhang
Shudong Liu
Weijia Lu
Xianguo Li
Multimedia Tools and Applications, 2021, 80 : 1995 - 2008
[45] Single Image Super-Resolution Using Multi-scale Convolutional Neural Network
Jia, Xiaoyi
Xu, Xiangmin
Cai, Bolun
Guo, Kailing
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 149 - 157
[46] Single-image super-resolution via selective multi-scale network
Zewei He
Binjie Ding
Guizhong Fu
Yanpeng Cao
Jiangxin Yang
Yanlong Cao
Signal, Image and Video Processing, 2022, 16 : 937 - 945
[47] Image super-resolution using supervised multi-scale feature extraction network
Sun, Yemei
Zhang, Yan
Liu, Shudong
Lu, Weijia
Li, Xianguo
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 1995 - 2008
[48] A Channel-Wise Multi-Scale Network for Single Image Super-Resolution
Ji, Jiahuan
Zhong, Baojiang
Wu, Qihui
Ma, Kai-Kuang
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 805 - 809
[49] Single image super-resolution with lightweight multi-scale dilated attention network
Song, Xiaogang
Pang, Xinchao
Zhang, Lei
Lu, Xiaofeng
Hei, Xinhong
APPLIED SOFT COMPUTING, 2025, 169
[50] Spectral-Spatial Blockwise Masked Transformer With Contrastive Multi-View Learning for Hyperspectral Image Classification
Hu, Han
Liu, Zhenhui
Xu, Ziqing
Wang, Haoyi
Li, Xianju
Han, Xu
Peng, Jianyi
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT IV, 2025, 15034 : 480 - 494

← 1 2 3 4 5 →