Cognitively-Inspired Multi-Scale Spectral-Spatial Transformer for Hyperspectral Image Super-Resolution

被引:0
|
作者
Xu, Qin [1 ,2 ,3 ]
Liu, Shiji [1 ,2 ,3 ]
Liu, Jinpei [4 ]
Luo, Bin [1 ,2 ,3 ]
机构
[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[4] Anhui Univ, Sch Business, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperspectral image super-resolution; Transformer; Convolutional neural network; Multi-scale feature extraction; Perception;
D O I
10.1007/s12559-023-10210-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The hyperspectral image (HSI) super-resolution (SR) without auxiliary high-resolution images is a challenging task in computer vision applications. The existing methods almost resort to the deep convolutional neural networks of fixed geometrical kernel, which can not model the long-range dependencies and does not conform to the human visual cognition. To address this issue, we propose the cognitively-inspired multi-scale spectral-spatial transformer for HSI SR. To solve the problem of high storage and computation burden, the overlapped band grouping strategy is adopted in light of high similarity between neighboring spectral bands of HSI. Considering the different textures and details that appear in HSIs, inspired by the human cognitive mechanism, the multi-scale spatial and spectral transformer blocks are developed which can efficiently and effectively learn the spatial and spectral feature representation at different scales and long-range dependencies of features. Finally, to fuse the feature information of neighboring groups, the 2D convolution mixed with 3D separable convolution is designed, which fully explores the complementarity and continuity of spatial and spectral information. Extensive experiments conducted on three benchmark datasets demonstrate that the proposed method yields state-of-the-art results at different scales. The effectiveness of the proposed method is verified through spatial and spectral dimension data visualization and ablation experiments. The code and models are publicly available at https://github.com/liushiji666/MMSSTN. The experimental results prove the effectiveness of our proposed method, which largely overcomes the disadvantage that convolution is ineffective for long-range dependence modeling. The method performs long-range dependence modeling on both spatial and spectral features and efficiently mines complementary information between bands, thereby enhancing the model's high perceptual ability.
引用
收藏
页码:377 / 391
页数:15
相关论文
共 50 条
  • [41] Multi-scale implicit transformer with re-parameterization for arbitrary-scale super-resolution
    Zhu, Jinchen
    Zhang, Mingjian
    Zheng, Ling
    Weng, Shizhuang
    PATTERN RECOGNITION, 2025, 162
  • [42] LESSFormer: Local-Enhanced Spectral-Spatial Transformer for Hyperspectral Image Classification
    Zou, Jiaqi
    He, Wei
    Zhang, Hongyan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [43] Foundation Model-Based Spectral-Spatial Transformer for Hyperspectral Image Classification
    Huang, Lingbo
    Chen, Yushi
    He, Xin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [44] Image super-resolution using supervised multi-scale feature extraction network
    Yemei Sun
    Yan Zhang
    Shudong Liu
    Weijia Lu
    Xianguo Li
    Multimedia Tools and Applications, 2021, 80 : 1995 - 2008
  • [45] Single Image Super-Resolution Using Multi-scale Convolutional Neural Network
    Jia, Xiaoyi
    Xu, Xiangmin
    Cai, Bolun
    Guo, Kailing
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 149 - 157
  • [46] Single-image super-resolution via selective multi-scale network
    Zewei He
    Binjie Ding
    Guizhong Fu
    Yanpeng Cao
    Jiangxin Yang
    Yanlong Cao
    Signal, Image and Video Processing, 2022, 16 : 937 - 945
  • [47] Image super-resolution using supervised multi-scale feature extraction network
    Sun, Yemei
    Zhang, Yan
    Liu, Shudong
    Lu, Weijia
    Li, Xianguo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 1995 - 2008
  • [48] A Channel-Wise Multi-Scale Network for Single Image Super-Resolution
    Ji, Jiahuan
    Zhong, Baojiang
    Wu, Qihui
    Ma, Kai-Kuang
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 805 - 809
  • [49] Single image super-resolution with lightweight multi-scale dilated attention network
    Song, Xiaogang
    Pang, Xinchao
    Zhang, Lei
    Lu, Xiaofeng
    Hei, Xinhong
    APPLIED SOFT COMPUTING, 2025, 169
  • [50] Spectral-Spatial Blockwise Masked Transformer With Contrastive Multi-View Learning for Hyperspectral Image Classification
    Hu, Han
    Liu, Zhenhui
    Xu, Ziqing
    Wang, Haoyi
    Li, Xianju
    Han, Xu
    Peng, Jianyi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT IV, 2025, 15034 : 480 - 494