Cognitively-Inspired Multi-Scale Spectral-Spatial Transformer for Hyperspectral Image Super-Resolution

被引:0
|
作者
Xu, Qin [1 ,2 ,3 ]
Liu, Shiji [1 ,2 ,3 ]
Liu, Jinpei [4 ]
Luo, Bin [1 ,2 ,3 ]
机构
[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230601, Peoples R China
[2] Anhui Univ, Anhui Prov Key Lab Multimodal Cognit Computat, Hefei 230601, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[4] Anhui Univ, Sch Business, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Hyperspectral image super-resolution; Transformer; Convolutional neural network; Multi-scale feature extraction; Perception;
D O I
10.1007/s12559-023-10210-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The hyperspectral image (HSI) super-resolution (SR) without auxiliary high-resolution images is a challenging task in computer vision applications. The existing methods almost resort to the deep convolutional neural networks of fixed geometrical kernel, which can not model the long-range dependencies and does not conform to the human visual cognition. To address this issue, we propose the cognitively-inspired multi-scale spectral-spatial transformer for HSI SR. To solve the problem of high storage and computation burden, the overlapped band grouping strategy is adopted in light of high similarity between neighboring spectral bands of HSI. Considering the different textures and details that appear in HSIs, inspired by the human cognitive mechanism, the multi-scale spatial and spectral transformer blocks are developed which can efficiently and effectively learn the spatial and spectral feature representation at different scales and long-range dependencies of features. Finally, to fuse the feature information of neighboring groups, the 2D convolution mixed with 3D separable convolution is designed, which fully explores the complementarity and continuity of spatial and spectral information. Extensive experiments conducted on three benchmark datasets demonstrate that the proposed method yields state-of-the-art results at different scales. The effectiveness of the proposed method is verified through spatial and spectral dimension data visualization and ablation experiments. The code and models are publicly available at https://github.com/liushiji666/MMSSTN. The experimental results prove the effectiveness of our proposed method, which largely overcomes the disadvantage that convolution is ineffective for long-range dependence modeling. The method performs long-range dependence modeling on both spatial and spectral features and efficiently mines complementary information between bands, thereby enhancing the model's high perceptual ability.
引用
收藏
页码:377 / 391
页数:15
相关论文
共 50 条
  • [31] Interactformer: Interactive Transformer and CNN for Hyperspectral Image Super-Resolution
    Liu, Yaoting
    Hu, Jianwen
    Kang, Xudong
    Luo, Jing
    Fan, Shaosheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [32] MSDformer: Multiscale Deformable Transformer for Hyperspectral Image Super-Resolution
    Chen, Shi
    Zhang, Lefei
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [33] Lightweight multi-scale distillation attention network for image super-resolution
    Tang, Yinggan
    Hu, Quanwei
    Bu, Chunning
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [34] LMSN:a lightweight multi-scale network for single image super-resolution
    Zou, Yiye
    Yang, Xiaomin
    Albertini, Marcelo Keese
    Hussain, Farhan
    MULTIMEDIA SYSTEMS, 2021, 27 (04) : 845 - 856
  • [35] Multi-scale feature selection network for lightweight image super-resolution
    Li, Minghong
    Zhao, Yuqian
    Zhang, Fan
    Luo, Biao
    Yang, Chunhua
    Gui, Weihua
    Chang, Kan
    NEURAL NETWORKS, 2024, 169 : 352 - 364
  • [36] MSTNet: A Multilevel Spectral-Spatial Transformer Network for Hyperspectral Image Classification
    Yu, Haoyang
    Xu, Zhen
    Zheng, Ke
    Hong, Danfeng
    Yang, Hao
    Song, Meiping
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [37] Multi-scale convolutional attention network for lightweight image super-resolution
    Xie, Feng
    Lu, Pei
    Liu, Xiaoyong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95
  • [38] LMSN:a lightweight multi-scale network for single image super-resolution
    Yiye Zou
    Xiaomin Yang
    Marcelo Keese Albertini
    Farhan Hussain
    Multimedia Systems, 2021, 27 : 845 - 856
  • [39] Thangka Hyperspectral Image Super-Resolution Based on a Spatial-Spectral Integration Network
    Wang, Sai
    Fan, Fenglei
    REMOTE SENSING, 2023, 15 (14)
  • [40] Lightweight Image Super-Resolution Reconstruction Method Based on Multi-scale Spatial Adaptive Attention Network
    Huang, Feng
    Liu, Hongwei
    Shen, Ying
    Qiu, Zhaobing
    Chen, Liqiong
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2025, 38 (01): : 36 - 50