Spatial and frequency information fusion transformer for image super-resolution

被引:0
|
作者
Zhang, Yan [1 ]
Xu, Fujie [1 ]
Sun, Yemei [1 ]
Wang, Jiao [1 ]
机构
[1] Tianjin Chengjian Univ, Coll Comp & Informat Engn, Tianjin 300384, Peoples R China
关键词
Super resolution; Vision transformer; Frequency components; Convolutional neural network;
D O I
10.1016/j.neunet.2025.107351
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous works have indicated that Transformer-based models bring impressive image reconstruction performance in single image super-resolution (SISR). However, existing Transformer-based approaches utilize self-attention within non-overlapping windows. This restriction hinders the network's ability to adopt large receptive fields, which are essential for capturing global information and establishing long-distance dependencies, especially in the early layers. To fully leverage global information and activate more pixels during the image reconstruction process, we have developed a Spatial and Frequency Information Fusion Transformer (SFFT) with an expansive receptive field. SFFT concurrently combines spatial and frequency domain information to comprehensively leverage their complementary strengths, capturing both local and global image features while integrating low and high-frequency information. Additionally, we utilize the overlapping cross-attention block (OCAB) to facilitate pixel transmission between adjacent windows, enhancing network performance. During the training stage, we incorporate the Fast Fourier Transform (FFT) loss, thereby fully leveraging the capabilities of our proposed modules and further tapping into the model's potential. Extensive quantitative and qualitative evaluations on benchmark datasets indicate that the proposed algorithm surpasses state-of-the-art methods in terms of accuracy. Specifically, our method achieves a PSNR score of 32.67 dB on the Manga109 dataset, surpassing SwinIR by 0.64 dB and HAT by 0.19 dB, respectively. The source code and pre-trained models are available at https://github.com/Xufujie/SFFT
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Spatial relaxation transformer for image super-resolution
    Li, Yinghua
    Zhang, Ying
    Zeng, Hao
    He, Jinglu
    Guo, Jie
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [2] Local spatial information for image super-resolution
    Zareapoor, Masoumeh
    Jain, Deepak Kumar
    Yang, Jie
    COGNITIVE SYSTEMS RESEARCH, 2018, 52 : 49 - 57
  • [3] Single Image Super-resolution Using Spatial Transformer Networks
    Wang, Qiang
    Fan, Huijie
    Cong, Yang
    Tang, Yandong
    2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 564 - 567
  • [4] Spstnet: image super-resolution using spatial pyramid swin transformer network
    Sun, Yemei
    Wang, Jiao
    Yang, Yue
    Zhang, Yan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (04)
  • [5] CNN–Transformer gated fusion network for medical image super-resolution
    Juanjuan Qin
    Jian Xiong
    Zhantu Liang
    Scientific Reports, 15 (1)
  • [6] Spatial Transformer Generative Adversarial Network for Robust Image Super-Resolution
    Kasem, Hossam M.
    Hung, Kwok-Wai
    Jiang, Jianmin
    IEEE ACCESS, 2019, 7 : 182993 - 183009
  • [7] INFORMATION-GROWTH SWIN TRANSFORMER NETWORK FOR IMAGE SUPER-RESOLUTION
    Ji, Yantao
    Jiang, Peilin
    Shi, Jingang
    Guo, Yu
    Zhang, Ruiteng
    Wang, Fei
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3993 - 3997
  • [8] IMAGE FUSION FOR HYPERSPECTRAL IMAGE SUPER-RESOLUTION
    Irmak, Hasan
    Akar, Gozde Bozdagi
    Yuksel, Seniha Esen
    2018 9TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2018,
  • [9] Image super-resolution method based on the interactive fusion of transformer and CNN features
    Wang, Jianxin
    Zou, Yongsong
    Alfarraj, Osama
    Sharma, Pradip Kumar
    Said, Wael
    Wang, Jin
    VISUAL COMPUTER, 2024, 40 (08) : 5827 - 5839
  • [10] Fusformer: A Transformer-Based Fusion Network for Hyperspectral Image Super-Resolution
    Hu, Jin-Fan
    Huang, Ting-Zhu
    Deng, Liang-Jian
    Dou, Hong-Xia
    Hong, Danfeng
    Vivone, Gemine
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19