Spatial and frequency information fusion transformer for image super-resolution

被引:0
|
作者
Zhang, Yan [1 ]
Xu, Fujie [1 ]
Sun, Yemei [1 ]
Wang, Jiao [1 ]
机构
[1] Tianjin Chengjian Univ, Coll Comp & Informat Engn, Tianjin 300384, Peoples R China
关键词
Super resolution; Vision transformer; Frequency components; Convolutional neural network;
D O I
10.1016/j.neunet.2025.107351
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous works have indicated that Transformer-based models bring impressive image reconstruction performance in single image super-resolution (SISR). However, existing Transformer-based approaches utilize self-attention within non-overlapping windows. This restriction hinders the network's ability to adopt large receptive fields, which are essential for capturing global information and establishing long-distance dependencies, especially in the early layers. To fully leverage global information and activate more pixels during the image reconstruction process, we have developed a Spatial and Frequency Information Fusion Transformer (SFFT) with an expansive receptive field. SFFT concurrently combines spatial and frequency domain information to comprehensively leverage their complementary strengths, capturing both local and global image features while integrating low and high-frequency information. Additionally, we utilize the overlapping cross-attention block (OCAB) to facilitate pixel transmission between adjacent windows, enhancing network performance. During the training stage, we incorporate the Fast Fourier Transform (FFT) loss, thereby fully leveraging the capabilities of our proposed modules and further tapping into the model's potential. Extensive quantitative and qualitative evaluations on benchmark datasets indicate that the proposed algorithm surpasses state-of-the-art methods in terms of accuracy. Specifically, our method achieves a PSNR score of 32.67 dB on the Manga109 dataset, surpassing SwinIR by 0.64 dB and HAT by 0.19 dB, respectively. The source code and pre-trained models are available at https://github.com/Xufujie/SFFT
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Spatial relaxation transformer for image super-resolution
    Li, Yinghua
    Zhang, Ying
    Zeng, Hao
    He, Jinglu
    Guo, Jie
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [2] Local spatial information for image super-resolution
    Zareapoor, Masoumeh
    Jain, Deepak Kumar
    Yang, Jie
    COGNITIVE SYSTEMS RESEARCH, 2018, 52 : 49 - 57
  • [3] Single Image Super-resolution Using Spatial Transformer Networks
    Wang, Qiang
    Fan, Huijie
    Cong, Yang
    Tang, Yandong
    2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 564 - 567
  • [4] Spatial Transformer Generative Adversarial Network for Image Super-Resolution
    Rempakos, Pantelis
    Vrigkas, Michalis
    Plissiti, Marina E.
    Nikou, Christophoros
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT I, 2023, 14233 : 399 - 411
  • [5] Transformer for Single Image Super-Resolution
    Lu, Zhisheng
    Li, Juncheng
    Liu, Hong
    Huang, Chaoyan
    Zhang, Linlin
    Zeng, Tieyong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 456 - 465
  • [6] A spectral and spatial transformer for hyperspectral remote sensing image super-resolution
    Wang, Bingqian
    Chen, Jianhua
    Wang, Huajun
    Tang, Yipeng
    Chen, Jiongling
    Jiang, Ye
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [7] A novel spatial and spectral transformer network for hyperspectral image super-resolution
    Wu, Huapeng
    Xu, Hui
    Zhan, Tianming
    MULTIMEDIA SYSTEMS, 2024, 30 (03)
  • [8] Spatial Transformer Generative Adversarial Network for Robust Image Super-Resolution
    Kasem, Hossam M.
    Hung, Kwok-Wai
    Jiang, Jianmin
    IEEE ACCESS, 2019, 7 : 182993 - 183009
  • [9] INFORMATION-GROWTH SWIN TRANSFORMER NETWORK FOR IMAGE SUPER-RESOLUTION
    Ji, Yantao
    Jiang, Peilin
    Shi, Jingang
    Guo, Yu
    Zhang, Ruiteng
    Wang, Fei
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3993 - 3997
  • [10] Multiframe image super-resolution adapted with local spatial information
    Zhang, Liangpei
    Yuan, Qiangqiang
    Shen, Huanfeng
    Li, Pingxiang
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2011, 28 (03) : 381 - 390