HCT: image super-resolution restoration using hierarchical convolution transformer networks

被引:0
|
作者
Guo, Ying [1 ,2 ]
Tian, Chang [1 ]
Wang, Han [1 ]
Liu, Jie [1 ]
Di, Chong [3 ]
Ning, Keqing [1 ]
机构
[1] North China Univ Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing 100084, Peoples R China
[3] Qilu Univ Technol, Shandong Artificial Intelligence Inst, Shandong Acad Sci, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Hierarchical convolution network; Swin transformer; Image super-resolution;
D O I
10.1007/s10044-025-01413-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the computer vision domain, image super-resolution (SR) technology, which restores high-resolution details from low-resolution images, plays a vital role in practical applications such as medical imaging, public safety, and remote sensing. Traditional methods employ convolutional neural networks to address these issues, while Visual Transformers show potential performance in high-level vision tasks. However, compared to typical CNN architecture networks, Visual Transformers exhibit weaker reliance on high-frequency information in images, leading to blurred details and residual artifacts. To solve this issue, we use a hierarchical network structure, which allows for a more flexible feeling field for our approach. Firstly, our method complements lost spatial features using a Convolutional Swin Transformer Layer incorporating a Convolutional Feed Forward Network. This allows for the retrieval of missing spatial information and enhances the model's representational capabilities. Next, deep feature extraction is performed by combining multiple layers into a Residual Convolutional Swin Transformer Block. Finally, we employ a hierarchical-type structure to combine the features of each branch. Experiments validate the effectiveness of the proposed method in generating images with greater detail aligned with human perception. Based on the experiments, our method is effective on SR tasks with magnification factors of 2, 3, and 4. Our method can reconstruct a clear and complete edge structure. We provide code at https://github.com/Q88392/HCT.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] CoT-MISR:Marrying convolution and transformer for multi-image super-resolution
    Song, Qing
    Xiu, Mingming
    Nie, Yang
    Hu, Mengjie
    Liu, Chun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (31) : 76891 - 76903
  • [22] SVTSR: image super-resolution using scattering vision transformer
    Liang, Jiabao
    Jin, Yutao
    Chen, Xiaoyan
    Huang, Haotian
    Deng, Yue
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [23] Image Super-Resolution Using a Simple Transformer Without Pretraining
    Huan Liu
    Mingwen Shao
    Chao Wang
    Feilong Cao
    Neural Processing Letters, 2023, 55 : 1479 - 1497
  • [24] Image Super-Resolution Using a Simple Transformer Without Pretraining
    Liu, Huan
    Shao, Mingwen
    Wang, Chao
    Cao, Feilong
    NEURAL PROCESSING LETTERS, 2023, 55 (02) : 1479 - 1497
  • [25] Image super-resolution using dilated neighborhood attention transformer
    Chen, Li
    Zuo, Jinnian
    Du, Kai
    Zou, Jinsong
    Yin, Shaoyun
    Wang, Jinyu
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [26] IMAGE SUPER-RESOLUTION BASED ON CONVOLUTION NEURAL NETWORKS USING MULTI-CHANNEL INPUT
    Youm, Gwang-Young
    Bae, Sung-Ho
    Kim, Munchurl
    2016 IEEE 12TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2016,
  • [27] Super-Resolution Image Restoration Using Convolutional Neural Network
    Yu, Nedzelskyi O.
    Lashchevska, N. O.
    VISNYK NTUU KPI SERIIA-RADIOTEKHNIKA RADIOAPARATOBUDUVANNIA, 2023, (91): : 79 - 86
  • [28] Terahertz image super-resolution restoration using a hybrid-Transformer-based generative adversarial network
    Wu, Heng
    Zheng, Jing
    He, Chunhua
    Xiao, Huapan
    Luo, Shaojuan
    OPTICS AND LASERS IN ENGINEERING, 2025, 189
  • [29] Spatial relaxation transformer for image super-resolution
    Li, Yinghua
    Zhang, Ying
    Zeng, Hao
    He, Jinglu
    Guo, Jie
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (07)
  • [30] Dual Aggregation Transformer for Image Super-Resolution
    Chen, Zheng
    Zhang, Yulun
    Gu, Jinjin
    Kong, Linghe
    Yang, Xiaokang
    Yu, Fisher
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 12278 - 12287