HCT: image super-resolution restoration using hierarchical convolution transformer networks

被引:0
|
作者
Guo, Ying [1 ,2 ]
Tian, Chang [1 ]
Wang, Han [1 ]
Liu, Jie [1 ]
Di, Chong [3 ]
Ning, Keqing [1 ]
机构
[1] North China Univ Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Beijing 100084, Peoples R China
[3] Qilu Univ Technol, Shandong Artificial Intelligence Inst, Shandong Acad Sci, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Hierarchical convolution network; Swin transformer; Image super-resolution;
D O I
10.1007/s10044-025-01413-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the computer vision domain, image super-resolution (SR) technology, which restores high-resolution details from low-resolution images, plays a vital role in practical applications such as medical imaging, public safety, and remote sensing. Traditional methods employ convolutional neural networks to address these issues, while Visual Transformers show potential performance in high-level vision tasks. However, compared to typical CNN architecture networks, Visual Transformers exhibit weaker reliance on high-frequency information in images, leading to blurred details and residual artifacts. To solve this issue, we use a hierarchical network structure, which allows for a more flexible feeling field for our approach. Firstly, our method complements lost spatial features using a Convolutional Swin Transformer Layer incorporating a Convolutional Feed Forward Network. This allows for the retrieval of missing spatial information and enhances the model's representational capabilities. Next, deep feature extraction is performed by combining multiple layers into a Residual Convolutional Swin Transformer Block. Finally, we employ a hierarchical-type structure to combine the features of each branch. Experiments validate the effectiveness of the proposed method in generating images with greater detail aligned with human perception. Based on the experiments, our method is effective on SR tasks with magnification factors of 2, 3, and 4. Our method can reconstruct a clear and complete edge structure. We provide code at https://github.com/Q88392/HCT.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Single Image Super-resolution Using Spatial Transformer Networks
    Wang, Qiang
    Fan, Huijie
    Cong, Yang
    Tang, Yandong
    2017 IEEE 7TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (CYBER), 2017, : 564 - 567
  • [2] Asymmetric convolution Swin transformer for medical image super-resolution
    Lu, Weijia
    Jiang, Jiehui
    Tian, Hao
    Gu, Jun
    Lu, Yuhong
    Yang, Wanli
    Gong, Ming
    Han, Tianyi
    Jiang, Xiaojuan
    Zhang, Tingting
    ALEXANDRIA ENGINEERING JOURNAL, 2023, 85 : 177 - 184
  • [3] Deep networks for image super-resolution using hierarchical features
    Yang, Xin
    Zhang, Yifan
    Zhou, Dake
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2022, 70 (01)
  • [4] HCT: a hybrid CNN and transformer network for hyperspectral image super-resolution
    Wu, Huapeng
    Wang, Chenyun
    Lu, Chenyang
    Zhan, Tianming
    MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [5] HADT: Image super-resolution restoration using Hybrid Attention-Dense Connected Transformer Networks
    Guo, Ying
    Tian, Chang
    Liu, Jie
    Di, Chong
    Ning, Keqing
    NEUROCOMPUTING, 2025, 614
  • [6] HYBRID CONVOLUTION-TRANSFORMER FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
    Li, Jiuqiang
    Ke, Yutong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2395 - 2399
  • [7] HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution
    Zhang, Xiang
    Zhang, Yulun
    Yu, Fisher
    COMPUTER VISION - ECCV 2024, PT XL, 2025, 15098 : 483 - 500
  • [8] Image Super-Resolution Using Dilated Window Transformer
    Park, Soobin
    Choi, Yong Suk
    IEEE ACCESS, 2023, 11 (60028-60039): : 60028 - 60039
  • [9] Transformer for Single Image Super-Resolution
    Lu, Zhisheng
    Li, Juncheng
    Liu, Hong
    Huang, Chaoyan
    Zhang, Linlin
    Zeng, Tieyong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 456 - 465
  • [10] Post-trained convolution networks for single image super-resolution
    Zandavi, Seid Miad
    ARTIFICIAL INTELLIGENCE, 2023, 318