Hybrid attention transformer with re-parameterized large kernel convolution for image super-resolution

被引:1
|
作者
Ma, Zhicheng [1 ,2 ]
Liu, Zhaoxiang [1 ,2 ]
Wang, Kai [1 ,2 ]
Lian, Shiguo [1 ,2 ]
机构
[1] China Unicom, AI Innovat Ctr, Beijing 100013, Peoples R China
[2] China Unicom, Unicom Digital Technol, Beijing 100013, Peoples R China
关键词
Image super -resolution; Transformer; Hybrid attention; Large kernel convolution; Re; -parameterization;
D O I
10.1016/j.imavis.2024.105162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Single image super-resolution is a well-established low-level vision task that aims to reconstruct high-resolution images from low-resolution images. Methods based on Transformer have shown remarkable success and achieved outstanding performance in SISR tasks. While Transformer effectively models global information, it is less effective at capturing high frequencies such as stripes that primarily provide local information. Additionally, it has the potential to further enhance the capture of global information. To tackle this, we propose a novel Large Kernel Hybrid Attention Transformer using re-parameterization. It combines different kernel sizes and different steps re-parameterized convolution layers with Transformer to effectively capture global and local information to learn comprehensive features with low-frequency and high-frequency information. Moreover, in order to solve the problem of using batch normalization layer to introduce artifacts in SISR, we propose a new training strategy which is fusing convolution layer and batch normalization layer after certain training epochs. This strategy can enjoy the acceleration convergence effect of batch normalization layer in training and effectively eliminate the problem of artifacts in the inference stage. For re-parameterization of multiple parallel branch convolution layers, adopting this strategy can further reduce the amount of calculation of training. By coupling these core improvements, our LKHAT achieves state-of-the-art performance for single image super-resolution task.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Design of lightweight re-parameterized remote sensing image super-resolution network
    Yi J.
    Chen J.
    Cao F.
    Li J.
    Xie W.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, (02): : 268 - 285
  • [2] LKFormer: large kernel transformer for infrared image super-resolution
    Qin, Feiwei
    Yan, Kang
    Wang, Changmiao
    Ge, Ruiquan
    Peng, Yong
    Zhang, Kai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) : 72063 - 72077
  • [3] LKASR: Large kernel attention for lightweight image super-resolution
    Feng, Hao
    Wang, Liejun
    Li, Yongming
    Du, Anyu
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [4] HYBRID CONVOLUTION-TRANSFORMER FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
    Li, Jiuqiang
    Ke, Yutong
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2395 - 2399
  • [5] Lightweight Remote-Sensing Image Super-Resolution via Re-Parameterized Feature Distillation Network
    Zhang, Tianlin
    Bian, Chunjiang
    Zhang, Xiaoming
    Chen, Hongzhen
    Chen, Shi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [6] Large coordinate kernel attention network for lightweight image super-resolution
    Hao, Fangwei
    Wu, Jiesheng
    Lu, Haotian
    Du, Ji
    Xu, Jing
    Xu, Xiaoxuan
    arXiv,
  • [7] Image super-resolution with parallel convolution attention network
    Zhang, Qiao
    Yang, Xiaomin
    Xiao, Long
    Yang, Feng
    Hussain, Farhan
    Won Kim, Pyoung
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
  • [8] A multi-scale enhanced large-kernel attention transformer network for lightweight image super-resolution
    Chang, Kairong
    Jun, Sun
    Biao, Yang
    Hu, Mingzhi
    Yang, Junlong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (03)
  • [9] EHAT:Enhanced Hybrid Attention Transformer for Remote Sensing Image Super-Resolution
    Wang, Jian
    Xie, Zexin
    Du, Yanlin
    Song, Wei
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 225 - 237
  • [10] Kernel Attention Network for Single Image Super-Resolution
    Zhang, Dongyang
    Shao, Jie
    Shen, Heng Tao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (03)