Hybrid attention transformer with re-parameterized large kernel convolution for image super-resolution

被引：1

作者：

Ma, Zhicheng ^{[1
,2
]}

Liu, Zhaoxiang ^{[1
,2
]}

Wang, Kai ^{[1
,2
]}

Lian, Shiguo ^{[1
,2
]}

机构：

[1] China Unicom, AI Innovat Ctr, Beijing 100013, Peoples R China

[2] China Unicom, Unicom Digital Technol, Beijing 100013, Peoples R China

来源：

IMAGE AND VISION COMPUTING | 2024年 / 149卷

关键词：

Image super -resolution; Transformer; Hybrid attention; Large kernel convolution; Re; -parameterization;

D O I：

10.1016/j.imavis.2024.105162

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Single image super-resolution is a well-established low-level vision task that aims to reconstruct high-resolution images from low-resolution images. Methods based on Transformer have shown remarkable success and achieved outstanding performance in SISR tasks. While Transformer effectively models global information, it is less effective at capturing high frequencies such as stripes that primarily provide local information. Additionally, it has the potential to further enhance the capture of global information. To tackle this, we propose a novel Large Kernel Hybrid Attention Transformer using re-parameterization. It combines different kernel sizes and different steps re-parameterized convolution layers with Transformer to effectively capture global and local information to learn comprehensive features with low-frequency and high-frequency information. Moreover, in order to solve the problem of using batch normalization layer to introduce artifacts in SISR, we propose a new training strategy which is fusing convolution layer and batch normalization layer after certain training epochs. This strategy can enjoy the acceleration convergence effect of batch normalization layer in training and effectively eliminate the problem of artifacts in the inference stage. For re-parameterization of multiple parallel branch convolution layers, adopting this strategy can further reduce the amount of calculation of training. By coupling these core improvements, our LKHAT achieves state-of-the-art performance for single image super-resolution task.

引用

页数：9

共 50 条

[1] Design of lightweight re-parameterized remote sensing image super-resolution network
Yi J.
Chen J.
Cao F.
Li J.
Xie W.
Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, (02): : 268 - 285
[2] LKFormer: large kernel transformer for infrared image super-resolution
Qin, Feiwei
Yan, Kang
Wang, Changmiao
Ge, Ruiquan
Peng, Yong
Zhang, Kai
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) : 72063 - 72077
[3] LKASR: Large kernel attention for lightweight image super-resolution
Feng, Hao
Wang, Liejun
Li, Yongming
Du, Anyu
KNOWLEDGE-BASED SYSTEMS, 2022, 252
[4] HYBRID CONVOLUTION-TRANSFORMER FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
Li, Jiuqiang
Ke, Yutong
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2395 - 2399
[5] Lightweight Remote-Sensing Image Super-Resolution via Re-Parameterized Feature Distillation Network
Zhang, Tianlin
Bian, Chunjiang
Zhang, Xiaoming
Chen, Hongzhen
Chen, Shi
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[6] Large coordinate kernel attention network for lightweight image super-resolution
Hao, Fangwei
Wu, Jiesheng
Lu, Haotian
Du, Ji
Xu, Jing
Xu, Xiaoxuan
arXiv,
[7] Image super-resolution with parallel convolution attention network
Zhang, Qiao
Yang, Xiaomin
Xiao, Long
Yang, Feng
Hussain, Farhan
Won Kim, Pyoung
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
[8] A multi-scale enhanced large-kernel attention transformer network for lightweight image super-resolution
Chang, Kairong
Jun, Sun
Biao, Yang
Hu, Mingzhi
Yang, Junlong
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (03)
[9] EHAT:Enhanced Hybrid Attention Transformer for Remote Sensing Image Super-Resolution
Wang, Jian
Xie, Zexin
Du, Yanlin
Song, Wei
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT VIII, 2025, 15038 : 225 - 237
[10] Kernel Attention Network for Single Image Super-Resolution
Zhang, Dongyang
Shao, Jie
Shen, Heng Tao
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (03)

← 1 2 3 4 5 →