Transformer-based image super-resolution and its lightweight

被引：2

作者：

Zhang, Dongxiao ^{[1
]}

Qi, Tangyao ^{[1
]}

Gao, Juhao ^{[1
]}

机构：

[1] Jimei Univ, Sch Sci, Xiamen 361021, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2024年 / 83卷 / 26期

关键词：

Super-resolution; Transformer; Lightweight; Content-based early-stopping; Up and down iteration; NETWORK; RESOLUTION;

D O I：

10.1007/s11042-024-18140-z

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Transformer has shown remarkable performance improvements over convolutional neural network (CNN) in natural language processing and high-level vision tasks. However, its application in low-level vision tasks, such as single image super-resolution (SISR), is still under-explored. In this paper, we introduce an up-down iterative algorithm and design a residual down and up Transformer block (RDUTB) in the Transformer framework. Then we propose a network for SISR based on RDUTB, which can effectively reconstruct low resolution (LR) images. Furthermore, to address the increasing demand for SISR models that can run on low-end mobile devices, we simplify the proposed model structure and adopt a content-based early-stopping strategy in the proposed SISR model to reduce the parameters and accelerate the reconstruction process while maintaining high quality. Experimental results show that our proposed Transformer-based SISR network and its lightweight version achieve superior performance over both traditional CNN-based SISR methods and some of the latest Transformer-based SISR methods.

引用

页码：68625 / 68649

页数：25

共 54 条

[1] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].

Agustsson, Eirikur ;

Timofte, Radu .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131

[2] Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].

Ahn, Namhyuk ;

Kang, Byungkon ;

Sohn, Kyung-Ah .

COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272

[3] Single Image Super-Resolution via a Holistic Attention Network [J].

Niu, Ben ;

Wen, Weilei ;

Ren, Wenqi ;

Zhang, Xiangde ;

Yang, Lianping ;

Wang, Shuzhen ;

Zhang, Kaihao ;

Cao, Xiaochun ;

Shen, Haifeng .

COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :191-207

[4] Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].

Bevilacqua, Marco ;

Roumy, Aline ;

Guillemot, Christine ;

Morel, Marie-Line Alberi .

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,

[5] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[6] ARM: Any-Time Super-Resolution Method [J].

Chen, Bohong ;

Lin, Mingbao ;

Sheng, Kekai ;

Zhang, Mengdan ;

Chen, Peixian ;

Li, Ke ;

Cao, Liujuan ;

Ji, Rongrong .

COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 :254-270

[7] Pre-Trained Image Processing Transformer [J].

Chen, Hanting ;

Wang, Yunhe ;

Guo, Tianyu ;

Xu, Chang ;

Deng, Yiping ;

Liu, Zhenhua ;

Ma, Siwei ;

Xu, Chunjing ;

Xu, Chao ;

Gao, Wen .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305

[8] Activating More Pixels in Image Super-Resolution Transformer [J].

Chen, Xiangyu ;

Wang, Xintao ;

Zhou, Jiantao ;

Qiao, Yu ;

Dong, Chao .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22367-22377

[9] Dual Aggregation Transformer for Image Super-Resolution [J].

Chen, Zheng ;

Zhang, Yulun ;

Gu, Jinjin ;

Kong, Linghe ;

Yang, Xiaokang ;

Yu, Fisher .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :12278-12287

[10]

Chen Zheng, 2022, Advances in Neural Information Processing Systems

← 1 2 3 4 5 6 →