Transformer-based image super-resolution and its lightweight

被引：2

作者：

Zhang, Dongxiao ^{[1
]}

Qi, Tangyao ^{[1
]}

Gao, Juhao ^{[1
]}

机构：

[1] Jimei Univ, Sch Sci, Xiamen 361021, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2024年 / 83卷 / 26期

关键词：

Super-resolution; Transformer; Lightweight; Content-based early-stopping; Up and down iteration; NETWORK; RESOLUTION;

D O I：

10.1007/s11042-024-18140-z

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Transformer has shown remarkable performance improvements over convolutional neural network (CNN) in natural language processing and high-level vision tasks. However, its application in low-level vision tasks, such as single image super-resolution (SISR), is still under-explored. In this paper, we introduce an up-down iterative algorithm and design a residual down and up Transformer block (RDUTB) in the Transformer framework. Then we propose a network for SISR based on RDUTB, which can effectively reconstruct low resolution (LR) images. Furthermore, to address the increasing demand for SISR models that can run on low-end mobile devices, we simplify the proposed model structure and adopt a content-based early-stopping strategy in the proposed SISR model to reduce the parameters and accelerate the reconstruction process while maintaining high quality. Experimental results show that our proposed Transformer-based SISR network and its lightweight version achieve superior performance over both traditional CNN-based SISR methods and some of the latest Transformer-based SISR methods.

引用

页码：68625 / 68649

页数：25

共 54 条

[31] Transformer for Single Image Super-Resolution [J].

Lu, Zhisheng ;

Li, Juncheng ;

Liu, Hong ;

Huang, Chaoyan ;

Zhang, Linlin ;

Zeng, Tieyong .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :456-465

[32]

Martin D, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P416, DOI 10.1109/ICCV.2001.937655

[33] Sketch-based manga retrieval using manga109 dataset [J].

Matsui, Yusuke ;

Ito, Kota ;

Aramaki, Yuji ;

Fujimoto, Azuma ;

Ogawa, Toru ;

Yamasaki, Toshihiko ;

Aizawa, Kiyoharu .

MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (20) :21811-21838

[34] Image Super-Resolution with Non-Local Sparse Attention [J].

Mei, Yiqun ;

Fan, Yuchen ;

Zhou, Yuqian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3516-3525

[35] Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network [J].

Shi, Wenzhe ;

Caballero, Jose ;

Huszar, Ferenc ;

Totz, Johannes ;

Aitken, Andrew P. ;

Bishop, Rob ;

Rueckert, Daniel ;

Wang, Zehan .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1874-1883

[36] MemNet: A Persistent Memory Network for Image Restoration [J].

Tai, Ying ;

Yang, Jian ;

Liu, Xiaoming ;

Xu, Chunyan .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4549-4557

[37] Image Super-Resolution via Deep Recursive Residual Network [J].

Tai, Ying ;

Yang, Jian ;

Liu, Xiaoming .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2790-2798

[38]

Tong J, 2022, MultiMedia Modeling, V2022, P134

[39]

Vaswani A, 2017, ADV NEUR IN, V30

[40] Deep Networks for Image Super-Resolution with Sparse Prior [J].

Wang, Zhaowen ;

Liu, Ding ;

Yang, Jianchao ;

Han, Wei ;

Huang, Thomas .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :370-378

← 1 2 3 4 5 6 →