Transformer-based image super-resolution and its lightweight

被引:2
作者
Zhang, Dongxiao [1 ]
Qi, Tangyao [1 ]
Gao, Juhao [1 ]
机构
[1] Jimei Univ, Sch Sci, Xiamen 361021, Peoples R China
关键词
Super-resolution; Transformer; Lightweight; Content-based early-stopping; Up and down iteration; NETWORK; RESOLUTION;
D O I
10.1007/s11042-024-18140-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transformer has shown remarkable performance improvements over convolutional neural network (CNN) in natural language processing and high-level vision tasks. However, its application in low-level vision tasks, such as single image super-resolution (SISR), is still under-explored. In this paper, we introduce an up-down iterative algorithm and design a residual down and up Transformer block (RDUTB) in the Transformer framework. Then we propose a network for SISR based on RDUTB, which can effectively reconstruct low resolution (LR) images. Furthermore, to address the increasing demand for SISR models that can run on low-end mobile devices, we simplify the proposed model structure and adopt a content-based early-stopping strategy in the proposed SISR model to reduce the parameters and accelerate the reconstruction process while maintaining high quality. Experimental results show that our proposed Transformer-based SISR network and its lightweight version achieve superior performance over both traditional CNN-based SISR methods and some of the latest Transformer-based SISR methods.
引用
收藏
页码:68625 / 68649
页数:25
相关论文
共 54 条
[31]   Transformer for Single Image Super-Resolution [J].
Lu, Zhisheng ;
Li, Juncheng ;
Liu, Hong ;
Huang, Chaoyan ;
Zhang, Linlin ;
Zeng, Tieyong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :456-465
[32]  
Martin D, 2001, EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL II, PROCEEDINGS, P416, DOI 10.1109/ICCV.2001.937655
[33]   Sketch-based manga retrieval using manga109 dataset [J].
Matsui, Yusuke ;
Ito, Kota ;
Aramaki, Yuji ;
Fujimoto, Azuma ;
Ogawa, Toru ;
Yamasaki, Toshihiko ;
Aizawa, Kiyoharu .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (20) :21811-21838
[34]   Image Super-Resolution with Non-Local Sparse Attention [J].
Mei, Yiqun ;
Fan, Yuchen ;
Zhou, Yuqian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3516-3525
[35]   Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network [J].
Shi, Wenzhe ;
Caballero, Jose ;
Huszar, Ferenc ;
Totz, Johannes ;
Aitken, Andrew P. ;
Bishop, Rob ;
Rueckert, Daniel ;
Wang, Zehan .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1874-1883
[36]   MemNet: A Persistent Memory Network for Image Restoration [J].
Tai, Ying ;
Yang, Jian ;
Liu, Xiaoming ;
Xu, Chunyan .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4549-4557
[37]   Image Super-Resolution via Deep Recursive Residual Network [J].
Tai, Ying ;
Yang, Jian ;
Liu, Xiaoming .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2790-2798
[38]  
Tong J, 2022, MultiMedia Modeling, V2022, P134
[39]  
Vaswani A, 2017, ADV NEUR IN, V30
[40]   Deep Networks for Image Super-Resolution with Sparse Prior [J].
Wang, Zhaowen ;
Liu, Ding ;
Yang, Jianchao ;
Han, Wei ;
Huang, Thomas .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :370-378