Transformer-based image super-resolution and its lightweight

被引:2
作者
Zhang, Dongxiao [1 ]
Qi, Tangyao [1 ]
Gao, Juhao [1 ]
机构
[1] Jimei Univ, Sch Sci, Xiamen 361021, Peoples R China
关键词
Super-resolution; Transformer; Lightweight; Content-based early-stopping; Up and down iteration; NETWORK; RESOLUTION;
D O I
10.1007/s11042-024-18140-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transformer has shown remarkable performance improvements over convolutional neural network (CNN) in natural language processing and high-level vision tasks. However, its application in low-level vision tasks, such as single image super-resolution (SISR), is still under-explored. In this paper, we introduce an up-down iterative algorithm and design a residual down and up Transformer block (RDUTB) in the Transformer framework. Then we propose a network for SISR based on RDUTB, which can effectively reconstruct low resolution (LR) images. Furthermore, to address the increasing demand for SISR models that can run on low-end mobile devices, we simplify the proposed model structure and adopt a content-based early-stopping strategy in the proposed SISR model to reduce the parameters and accelerate the reconstruction process while maintaining high quality. Experimental results show that our proposed Transformer-based SISR network and its lightweight version achieve superior performance over both traditional CNN-based SISR methods and some of the latest Transformer-based SISR methods.
引用
收藏
页码:68625 / 68649
页数:25
相关论文
共 54 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]   Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].
Ahn, Namhyuk ;
Kang, Byungkon ;
Sohn, Kyung-Ah .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272
[3]   Single Image Super-Resolution via a Holistic Attention Network [J].
Niu, Ben ;
Wen, Weilei ;
Ren, Wenqi ;
Zhang, Xiangde ;
Yang, Lianping ;
Wang, Shuzhen ;
Zhang, Kaihao ;
Cao, Xiaochun ;
Shen, Haifeng .
COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :191-207
[4]   Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].
Bevilacqua, Marco ;
Roumy, Aline ;
Guillemot, Christine ;
Morel, Marie-Line Alberi .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[5]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[6]   ARM: Any-Time Super-Resolution Method [J].
Chen, Bohong ;
Lin, Mingbao ;
Sheng, Kekai ;
Zhang, Mengdan ;
Chen, Peixian ;
Li, Ke ;
Cao, Liujuan ;
Ji, Rongrong .
COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 :254-270
[7]   Pre-Trained Image Processing Transformer [J].
Chen, Hanting ;
Wang, Yunhe ;
Guo, Tianyu ;
Xu, Chang ;
Deng, Yiping ;
Liu, Zhenhua ;
Ma, Siwei ;
Xu, Chunjing ;
Xu, Chao ;
Gao, Wen .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305
[8]   Activating More Pixels in Image Super-Resolution Transformer [J].
Chen, Xiangyu ;
Wang, Xintao ;
Zhou, Jiantao ;
Qiao, Yu ;
Dong, Chao .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22367-22377
[9]   Dual Aggregation Transformer for Image Super-Resolution [J].
Chen, Zheng ;
Zhang, Yulun ;
Gu, Jinjin ;
Kong, Linghe ;
Yang, Xiaokang ;
Yu, Fisher .
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :12278-12287
[10]  
Chen Zheng, 2022, Advances in Neural Information Processing Systems