TCSR: Lightweight Transformer and CNN Interaction Network for Image Super-Resolution

被引:0
作者
Cai, Danlin [1 ,2 ]
Tan, Wenwen [3 ]
Chen, Feiyang [3 ]
Lou, Xinchi [1 ]
Xiahou, Jianbin [1 ,2 ,4 ]
Zhu, Daxin [1 ,2 ,4 ]
Huang, Detian [3 ]
机构
[1] Quanzhou Normal Univ, Sch Math & Comp Sci, Quanzhou 362000, Peoples R China
[2] Quanzhou Normal Univ, Fujian Prov Key Lab Data Intens Comp, Quanzhou 362000, Peoples R China
[3] Huaqiao Univ, Coll Engn, Quanzhou 362021, Peoples R China
[4] Quanzhou Normal Univ, Fujian Univ Lab Intelligent Comp & Informat Proc, Quanzhou 362000, Peoples R China
基金
中国国家自然科学基金;
关键词
Transformers; Feature extraction; Computational modeling; Convolutional neural networks; Superresolution; Image reconstruction; Computational efficiency; Codecs; Training; Transformer cores; Lightweight image super-resolution; transformer; convolutional neural network; UNet;
D O I
10.1109/ACCESS.2024.3476369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural network (CNN) has achieved impressive success in lightweight image super-resolution (SR) methods, yet the nature of its local operations constrains the SR performance. Recent Transformer has attracted increasing attention in lightweight SR methods owing to its remarkable global feature extraction capacity. However, the huge computational cost makes it challenging for lightweight SR methods to efficiently utilize Transformer to exploit global contextual information from shallow to intermediate layers. To address these issues, we propose a novel lightweight Transformer and CNN interaction network for image Super-Resolution (TCSR), which fully leverages the complementary strengths of Transformer and CNN. Specifically, an efficient lightweight Transformer and CNN Interaction Block (TCIB) is designed to extract local and global features at various stages of the network, resulting in favorable hybrid features that significantly improve the quality of reconstructed images. Then, we construct a lightweight Reversed UNet (RUNet) to progressively aggregate hybrid features as well as to better trade-off the reconstruction accuracy and efficiency. Furthermore, we introduce a Refinement module to further refine edge and texture details with global information. Experimental results on numerous benchmarks validate that the proposed TCSR achieves superior performance with fewer parameters and less computational overhead than state-of-the-art lightweight methods.
引用
收藏
页码:174782 / 174795
页数:14
相关论文
共 57 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]   Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network [J].
Ahn, Namhyuk ;
Kang, Byungkon ;
Sohn, Kyung-Ah .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :256-272
[3]  
Ba J L., LAYER NORMALIZATION
[4]   Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].
Bevilacqua, Marco ;
Roumy, Aline ;
Guillemot, Christine ;
Morel, Marie-Line Alberi .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[5]   Unsupervised multi-level spatio-spectral fusion transformer for hyperspectral image super-resolution [J].
Cao, Xuheng ;
Lian, Yusheng ;
Li, Jin ;
Wang, Kaixuan ;
Ma, Chao .
OPTICS AND LASER TECHNOLOGY, 2024, 176
[6]   Unsupervised Hybrid Network of Transformer and CNN for Blind Hyperspectral and Multispectral Image Fusion [J].
Cao, Xuheng ;
Lian, Yusheng ;
Wang, Kaixuan ;
Ma, Chao ;
Xu, Xianqing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 :1-15
[7]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[8]   Pre-Trained Image Processing Transformer [J].
Chen, Hanting ;
Wang, Yunhe ;
Guo, Tianyu ;
Xu, Chang ;
Deng, Yiping ;
Liu, Zhenhua ;
Ma, Siwei ;
Xu, Chunjing ;
Xu, Chao ;
Gao, Wen .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305
[9]  
Chen HY, 2021, Arxiv, DOI arXiv:2104.09497
[10]  
Chen J., 2021, arXiv