Multi-granularity Transformer for Image Super-Resolution

Cited by: 0

Authors:

Zhuge, Yunzhi [1]

Jia, Xu [2]

Affiliations:

[1] Univ Adelaide, Adelaide, SA, Australia

[2] Dalian Univ Technol, Sch Artificial Intelligence, Dalian, Peoples R China

Source:

COMPUTER VISION - ACCV 2022, PT III | 2023 / Vol. 13843

Keywords:

SPARSE;

DOI:

10.1007/978-3-031-26313-2_9

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Transformers have recently achieved great success in computer vision. So far, most of this work has focused on high-level tasks, e.g., image classification and object detection, and fewer attempts have been made to solve low-level problems. In this work, we tackle image super-resolution. Specifically, we explore transformer architectures with multi-granularity transformer groups for complementary information interaction, to improve the accuracy of super-resolution. We exploit three transformer patterns: window transformers, dilated transformers, and global transformers. We further investigate their combination and propose a Multi-granularity Transformer (MugFormer). The window transformer layer is combined with other transformer layers to compose three transformer groups, namely the Local Transformer Group, Dilated Transformer Group, and Global Transformer Group, which efficiently aggregate both local and global information for accurate reconstruction. Extensive experiments on five benchmark datasets demonstrate that MugFormer performs favorably against state-of-the-art methods in both quantitative and qualitative results.
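The abstract names three attention patterns (window, dilated, global) but does not say how they are implemented. The sketch below is only a generic illustration of how tokens on a small feature grid might be grouped under each pattern. It is not the MugFormer implementation. The function names and the modulo-based dilation scheme are assumptions made for illustration.

```python
# Illustrative sketch only: generic token-grouping patterns on an H x W grid.
# These helpers are hypothetical and are NOT taken from the MugFormer paper.

def window_groups(height, width, window):
    """Group coordinates into non-overlapping window x window blocks (local pattern)."""
    groups = {}
    for r in range(height):
        for c in range(width):
            groups.setdefault((r // window, c // window), []).append((r, c))
    return list(groups.values())


def dilated_groups(height, width, dilation):
    """Group coordinates sharing the same offset modulo `dilation` (sparse, longer-range pattern)."""
    groups = {}
    for r in range(height):
        for c in range(width):
            groups.setdefault((r % dilation, c % dilation), []).append((r, c))
    return list(groups.values())


def global_group(height, width):
    """Put every coordinate into a single group (global pattern)."""
    return [[(r, c) for r in range(height) for c in range(width)]]


if __name__ == "__main__":
    patterns = [
        ("window", window_groups(4, 4, 2)),
        ("dilated", dilated_groups(4, 4, 2)),
        ("global", global_group(4, 4)),
    ]
    for name, groups in patterns:
        # Print the group count and the members of the first group.
        print(name, len(groups), groups[0])
```

In an attention layer, tokens would attend only to other tokens in the same group. On a 4 x 4 grid with size 2, the window pattern groups adjacent pixels while the dilated pattern groups pixels spaced two apart. The two patterns therefore cost the same per group but cover different ranges.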

Citation

Pages: 138-154

Page count: 17

References: 53 in total
