Multi-granularity Transformer for Image Super-Resolution

Cited by: 0
Authors
Zhuge, Yunzhi [1]
Jia, Xu [2]
Affiliations
[1] Univ Adelaide, Adelaide, SA, Australia
[2] Dalian Univ Technol, Sch Artificial Intelligence, Dalian, Peoples R China
Source
COMPUTER VISION - ACCV 2022, PT III | 2023 / Vol. 13843
Keywords
SPARSE;
DOI
10.1007/978-3-031-26313-2_9
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, transformers have achieved great success in computer vision. Thus far, most of these works focus on high-level tasks, e.g., image classification and object detection, and fewer attempts have been made to solve low-level problems. In this work, we tackle image super-resolution. Specifically, transformer architectures with multi-granularity transformer groups are explored for complementary information interaction to improve the accuracy of super-resolution. We exploit three transformer patterns, i.e., window transformers, dilated transformers and global transformers. We further investigate their combination and propose a Multi-granularity Transformer (MugFormer). Specifically, the window transformer layer is aggregated with other transformer layers to compose three transformer groups, namely, the Local Transformer Group, Dilated Transformer Group and Global Transformer Group, which efficiently aggregate both local and global information for accurate reconstruction. Extensive experiments on five benchmark datasets demonstrate that MugFormer performs favorably against state-of-the-art methods in terms of both quantitative and qualitative results.
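The abstract names three attention patterns (window, dilated and global) without giving their definitions. As a rough illustration of how such patterns usually differ, the sketch below groups the token positions of a small feature map in three ways. The function names, the 4 x 4 grid, and the exact grouping rules are assumptions made for this example and are not taken from the MugFormer paper; in a real model, self-attention would then be computed within each group.

```python
# Illustrative only: three ways to group the tokens of an h x w feature map.
# The grouping rules are assumptions for this sketch, not the paper's method.

def window_groups(h, w, size):
    """Local pattern: each group is a contiguous size x size window."""
    groups = {}
    for r in range(h):
        for c in range(w):
            groups.setdefault((r // size, c // size), []).append((r, c))
    return list(groups.values())


def dilated_groups(h, w, stride):
    """Dilated pattern: each group takes every `stride`-th token per axis."""
    groups = {}
    for r in range(h):
        for c in range(w):
            groups.setdefault((r % stride, c % stride), []).append((r, c))
    return list(groups.values())


def global_groups(h, w):
    """Global pattern: one group that contains every token."""
    return [[(r, c) for r in range(h) for c in range(w)]]


if __name__ == "__main__":
    # Print the number of groups and the first group for each pattern.
    for name, groups in [
        ("window", window_groups(4, 4, 2)),
        ("dilated", dilated_groups(4, 4, 2)),
        ("global", global_groups(4, 4)),
    ]:
        print(name, len(groups), groups[0])
```

With the same group size, a window group covers adjacent positions while a dilated group spreads across the whole map, which is one common way to trade locality against receptive field at equal cost.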
Pages: 138-154
Number of pages: 17