GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond

被引:12
作者
Chan, Kelvin C. K. [1 ]
Xu, Xiangyu [1 ]
Wang, Xintao [2 ]
Gu, Jinwei [3 ,4 ]
Loy, Chen Change [1 ]
机构
[1] Nanyang Technol Univ NTU, S Lab, Singapore 639798, Singapore
[2] Tencent PCG, Appl Res Ctr, Shenzhen 518054, Guangdong, Peoples R China
[3] Tetras AI, San Francisco, CA 94105 USA
[4] Shanghai AI Lab, Shanghai 200237, Peoples R China
关键词
Image restoration; Generative adversarial networks; Task analysis; Superresolution; Generators; Faces; Optimization; Super-resolution; colorization; restoration; generative adversarial networks; generative prior;
D O I
10.1109/TPAMI.2022.3186715
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show that pre-trained Generative Adversarial Networks (GANs) such as StyleGAN and BigGAN can be used as a latent bank to improve the performance of image super-resolution. While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveraging rich and diverse priors encapsulated in a pre-trained GAN. But unlike prevalent GAN inversion methods that require expensive image-specific optimization at runtime, our approach only needs a single forward pass for restoration. GLEAN can be easily incorporated in a simple encoder-bank-decoder architecture with multi-resolution skip connections. Employing priors from different generative models allows GLEAN to be applied to diverse categories (e.g., human faces, cats, buildings, and cars). We further present a lightweight version of GLEAN, named LightGLEAN, which retains only the critical components in GLEAN. Notably, LightGLEAN consists of only 21% of parameters and 35% of FLOPs while achieving comparable image quality. We extend our method to different tasks including image colorization and blind image restoration, and extensive experiments show that our proposed models perform favorably in comparison to existing methods. Codes and models are available at https://github.com/open-mmlab/mmediting.
引用
收藏
页码:3154 / 3168
页数:15
相关论文
共 72 条
  • [1] [Anonymous], 2015, PROC ICLR
  • [2] Antic J, 2018, DeOldify
  • [3] Semantic Photo Manipulation with a Generative Image Prior
    Bau, David
    Strobelt, Hendrik
    Peebles, William
    Wulff, Jonas
    Zhou, Bolei
    Zhu, Jun-Yan
    Torralba, Antonio
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (04):
  • [4] Brock A., 2019, PROC INT CONFLEARN R
  • [5] GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution
    Chan, Kelvin C. K.
    Wang, Xintao
    Xu, Xiangyu
    Gu, Jinwei
    Loy, Chen Change
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14240 - 14249
  • [6] Progressive Semantic-Aware Style Transformation for Blind Face Restoration
    Chen, Chaofeng
    Li, Xiaoming
    Yang, Lingbo
    Lin, Xianhui
    Zhang, Lei
    Wong, Kwan-Yee K.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11891 - 11900
  • [7] Deep Colorization
    Cheng, Zezhou
    Yang, Qingxiong
    Sheng, Bin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 415 - 423
  • [8] Pixel Recursive Super Resolution
    Dahl, Ryan
    Norouzi, Mohammad
    Shlens, Jonathon
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5449 - 5458
  • [9] Second-order Attention Network for Single Image Super-Resolution
    Dai, Tao
    Cai, Jianrui
    Zhang, Yongbing
    Xia, Shu-Tao
    Zhang, Lei
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11057 - 11066
  • [10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848