GLEAN: Generative Latent Bank for Image Super-Resolution and Beyond

被引：12

作者：

Chan, Kelvin C. K. ^{[1
]}

Xu, Xiangyu ^{[1
]}

Wang, Xintao ^{[2
]}

Gu, Jinwei ^{[3
,4
]}

Loy, Chen Change ^{[1
]}

机构：

[1] Nanyang Technol Univ NTU, S Lab, Singapore 639798, Singapore

[2] Tencent PCG, Appl Res Ctr, Shenzhen 518054, Guangdong, Peoples R China

[3] Tetras AI, San Francisco, CA 94105 USA

[4] Shanghai AI Lab, Shanghai 200237, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2023年 / 45卷 / 03期

关键词：

Image restoration; Generative adversarial networks; Task analysis; Superresolution; Generators; Faces; Optimization; Super-resolution; colorization; restoration; generative adversarial networks; generative prior;

D O I：

10.1109/TPAMI.2022.3186715

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We show that pre-trained Generative Adversarial Networks (GANs) such as StyleGAN and BigGAN can be used as a latent bank to improve the performance of image super-resolution. While most existing perceptual-oriented approaches attempt to generate realistic outputs through learning with adversarial loss, our method, Generative LatEnt bANk (GLEAN), goes beyond existing practices by directly leveraging rich and diverse priors encapsulated in a pre-trained GAN. But unlike prevalent GAN inversion methods that require expensive image-specific optimization at runtime, our approach only needs a single forward pass for restoration. GLEAN can be easily incorporated in a simple encoder-bank-decoder architecture with multi-resolution skip connections. Employing priors from different generative models allows GLEAN to be applied to diverse categories (e.g., human faces, cats, buildings, and cars). We further present a lightweight version of GLEAN, named LightGLEAN, which retains only the critical components in GLEAN. Notably, LightGLEAN consists of only 21% of parameters and 35% of FLOPs while achieving comparable image quality. We extend our method to different tasks including image colorization and blind image restoration, and extensive experiments show that our proposed models perform favorably in comparison to existing methods. Codes and models are available at https://github.com/open-mmlab/mmediting.

引用

页码：3154 / 3168

页数：15

共 72 条

[1] [Anonymous], 2015, PROC ICLR
[2] Antic J, 2018, DeOldify
[3] Semantic Photo Manipulation with a Generative Image Prior
Bau, David
Strobelt, Hendrik
Peebles, William
Wulff, Jonas
Zhou, Bolei
Zhu, Jun-Yan
Torralba, Antonio
[J]. ACM TRANSACTIONS ON GRAPHICS, 2019, 38 (04):
[4] Brock A., 2019, PROC INT CONFLEARN R
[5] GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution
Chan, Kelvin C. K.
Wang, Xintao
Xu, Xiangyu
Gu, Jinwei
Loy, Chen Change
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14240 - 14249
[6] Progressive Semantic-Aware Style Transformation for Blind Face Restoration
Chen, Chaofeng
Li, Xiaoming
Yang, Lingbo
Lin, Xianhui
Zhang, Lei
Wong, Kwan-Yee K.
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11891 - 11900
[7] Deep Colorization
Cheng, Zezhou
Yang, Qingxiong
Sheng, Bin
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 415 - 423
[8] Pixel Recursive Super Resolution
Dahl, Ryan
Norouzi, Mohammad
Shlens, Jonathon
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5449 - 5458
[9] Second-order Attention Network for Single Image Super-Resolution
Dai, Tao
Cai, Jianrui
Zhang, Yongbing
Xia, Shu-Tao
Zhang, Lei
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11057 - 11066
[10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 7 8 →