StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows

被引：292

作者：

Abdal, Rameen ^{[1
]}

Zhu, Peihao ^{[1
]}

Mitra, Niloy J. ^{[2
,3
]}

Wonka, Peter ^{[1
]}

机构：

[1] KAUST, Thuwal, Saudi Arabia

[2] UCL, London, England

[3] Adobe Res, San Jose, CA USA

来源：

ACM TRANSACTIONS ON GRAPHICS | 2021年 / 40卷 / 03期

关键词：

Generative adversarial networks; image editing;

D O I：

10.1145/3447648

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

High-quality, diverse, and photorealistic images can now be generated by unconditional GANs (e.g., StyleGAN). However, limited options exist to control the generation process using (semantic) attributes while still preserving the quality of the output. Further, due to the entangled nature of the GAN latent space, performing edits along one attribute can easily result in unwanted changes along other attributes. In this article, in the context of conditional exploration of entangled latent spaces, we investigate the two sub-problems of attribute-conditioned sampling and attribute-controlled editing. We present StyleFlow as a simple, effective, and robust solution to both the sub-problems by formulating conditional exploration as an instance of conditional continuous normalizing flows in the GAN latent space conditioned by attribute features. We evaluate our method using the face and the car latent space of StyleGAN, and demonstrate fine-grained disentangled edits along various attributes on both real photographs and StyleGAN generated images. For example, for faces, we vary camera pose, illumination variation, expression, facial hair, gender, and age. Finally, via extensive qualitative and quantitative comparisons, we demonstrate the superiority of StyleFlow over prior and several concurrent works.

引用

页数：21

共 68 条

[1] Image2StyleGAN++: How to Edit the Embedded Images? [J].

Abdal, Rameen ;

Qin, Yipeng ;

Wonka, Peter .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8293-8302

[2] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? [J].

Abdal, Rameen ;

Qin, Yipeng ;

Wonka, Peter .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4431-4440

[3] Deep Video-Based Performance Cloning [J].

Aberman, K. ;

Shi, M. ;

Liao, J. ;

Liscbinski, D. ;

Chen, B. ;

Cohen-Or, D. .

COMPUTER GRAPHICS FORUM, 2019, 38 (02) :219-233

[4]

Aittala Miika, 2019, CoRR

[5]

[Anonymous], 2012, IMPA FACE3D

[6]

[Anonymous], 2019, ARXIV191201865

[7]

Brock Andrew., 2018, Large scale GAN training for high fidelity natural image synthesis, DOI DOI 10.48550/ARXIV.1809.11096

[8]

Cao K., 2019, ACM T GRAPHIC, V37, P1, DOI DOI 10.1145/3272127.3275046

[9]

Chen R.T.Q., 2018, Advances in neural information processing systems, VVolume 31, P6571, DOI DOI 10.48550/ARXIV.1806.07366

[10] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation [J].

Choi, Yunjey ;

Choi, Minje ;

Kim, Munyoung ;

Ha, Jung-Woo ;

Kim, Sunghun ;

Choo, Jaegul .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8789-8797

← 1 2 3 4 5 6 7 →