GAN-Control: Explicitly Controllable GANs

被引:77
作者
Shoshan, Alon [1 ]
Bhonker, Nadav [1 ]
Kviatkovsky, Igor [1 ]
Medioni, Gerard [1 ]
机构
[1] Amazon, Sunnyvale, CA 94089 USA
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.01382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for training GANs with explicit control over generated facial images. We are able to control the generated image by settings exact attributes such as age, pose, expression, etc. Most approaches for manipulating GAN-generated images achieve partial control by leveraging the latent space disentanglement properties, obtained implicitly after standard GAN training. Such methods are able to change the relative intensity of certain attributes, but not explicitly set their values. Recently proposed methods, designed for explicit control over human faces, harness morphable 3D face models (3DMM) to allow fine-grained control capabilities in GANs. Unlike these methods, our control is not constrained to 3DMM parameters and is extendable beyond the domain of human faces. Using contrastive learning, we obtain GANs with an explicitly disentangled latent space. This disentanglement is utilized to train control-encoders mapping human-interpretable inputs to suitable latent vectors, thus allowing explicit control. In the domain of human faces we demonstrate control over identity, age, pose, expression, hair color and illumination. We also demonstrate control capabilities of our framework in the domains of painted portraits and dog image generation. We demonstrate that our approach achieves state-of-the-art performance both qualitatively and quantitatively.
引用
收藏
页码:14063 / 14073
页数:11
相关论文
共 53 条
[1]   Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? [J].
Abdal, Rameen ;
Qin, Yipeng ;
Wonka, Peter .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4431-4440
[2]  
Abdal Rameen, 2020, Styleflow: Attribute-conditioned exploration of stylegangenerated images using conditional continuous normalizing flows
[3]  
Aittala Miika, 2019, CoRR
[4]  
[Anonymous], 2018, CoRR
[5]  
[Anonymous], 2015, A neural algorithm of artistic style, DOI DOI 10.1167/16.12.326
[6]  
[Anonymous], 2019, IEEE ICCV, DOI DOI 10.1007/978-3-030-37228-6_1
[7]  
[Anonymous], 2017, ICML
[8]  
[Anonymous], 2014, 27THINT C NEURAL INF
[9]  
Balakrishnan Guha, 2020, CAUSAL BENCHMARKING
[10]   Towards Open-Set Identity Preserving Face Synthesis [J].
Bao, Jianmin ;
Chen, Dong ;
Wen, Fang ;
Li, Houqiang ;
Hua, Gang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6713-6722