Mask guided diverse face image synthesis

被引:3
作者
Sun, Song [1 ]
Zhao, Bo [2 ]
Mateen, Muhammad [3 ]
Chen, Xin [1 ]
Wen, Junhao [1 ]
机构
[1] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[2] Univ British Columbia, Dept Comp Sci, Vancouve, BC V6T 1Z4, Canada
[3] Air Univ, Dept Comp Sci, Multan Campus, Multan 60000, Pakistan
关键词
face image generation; image translation; generative adversarial networks; GENERATIVE ADVERSARIAL NETWORKS;
D O I
10.1007/s11704-020-0400-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent studies have shown remarkable success in face image generation task. However, existing approaches have limited diversity, quality and controllability in generating results. To address these issues, we propose a novel end-to-end learning framework to generate diverse, realistic and controllable face images guided by face masks. The face mask provides a good geometric constraint for a face by specifying the size and location of different components of the face, such as eyes, nose and mouse. The framework consists of four components: style encoder, style decoder, generator and discriminator. The style encoder generates a style code which represents the style of the result face; the generator translate the input face mask into a real face based on the style code; the style decoder learns to reconstruct the style code from the generated face image; and the discriminator classifies an input face image as real or fake. With the style code, the proposed model can generate different face images matching the input face mask, and by manipulating the face mask, we can finely control the generated face image. We empirically demonstrate the effectiveness of our approach on mask guided face image synthesis task.
引用
收藏
页数:9
相关论文
共 40 条
[11]  
Karras Tero, 2017, CoRR
[12]  
Kim T, 2017, PR MACH LEARN RES, V70
[13]  
Kingma D. P., 2015, ACS SYM SER
[14]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[15]   MaskGAN: Towards Diverse and Interactive Facial Image Manipulation [J].
Lee, Cheng-Han ;
Liu, Ziwei ;
Wu, Lingyun ;
Luo, Ping .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5548-5557
[16]  
Liu Xihui., 2019, Advances in Neural Information Processing Systems, P568
[17]   Deep Learning Face Attributes in the Wild [J].
Liu, Ziwei ;
Luo, Ping ;
Wang, Xiaogang ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3730-3738
[18]   Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis [J].
Mao, Qi ;
Lee, Hsin-Ying ;
Tseng, Hung-Yu ;
Ma, Siwei ;
Yang, Ming-Hsuan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1429-1437
[19]  
Mescheder L, 2018, PR MACH LEARN RES, V80
[20]  
Paszke A., 2017, P NIPS WORKSH AUT