Novel GAN Inversion Model with Latent Space Constraints for Face Reconstruction

Cited by: 0

Authors:

Yang, Jinglong [1]

Chen, Xiongwen [1]

Zhang, Han [1]

Affiliations:

[1] Nankai Univ, Coll Artificial Intelligence, Tianjin, Peoples R China

Source:

NEURAL INFORMATION PROCESSING, ICONIP 2021, PT III | 2021 / Vol. 13110

Funding:

National Natural Science Foundation of China;

Keywords:

StyleGAN; GAN inversion; Encoder; Face reconstruction;

DOI:

10.1007/978-3-030-92238-2_51

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper considers how to encode a target face image into the StyleGAN latent space accurately and efficiently, so that the various image editing methods can be applied to real images. Compared with optimization-based methods, which iteratively run gradient descent on the latent code, the learning-based method we adopt encodes a target image in a single forward pass, which makes it better suited to real-world applications. The key advances in this paper are adopting a face recognition model as a constraint to keep the identity information intact, and adding a classifier to encourage the latent code to retain more of the attributes present in the original image. Experiments show that our method achieves excellent reconstruction quality. The ablation study indicates that the proposed design advances the GAN inversion task both qualitatively and quantitatively. However, the method may fail when other objects surround the target face, generating a blurry patch around those objects.
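As a rough illustration of the two constraints named in the abstract, the sketch below combines a pixel reconstruction term with an identity term and an attribute term. It is not the authors' implementation: the abstract gives no loss formulas, so the cosine-distance identity loss, the binary cross-entropy attribute loss, the function names, and the weights `w_id` and `w_attr` are all assumptions made for illustration. The face embeddings and attribute logits are taken as precomputed arrays, standing in for the outputs of a face recognition model and a classifier.

```python
import numpy as np


def identity_loss(emb_target, emb_recon):
    """Assumed identity term: 1 - cosine similarity of two face embeddings."""
    a = emb_target / np.linalg.norm(emb_target)
    b = emb_recon / np.linalg.norm(emb_recon)
    return 1.0 - float(np.dot(a, b))


def attribute_loss(logits, labels):
    """Assumed attribute term: mean binary cross-entropy on raw logits."""
    x = np.asarray(logits, dtype=float)
    y = np.asarray(labels, dtype=float)
    # Numerically stable form: max(x, 0) - x*y + log(1 + exp(-|x|))
    return float(np.mean(np.maximum(x, 0) - x * y + np.log1p(np.exp(-np.abs(x)))))


def inversion_loss(img, recon, emb_target, emb_recon,
                   attr_logits, attr_labels, w_id=1.0, w_attr=1.0):
    """Hypothetical total loss: pixel MSE + weighted identity and attribute terms."""
    pixel = float(np.mean((np.asarray(img) - np.asarray(recon)) ** 2))
    return (pixel
            + w_id * identity_loss(emb_target, emb_recon)
            + w_attr * attribute_loss(attr_logits, attr_labels))
```

In a training loop of this kind, the reconstruction and its embedding would come from passing the encoder's latent code through a fixed generator and a fixed face recognition network; only the encoder would be updated.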


Pages: 620-631

Page count: 12
