Deep generative image priors for semantic face manipulation

被引：10

作者：

Hou, Xianxu ^{[1
,2
,3
,4
]}

Shen, Linlin ^{[1
,2
,3
]}

Ming, Zhong ^{[2
]}

Qiu, Guoping ^{[5
,6
]}

机构：

[1] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen, Peoples R China

[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China

[3] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China

[4] Xian Jiaotong Liverpool Univ, Sch AI & Adv Comp, Suzhou, Peoples R China

[5] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China

[6] Univ Nottingham, Sch Comp Sci, Nottingham, England

来源：

PATTERN RECOGNITION | 2023年 / 139卷

基金：

中国国家自然科学基金;

关键词：

GANs; Face attribute prediction; Semantic face manipulation; AGE; GENDER;

D O I：

10.1016/j.patcog.2023.109477

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Previous works on generative adversarial networks (GANs) mainly focus on how to synthesize highfidelity images. In this paper, we present a framework to leverage the knowledge learned by GANs for semantic face manipulation. In particular, we propose to control the semantics of synthesized faces by adapting the latent codes with an attribute prediction model. Moreover, in order to achieve a more accurate estimation of different facial attributes, we propose to pretrain the attribute prediction model by inverting the synthesized face images back to the GAN latent space. As a result, our method explicitly considers the semantics encoded in the latent space of a pretrained GAN and is able to faithfully edit various attributes like eyeglasses, smiling, bald, age, mustache and gender for high-resolution face images. Extensive experiments show that our method has superior performance compared to state of the art for both face attribute prediction and semantic face manipulation. (c) 2023 Elsevier Ltd. All rights reserved.

引用

页数：13

共 66 条

[1] StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows
Abdal, Rameen
Zhu, Peihao
Mitra, Niloy J.
Wonka, Peter
[J]. ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (03):
[2] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
Abdal, Rameen
Qin, Yipeng
Wonka, Peter
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4431 - 4440
[3] Multi-Task CNN Model for Attribute Prediction
Abdulnabi, Abrar H.
Wang, Gang
Lu, Jiwen
Jia, Kui
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 1949 - 1959
[4] Efficient smile detection by Extreme Learning Machine
An, Le
Yang, Songfan
Bhanu, Bir
[J]. NEUROCOMPUTING, 2015, 149 : 354 - 363
[5] Towards Open-Set Identity Preserving Face Synthesis
Bao, Jianmin
Chen, Dong
Wen, Fang
Li, Houqiang
Hua, Gang
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6713 - 6722
[6] Bi nkowski M., 2018, INT C LEARNING REPRE
[7] A full data augmentation pipeline for small object detection based on generative adversarial networks
Bosquet, Brais
Cores, Daniel
Seidenari, Lorenzo
Brea, Victor M.
Mucientes, Manuel
Del Bimbo, Alberto
[J]. PATTERN RECOGNITION, 2023, 133
[8] Semantic Component Decomposition for Face Attribute Manipulation
Chen, Ying-Cong
Shen, Xiaohui
Lin, Zhe
Lu, Xin
Pao, I-Ming
Jia, Jiaya
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9851 - 9859
[9] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
Choi, Yunjey
Choi, Minje
Kim, Munyoung
Ha, Jung-Woo
Kim, Sunghun
Choo, Jaegul
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8789 - 8797
[10] Inverting the Generator of a Generative Adversarial Network
Creswell, Antonia
Bharath, Anil Anthony
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (07) : 1967 - 1974

← 1 2 3 4 5 6 7 →