Deep generative image priors for semantic face manipulation

被引:10
作者
Hou, Xianxu [1 ,2 ,3 ,4 ]
Shen, Linlin [1 ,2 ,3 ]
Ming, Zhong [2 ]
Qiu, Guoping [5 ,6 ]
机构
[1] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[3] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
[4] Xian Jiaotong Liverpool Univ, Sch AI & Adv Comp, Suzhou, Peoples R China
[5] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen, Peoples R China
[6] Univ Nottingham, Sch Comp Sci, Nottingham, England
基金
中国国家自然科学基金;
关键词
GANs; Face attribute prediction; Semantic face manipulation; AGE; GENDER;
D O I
10.1016/j.patcog.2023.109477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous works on generative adversarial networks (GANs) mainly focus on how to synthesize highfidelity images. In this paper, we present a framework to leverage the knowledge learned by GANs for semantic face manipulation. In particular, we propose to control the semantics of synthesized faces by adapting the latent codes with an attribute prediction model. Moreover, in order to achieve a more accurate estimation of different facial attributes, we propose to pretrain the attribute prediction model by inverting the synthesized face images back to the GAN latent space. As a result, our method explicitly considers the semantics encoded in the latent space of a pretrained GAN and is able to faithfully edit various attributes like eyeglasses, smiling, bald, age, mustache and gender for high-resolution face images. Extensive experiments show that our method has superior performance compared to state of the art for both face attribute prediction and semantic face manipulation. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 66 条
  • [1] StyleFlow: Attribute-conditioned Exploration of StyleGAN-Generated Images using Conditional Continuous Normalizing Flows
    Abdal, Rameen
    Zhu, Peihao
    Mitra, Niloy J.
    Wonka, Peter
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2021, 40 (03):
  • [2] Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?
    Abdal, Rameen
    Qin, Yipeng
    Wonka, Peter
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4431 - 4440
  • [3] Multi-Task CNN Model for Attribute Prediction
    Abdulnabi, Abrar H.
    Wang, Gang
    Lu, Jiwen
    Jia, Kui
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 1949 - 1959
  • [4] Efficient smile detection by Extreme Learning Machine
    An, Le
    Yang, Songfan
    Bhanu, Bir
    [J]. NEUROCOMPUTING, 2015, 149 : 354 - 363
  • [5] Towards Open-Set Identity Preserving Face Synthesis
    Bao, Jianmin
    Chen, Dong
    Wen, Fang
    Li, Houqiang
    Hua, Gang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6713 - 6722
  • [6] Bi nkowski M., 2018, INT C LEARNING REPRE
  • [7] A full data augmentation pipeline for small object detection based on generative adversarial networks
    Bosquet, Brais
    Cores, Daniel
    Seidenari, Lorenzo
    Brea, Victor M.
    Mucientes, Manuel
    Del Bimbo, Alberto
    [J]. PATTERN RECOGNITION, 2023, 133
  • [8] Semantic Component Decomposition for Face Attribute Manipulation
    Chen, Ying-Cong
    Shen, Xiaohui
    Lin, Zhe
    Lu, Xin
    Pao, I-Ming
    Jia, Jiaya
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9851 - 9859
  • [9] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
    Choi, Yunjey
    Choi, Minje
    Kim, Munyoung
    Ha, Jung-Woo
    Kim, Sunghun
    Choo, Jaegul
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8789 - 8797
  • [10] Inverting the Generator of a Generative Adversarial Network
    Creswell, Antonia
    Bharath, Anil Anthony
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (07) : 1967 - 1974