VR-FAM: VARIANCE-REDUCED ENCODER WITH NONLINEAR TRANSFORMATION FOR FACIAL ATTRIBUTE MANIPULATION

Cited by: 1
Authors
Yuan, Yifan [1]
Ma, Siteng [1]
Zhang, Junping [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Funding
National Natural Science Foundation of China
Keywords
Facial attribute manipulation; disentangle learning; StyleGAN; GANs; style transfer
DOI
10.1109/ICASSP43922.2022.9746046
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Facial attribute manipulation (FAM) aims to generate desired facial images by modifying specific attributes while keeping the others unchanged. Existing methods suffer from the entanglement of facial attributes, which leads to unexpected artifacts and the loss of facial identity information after editing. To alleviate these issues, we propose a novel FAM framework based on StyleGAN, termed VR-FAM, which meets the three requirements of FAM: editing ability, low distortion, and fidelity. First, we propose a variance-reduced encoder that makes its latent space close to that of StyleGAN. Second, we present a nonlinear latent transformation network that converts the source latent code into the target latent code in line with the nonlinear latent space of StyleGAN. Experimentally, we evaluate the proposed framework on the benchmark FFHQ dataset and demonstrate improvements over recently published models in terms of edit accuracy and fidelity.
Pages: 1755-1759
Page count: 5