Identity-Aware Variational Autoencoder for Face Swapping

被引:0
|
作者
Li, Zonglin [1 ]
Zhang, Zhaoxin [1 ]
He, Shengfeng [2 ]
Meng, Quanling [1 ]
Zhang, Shengping [1 ]
Zhong, Bineng [3 ,4 ]
Ji, Rongrong [5 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore 188065, Singapore
[3] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
[4] Guangxi Normal Univ, Sch Software, Guilin 541004, Peoples R China
[5] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
关键词
Faces; Face recognition; Training; Three-dimensional displays; Decoding; Task analysis; Shape; Face swapping; variational autoencoder; weak-supervised training; UNIFIED FRAMEWORK; IMAGE; MODEL;
D O I
10.1109/TCSVT.2024.3349909
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Face swapping aims to transfer the identity of a source face to a target face image while preserving the target attributes (e.g., facial expression, head pose, illumination, and background). Most existing methods use a face recognition model to extract global features from the source face and directly fuse them with the target to generate a swapping result. However, identity-irrelevant attributes (e.g., hairstyle and facial appearances) contribute a lot to the recognition task, and thus swapping this task-specific feature inevitably interfuses source attributes with target ones. In this paper, we propose an identity-aware variational autoencoder (ID-VAE) based face swapping framework, dubbed VAFSwap, which learns disentangled identity and attribute representations for high-fidelity face swapping. In particular, we overcome the unpaired training barrier of VAE and impose a proxy identity on the latent space by exploiting the weak supervision from an auxiliary image set whose identity is averaged from multiple collected face images. To explicitly guide the identity fusion, we further devise an identity-associated matrix that corresponds different face regions with their identity representations to perform identity-related feature interactions. Finally, we incorporate spatial dimensions into the latent space and exploit the generative priors of a pre-trained face generator, allowing the effective elimination of noticeable swapping artifacts. Extensive experiments on the FaceForensics++ and CelebA-HQ datasets demonstrate that our method outperforms the state-of-the-art significantly.
引用
收藏
页码:5466 / 5479
页数:14
相关论文
共 50 条
  • [21] Review-aware heterogeneous variational autoencoder recommendation model
    Liu S.
    Zhang J.
    Chen X.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2022, 62 (01): : 88 - 97
  • [22] VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT WITH A NOISE-AWARE ENCODER
    Fang, Huajian
    Carbajal, Guillaume
    Wermter, Stefan
    Gerkmann, Timo
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 676 - 680
  • [23] Depth-Aware Object Tracking With a Conditional Variational Autoencoder
    Huang, Wenhui
    Gu, Jason
    Guo, Yinchen
    IEEE ACCESS, 2021, 9 : 94537 - 94547
  • [24] Hiding Face Into Background: A Proactive Countermeasure Against Malicious Face Swapping
    Shen, Xiaofeng
    Yao, Heng
    Tan, Shunquan
    Qin, Chuan
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 10613 - 10623
  • [25] Disentangled Variational Autoencoder for Emotion Recognition in Conversations
    Yang, Kailai
    Zhang, Tianlin
    Ananiadou, Sophia
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (02) : 508 - 518
  • [26] SelfSwapper: Self-supervised Face Swapping via Shape Agnostic Masked AutoEncoder
    Lee, Jaeseong
    Hyung, Junha
    Jung, Sohyun
    Choo, Jaegul
    COMPUTER VISION - ECCV 2024, PT LV, 2025, 15113 : 383 - 400
  • [27] Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification
    Zhao, Zhiwei
    Liu, Bin
    Lu, Yan
    Chu, Qi
    Yu, Nenghai
    Chen, Chang Wen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3457 - 3468
  • [28] High-Fidelity Monocular Face Reconstruction Based on an Unsupervised Model-Based Face Autoencoder
    Tewari, Ayush
    Zollhofer, Michael
    Bernard, Florian
    Garrido, Pablo
    Kim, Hyeongwoo
    Perez, Patrick
    Theobalt, Christian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 357 - 370
  • [29] eVAE: Evolutionary Variational Autoencoder
    Wu, Zhangkai
    Cao, Longbing
    Qi, Lei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (02) : 3288 - 3299
  • [30] A Variational Autoencoder Mixture Model for Online Behavior Recommendation
    Nguyen, Minh-Duc
    Cho, Yoon-Sik
    IEEE ACCESS, 2020, 8 (08) : 132736 - 132747