Identity-Aware Variational Autoencoder for Face Swapping

被引:0
|
作者
Li, Zonglin [1 ]
Zhang, Zhaoxin [1 ]
He, Shengfeng [2 ]
Meng, Quanling [1 ]
Zhang, Shengping [1 ]
Zhong, Bineng [3 ,4 ]
Ji, Rongrong [5 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Weihai 264209, Peoples R China
[2] Singapore Management Univ, Sch Comp & Informat Syst, Singapore 188065, Singapore
[3] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541004, Peoples R China
[4] Guangxi Normal Univ, Sch Software, Guilin 541004, Peoples R China
[5] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
关键词
Faces; Face recognition; Training; Three-dimensional displays; Decoding; Task analysis; Shape; Face swapping; variational autoencoder; weak-supervised training; UNIFIED FRAMEWORK; IMAGE; MODEL;
D O I
10.1109/TCSVT.2024.3349909
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Face swapping aims to transfer the identity of a source face to a target face image while preserving the target attributes (e.g., facial expression, head pose, illumination, and background). Most existing methods use a face recognition model to extract global features from the source face and directly fuse them with the target to generate a swapping result. However, identity-irrelevant attributes (e.g., hairstyle and facial appearances) contribute a lot to the recognition task, and thus swapping this task-specific feature inevitably interfuses source attributes with target ones. In this paper, we propose an identity-aware variational autoencoder (ID-VAE) based face swapping framework, dubbed VAFSwap, which learns disentangled identity and attribute representations for high-fidelity face swapping. In particular, we overcome the unpaired training barrier of VAE and impose a proxy identity on the latent space by exploiting the weak supervision from an auxiliary image set whose identity is averaged from multiple collected face images. To explicitly guide the identity fusion, we further devise an identity-associated matrix that corresponds different face regions with their identity representations to perform identity-related feature interactions. Finally, we incorporate spatial dimensions into the latent space and exploit the generative priors of a pre-trained face generator, allowing the effective elimination of noticeable swapping artifacts. Extensive experiments on the FaceForensics++ and CelebA-HQ datasets demonstrate that our method outperforms the state-of-the-art significantly.
引用
收藏
页码:5466 / 5479
页数:14
相关论文
共 50 条
  • [41] 3PFS: Protecting Pedestrian Privacy Through Face Swapping
    Zhao, Zixian
    Zhang, Xingchen
    Demiris, Yiannis
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16845 - 16854
  • [42] Emotion-Regularized Conditional Variational Autoencoder for Emotional Response Generation
    Ruan, Yu-Ping
    Ling, Zhen-Hua
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) : 842 - 848
  • [43] Generating Accurate and Diverse Audio Captions Through Variational Autoencoder Framework
    Zhang, Yiming
    Du, Ruoyi
    Tan, Zheng-Hua
    Wang, Wenwu
    Ma, Zhanyu
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2520 - 2524
  • [44] Building Face Recognition System with Triplet-based Stacked Variational Denoising Autoencoder
    Le, Xuan Tuan
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 106 - 110
  • [45] Design of Metamaterials for Absorbers Based on Variational Autoencoder
    Li, Qi
    Wang, Jianwei
    Lei, Tao
    Xiang, Tianyu
    Qin, Chanchan
    Yang, Maoze
    IEEE ACCESS, 2024, 12 : 92328 - 92336
  • [46] Face Swapping via Reverse Contrastive Learning and Explicit Identity-Attribute Disentanglement
    Wang, Tao
    Zheng, Chunhou
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 216 - 227
  • [47] Non-Parallel Whisper-to-Normal Speaking Style Conversion Using Auxiliary Classifier Variational Autoencoder
    Seki, Shogo
    Kameoka, Hirokazu
    Kaneko, Takuhiro
    Tanaka, Kou
    IEEE ACCESS, 2023, 11 : 44590 - 44599
  • [48] Discriminative Mixture Variational Autoencoder for Semisupervised Classification
    Chen, Jian
    Du, Lan
    Liao, Leiyao
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (05) : 3032 - 3046
  • [49] Plausible 3D Face Wrinkle Generation Using Variational Autoencoders
    Deng, Qixin
    Ma, Luming
    Jin, Aobo
    Bi, Huikun
    Le, Binh Huy
    Deng, Zhigang
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (09) : 3113 - 3125
  • [50] Laughter synthesis: A comparison between Variational autoencoder and Autoencoder
    Mansouri, Nadia
    Lachiri, Zied
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,