SimSwap plus plus : Towards Faster and High-Quality Identity Swapping

被引:2
作者
Chen, Xuanhong [1 ]
Ni, Bingbing [1 ]
Liu, Yutian [1 ]
Liu, Naiyuan [2 ]
Zeng, Zhilin [1 ]
Wang, Hang [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Univ Technol Sydney, Sydney, NSW 2007, Australia
基金
中国国家自然科学基金;
关键词
Generative adversarial networks; image translation; face swapping; FACE;
D O I
10.1109/TPAMI.2023.3307156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Face identity editing (FIE) shows great value in AI content creation. Low-resolution FIE approaches have achieved tremendous progress, but high-quality FIE struggles. Two major challenges hinder higher-resolution and higher-performance development of FIE: lack of high-resolution dataset and unacceptable complexity forbidding for mobile platforms. To address both issues, we establish a novel large-scale, high-quality dataset tailored for FIE. Based on our SimSwap (Chen et al. 2020), we propose an upgraded version named SimSwap++ with significantly boosted model efficiency. SimSwap++ features two major innovations for high-performance model compression. First, a novel computational primitive named Conditional Dynamic Convolution (CD-Conv) is proposed to address the inefficiency of conditional schemes (e.g., AdaIN) in tiny models. CD-Conv achieves anisotropic processing and injection with significantly lower complexity compared to standard conditional operators, e.g., modulated convolution. Second, a Morphable Knowledge Distillation (MKD) is presented to further trim the overall model. Unlike conventional homogeneous teacher-student structures, MKD is designed to be heterogeneous and mutually compensable, endowing the student with the multi-path morphable property; thus, our student maximally inherits the teacher's knowledge after distillation while further reducing its complexity through structure re-parameterization. Extensive experiments demonstrate that our SimSwap++ achieves state-of-the-art performance (97.55% ID accuracy on FaceForensics++) with extremely low complexity (2.5 GFLOPs).
引用
收藏
页码:576 / 592
页数:17
相关论文
共 68 条
[1]  
Aguinaldo A, 2019, Arxiv, DOI arXiv:1902.00159
[2]  
[Anonymous], 2018, Deepfakes
[3]  
[Anonymous], 2019, Deepfacelab
[4]  
Arjovsky M, 2017, Arxiv, DOI [arXiv:1701.07875, 10.48550/arXiv.1701.07875]
[5]   Towards Open-Set Identity Preserving Face Synthesis [J].
Bao, Jianmin ;
Chen, Dong ;
Wen, Fang ;
Li, Houqiang ;
Hua, Gang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6713-6722
[6]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[7]   VGGFace2: A dataset for recognising faces across pose and age [J].
Cao, Qiong ;
Shen, Li ;
Xie, Weidi ;
Parkhi, Omkar M. ;
Zisserman, Andrew .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74
[8]  
Chen HT, 2020, Arxiv, DOI arXiv:2003.03519
[9]   SimSwap: An Efficient Framework For High Fidelity Face Swapping [J].
Chen, Renwang ;
Chen, Xuanhong ;
Ni, Bingbing ;
Ge, Yanhao .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2003-2011
[10]  
Cortes C, 2012, J MACH LEARN RES, V13, P795