SimSwap plus plus : Towards Faster and High-Quality Identity Swapping

被引：2

作者：

Chen, Xuanhong ^{[1
]}

Ni, Bingbing ^{[1
]}

Liu, Yutian ^{[1
]}

Liu, Naiyuan ^{[2
]}

Zeng, Zhilin ^{[1
]}

Wang, Hang ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China

[2] Univ Technol Sydney, Sydney, NSW 2007, Australia

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2024年 / 46卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Generative adversarial networks; image translation; face swapping; FACE;

D O I：

10.1109/TPAMI.2023.3307156

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Face identity editing (FIE) shows great value in AI content creation. Low-resolution FIE approaches have achieved tremendous progress, but high-quality FIE struggles. Two major challenges hinder higher-resolution and higher-performance development of FIE: lack of high-resolution dataset and unacceptable complexity forbidding for mobile platforms. To address both issues, we establish a novel large-scale, high-quality dataset tailored for FIE. Based on our SimSwap (Chen et al. 2020), we propose an upgraded version named SimSwap++ with significantly boosted model efficiency. SimSwap++ features two major innovations for high-performance model compression. First, a novel computational primitive named Conditional Dynamic Convolution (CD-Conv) is proposed to address the inefficiency of conditional schemes (e.g., AdaIN) in tiny models. CD-Conv achieves anisotropic processing and injection with significantly lower complexity compared to standard conditional operators, e.g., modulated convolution. Second, a Morphable Knowledge Distillation (MKD) is presented to further trim the overall model. Unlike conventional homogeneous teacher-student structures, MKD is designed to be heterogeneous and mutually compensable, endowing the student with the multi-path morphable property; thus, our student maximally inherits the teacher's knowledge after distillation while further reducing its complexity through structure re-parameterization. Extensive experiments demonstrate that our SimSwap++ achieves state-of-the-art performance (97.55% ID accuracy on FaceForensics++) with extremely low complexity (2.5 GFLOPs).

引用

页码：576 / 592

页数：17

共 68 条

[1]

Aguinaldo A, 2019, Arxiv, DOI arXiv:1902.00159

[2]

[Anonymous], 2018, Deepfakes

[3]

[Anonymous], 2019, Deepfacelab

[4]

Arjovsky M, 2017, Arxiv, DOI [arXiv:1701.07875, 10.48550/arXiv.1701.07875]

[5] Towards Open-Set Identity Preserving Face Synthesis [J].

Bao, Jianmin ;

Chen, Dong ;

Wen, Fang ;

Li, Houqiang ;

Hua, Gang .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6713-6722

[6] A morphable model for the synthesis of 3D faces [J].

Blanz, V ;

Vetter, T .

SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194

[7] VGGFace2: A dataset for recognising faces across pose and age [J].

Cao, Qiong ;

Shen, Li ;

Xie, Weidi ;

Parkhi, Omkar M. ;

Zisserman, Andrew .

PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74

[8]

Chen HT, 2020, Arxiv, DOI arXiv:2003.03519

[9] SimSwap: An Efficient Framework For High Fidelity Face Swapping [J].

Chen, Renwang ;

Chen, Xuanhong ;

Ni, Bingbing ;

Ge, Yanhao .

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2003-2011

[10]

Cortes C, 2012, J MACH LEARN RES, V13, P795

← 1 2 3 4 5 6 7 →