A lightweight 3D convolutional neural network for deepfake detection

被引：35

作者：

Liu, Jiarui ^{[1
]}

Zhu, Kaiman ^{[1
]}

Lu, Wei ^{[1
]}

Luo, Xiangyang ^{[2
]}

Zhao, Xianfeng ^{[3
,4
]}

机构：

[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangdong Prov Key Lab Informat Secur Technol, Minist Educ,Key Lab Machine Intelligence & Adv Co, Guangzhou, Peoples R China

[2] State Key Lab Math Engn & Adv Comp, Zhengzhou, Peoples R China

[3] Chinese Acad Sci, Inst Informat Engn, State Key Lab Informat Secur, Beijing, Peoples R China

[4] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS | 2021年 / 36卷 / 09期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

3D CNN; deepfake; deepfake detection; face swapping; face manipulation;

D O I：

10.1002/int.22499

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The rapid development of DeepFake technologies has brought great challenges to the authenticity of video contents. It is of vital importance to develop DeepFake detection methods, among which three-dimensional (3D) convolution neural networks (CNN) have attracted wide interest and achieved satisfying performances. However, there are few 3D CNNs designed for DeepFake detection and the parameters of them are large, which cause heavy memory and storage consumption. In this paper, a lightweight 3D CNN is proposed for DeepFake detection. Channel transformation module is designed to extract features with much fewer parameters in higher level. Serving as spatial-temporal module, 3D CNNs are adopted to fuse the spatial features in time dimension. To suppress frame content and highlight frame texture, spatial rich model features are extracted from the input frames, which helps the spatial-temporal module achieve better performance. Experimental results show that the number of parameters of the proposed network is much less than those of other networks and the proposed network outperforms other state-of-the-art DeepFake detection methods on mainstream DeepFake data sets.

引用

页码：4990 / 5004

页数：15

共 44 条

[1]

Afchar D, 2018, IEEE INT WORKS INFOR

[2] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].

Carreira, Joao ;

Zisserman, Andrew .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733

[3] FSSPOTTER: SPOTTING FACE-SWAPPED VIDEO BY SPATIAL AND TEMPORAL CLUES [J].

Chen, Peng ;

Liu, Jin ;

Liang, Tao ;

Zhou, Guangzhi ;

Gao, Hongchao ;

Dai, Jiao ;

Han, Jizhong .

2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,

[4]

Chen Z., 2021, INT J INTELL SYST, V36

[5] Privacy preservation for image data: A GAN-based method [J].

Chen, Zhenfei ;

Zhu, Tianqing ;

Xiong, Ping ;

Wang, Chenguang ;

Ren, Wei .

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, 36 (04) :1668-1685

[6]

Choi DH, 2020, IEEE IMAGE PROC, P823, DOI [10.1109/ICIP40778.2020.9190655, 10.1109/icip40778.2020.9190655]

[7]

CHOLLET F, 2017, PROC CVPR IEEE, P1800, DOI DOI 10.1109/CVPR.2017.195

[8] Recasting Residual-based Local Descriptors as Convolutional Neural Networks: an Application to Image Forgery Detection [J].

Cozzolino, Davide ;

Poggi, Giovanni ;

Verdoliva, Luisa .

IH&MMSEC'17: PROCEEDINGS OF THE 2017 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, 2017, :159-164

[9]

Cozzolino Davide, 2018, FORENSICTRANSFER WEA

[10] On the Detection of Digital Face Manipulation [J].

Dang, Hao ;

Liu, Feng ;

Stehouwer, Joel ;

Liu, Xiaoming ;

Jain, Anil K. .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5780-5789

← 1 2 3 4 5 →