Two-Branch Recurrent Network for Isolating Deepfakes in Videos

被引:302
作者
Masi, Iacopo [1 ]
Killekar, Aditya [1 ]
Mascarenhas, Royston Marian [1 ]
Gurudatt, Shenoy Pratik [1 ]
AbdAlmageed, Wael [1 ]
机构
[1] USC, Informat Sci Inst, Marina Del Rey, CA USA
来源
COMPUTER VISION - ECCV 2020, PT VII | 2020年 / 12352卷
关键词
Deepfake detection; Two-branch recurrent net; Loss function;
D O I
10.1007/978-3-030-58571-6_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current spike of hyper-realistic faces artificially generated using deepfakes calls for media forensics solutions that are tailored to video streams and work reliably with a low false alarm rate at the video level. We present a method for deepfake detection based on a two-branch network structure that isolates digitally manipulated faces by learning to amplify artifacts while suppressing the high-level face content. Unlike current methods that extract spatial frequencies as a preprocessing step, we propose a two-branch structure: one branch propagates the original information, while the other branch suppresses the face content yet amplifies multi-band frequencies using a Laplacian of Gaussian (LoG) as a bottleneck layer. To better isolate manipulated faces, we derive a novel cost function that, unlike regular classification, compresses the variability of natural faces and pushes away the unrealistic facial samples in the feature space. Our two novel components show promising results on the FaceForensics++, Celeb-DF, and Facebook's DFDC preview benchmarks, when compared to prior work. We then offer a full, detailed ablation study of our network architecture and cost function. Finally, although the bar is still high to get very remarkable figures at a very low false alarm rate, our study shows that we can achieve good video-level performance when cross-testing in terms of video-level AUC.
引用
收藏
页码:667 / 684
页数:18
相关论文
共 69 条
[1]  
Afchar D, 2018, IEEE INT WORKS INFOR
[2]  
Agarwal S., 2019, CVPR WORKSH JUN
[3]  
[Anonymous], 2019, CNN
[4]  
[Anonymous], 2007, Technical Report 07-49, DOI 10.1.1. 122.8268
[5]  
[Anonymous], 2016, P 4 ACM WORKSH 138IN
[6]  
apps.apple, ZAO app
[7]   How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks) [J].
Bulat, Adrian ;
Tzimiropoulos, Georgios .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1021-1030
[8]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540
[9]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[10]  
Cozzolino D, 2019, Arxiv, DOI [arXiv:1812.02510, 10.48550/arXiv.1812.02510]