Locate and Verify: A Two-Stream Network for Improved Deepfake Detection

被引:25
作者
Shuai, Chao [1 ,2 ]
Zhong, Jieming [1 ]
Wu, Shuang [3 ]
Lin, Feng [1 ]
Wang, Zhibo [1 ]
Ba, Zhongjie [1 ]
Liu, Zhenguang [1 ]
Cavallaro, Lorenzo [1 ,4 ]
Ren, Kui [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] ZJU Hangzhou Global Sci & Technol Innovat Ctr, Hangzhou, Zhejiang, Peoples R China
[3] Black Sesame Technol, Singapore, Singapore
[4] UCL, London, England
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Deepfake detection; two-stream network; semi-supervised learning;
D O I
10.1145/3581783.3612386
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deepfake has taken the world by storm, triggering a trust crisis. Current deepfake detection methods are typically inadequate in generalizability, with a tendency to overfit to image contents such as the background, which are frequently occurring but relatively unimportant in the training dataset. Furthermore, current methods heavily rely on a few dominant forgery regions and may ignore other equally important regions, leading to inadequate uncovering of forgery cues. In this paper, we strive to address these shortcomings from three aspects: (1) We propose an innovative two-stream network that effectively enlarges the potential regions from which the model extracts forgery evidence. (2) We devise three functional modules to handle the multi-stream and multi-scale features in a collaborative learning scheme. (3) Confronted with the challenge of obtaining forgery annotations, we propose a Semi-supervised Patch Similarity Learning strategy to estimate patch-level forged location annotations. Empirically, our method demonstrates significantly improved robustness and generalizability, outperforming previous methods on six benchmarks, and improving the frame-level AUC on Deepfake Detection Challenge preview dataset from 0.797 to 0.835 and video-level AUC on CelebDF_v1 dataset from 0.811 to 0.847. Our implementation is available at https://github.com/sccsok/Locateand-Verify.
引用
收藏
页码:7131 / 7142
页数:12
相关论文
共 60 条
[1]  
[Anonymous], 2016, FaceSwap
[2]  
[Anonymous], 2020, Deepfakes
[3]  
[Anonymous], 2020, YouTube
[4]  
[Anonymous], 2022, COMPUTER VISION EC 5, DOI DOI 10.1007/978-3-031-20065-623
[5]   SimSwap: An Efficient Framework For High Fidelity Face Swapping [J].
Chen, Renwang ;
Chen, Xuanhong ;
Ni, Bingbing ;
Ge, Yanhao .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2003-2011
[6]  
Chen S, 2021, AAAI CONF ARTIF INTE, V35, P1081
[7]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[8]   ID-Reveal: Identity-aware DeepFake Video Detection [J].
Cozzolino, Davide ;
Roessler, Andreas ;
Thies, Justus ;
Niessner, Matthias ;
Verdoliva, Luisa .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :15088-15097
[9]   On the Detection of Digital Face Manipulation [J].
Dang, Hao ;
Liu, Feng ;
Stehouwer, Joel ;
Liu, Xiaoming ;
Jain, Anil K. .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5780-5789
[10]   Towards Solving the DeepFake Problem : An Analysis on Improving DeepFake Detection using Dynamic Face Augmentation [J].
Das, Sowmen ;
Seferbekov, Selim ;
Datta, Arup ;
Islam, Md Saiful ;
Amin, Md Ruhul .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :3769-3778