Locate and Verify: A Two-Stream Network for Improved Deepfake Detection

被引:25
作者
Shuai, Chao [1 ,2 ]
Zhong, Jieming [1 ]
Wu, Shuang [3 ]
Lin, Feng [1 ]
Wang, Zhibo [1 ]
Ba, Zhongjie [1 ]
Liu, Zhenguang [1 ]
Cavallaro, Lorenzo [1 ,4 ]
Ren, Kui [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] ZJU Hangzhou Global Sci & Technol Innovat Ctr, Hangzhou, Zhejiang, Peoples R China
[3] Black Sesame Technol, Singapore, Singapore
[4] UCL, London, England
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Deepfake detection; two-stream network; semi-supervised learning;
D O I
10.1145/3581783.3612386
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deepfake has taken the world by storm, triggering a trust crisis. Current deepfake detection methods are typically inadequate in generalizability, with a tendency to overfit to image contents such as the background, which are frequently occurring but relatively unimportant in the training dataset. Furthermore, current methods heavily rely on a few dominant forgery regions and may ignore other equally important regions, leading to inadequate uncovering of forgery cues. In this paper, we strive to address these shortcomings from three aspects: (1) We propose an innovative two-stream network that effectively enlarges the potential regions from which the model extracts forgery evidence. (2) We devise three functional modules to handle the multi-stream and multi-scale features in a collaborative learning scheme. (3) Confronted with the challenge of obtaining forgery annotations, we propose a Semi-supervised Patch Similarity Learning strategy to estimate patch-level forged location annotations. Empirically, our method demonstrates significantly improved robustness and generalizability, outperforming previous methods on six benchmarks, and improving the frame-level AUC on Deepfake Detection Challenge preview dataset from 0.797 to 0.835 and video-level AUC on CelebDF_v1 dataset from 0.811 to 0.847. Our implementation is available at https://github.com/sccsok/Locateand-Verify.
引用
收藏
页码:7131 / 7142
页数:12
相关论文
共 60 条
[21]  
Hjelm R.D., 2018, Learning deep representations by mutual information estimation and maximization
[22]   Implicit Identity Driven Deepfake Face Swapping Detection [J].
Huang, Baojin ;
Wang, Zhongyuan ;
Yang, Jifan ;
Ai, Jiaxin ;
Zou, Qin ;
Wang, Qian ;
Ye, Dengpan .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :4490-4499
[23]   DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection [J].
Jiang, Liming ;
Li, Ren ;
Wu, Wayne ;
Qian, Chen ;
Loy, Chen Change .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2886-2895
[24]  
Kim J.-H., 2016, arXiv, DOI 10.48550/arXiv.1610.04325
[25]  
Kingsbury D, 2015, P1, DOI [10.48550/arXiv.1312.6114, 10.48550/arXiv.1412.6980, DOI 10.1145/1830483.1830503]
[26]   Wavelet-enhanced Weakly Supervised Local Feature Learning for Face Forgery Detection [J].
Li, Jiaming ;
Xie, Hongtao ;
Yu, Lingyun ;
Zhang, Yongdong .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :1299-1308
[27]   Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection [J].
Li, Jiaming ;
Xie, Hongtao ;
Li, Jiahong ;
Wang, Zhongyuan ;
Zhang, Yongdong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6454-6463
[28]   Face X-ray for More General Face Forgery Detection [J].
Li, Lingzhi ;
Bao, Jianmin ;
Zhang, Ting ;
Yang, Hao ;
Chen, Dong ;
Wen, Fang ;
Guo, Baining .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5000-5009
[29]  
[李艳歌 Li Yange], 2018, [高分子通报, Polymer Bulletin], P46
[30]   Numerical Study of Separation Characteristics of Piston-Type Explosive Bolt [J].
Li, Yanhua ;
Wang, Jingcheng ;
Xiong, Shihui ;
Cheng, Li ;
Wen, Yuquan ;
Li, Zhiliang .
SHOCK AND VIBRATION, 2019, 2019