Locate and Verify: A Two-Stream Network for Improved Deepfake Detection

被引：25

作者：

Shuai, Chao ^{[1
,2
]}

Zhong, Jieming ^{[1
]}

Wu, Shuang ^{[3
]}

Lin, Feng ^{[1
]}

Wang, Zhibo ^{[1
]}

Ba, Zhongjie ^{[1
]}

Liu, Zhenguang ^{[1
]}

Cavallaro, Lorenzo ^{[1
,4
]}

Ren, Kui ^{[1
]}

机构：

[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China

[2] ZJU Hangzhou Global Sci & Technol Innovat Ctr, Hangzhou, Zhejiang, Peoples R China

[3] Black Sesame Technol, Singapore, Singapore

[4] UCL, London, England

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Deepfake detection; two-stream network; semi-supervised learning;

D O I：

10.1145/3581783.3612386

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deepfake has taken the world by storm, triggering a trust crisis. Current deepfake detection methods are typically inadequate in generalizability, with a tendency to overfit to image contents such as the background, which are frequently occurring but relatively unimportant in the training dataset. Furthermore, current methods heavily rely on a few dominant forgery regions and may ignore other equally important regions, leading to inadequate uncovering of forgery cues. In this paper, we strive to address these shortcomings from three aspects: (1) We propose an innovative two-stream network that effectively enlarges the potential regions from which the model extracts forgery evidence. (2) We devise three functional modules to handle the multi-stream and multi-scale features in a collaborative learning scheme. (3) Confronted with the challenge of obtaining forgery annotations, we propose a Semi-supervised Patch Similarity Learning strategy to estimate patch-level forged location annotations. Empirically, our method demonstrates significantly improved robustness and generalizability, outperforming previous methods on six benchmarks, and improving the frame-level AUC on Deepfake Detection Challenge preview dataset from 0.797 to 0.835 and video-level AUC on CelebDF_v1 dataset from 0.811 to 0.847. Our implementation is available at https://github.com/sccsok/Locateand-Verify.

引用

页码：7131 / 7142

页数：12

共 60 条

[1]

[Anonymous], 2016, FaceSwap

[2]

[Anonymous], 2020, Deepfakes

[3]

[Anonymous], 2020, YouTube

[4]

[Anonymous], 2022, COMPUTER VISION EC 5, DOI DOI 10.1007/978-3-031-20065-623

[5] SimSwap: An Efficient Framework For High Fidelity Face Swapping [J].

Chen, Renwang ;

Chen, Xuanhong ;

Ni, Bingbing ;

Ge, Yanhao .

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :2003-2011

[6]

Chen S, 2021, AAAI CONF ARTIF INTE, V35, P1081

[7] Xception: Deep Learning with Depthwise Separable Convolutions [J].

Chollet, Francois .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807

[8] ID-Reveal: Identity-aware DeepFake Video Detection [J].

Cozzolino, Davide ;

Roessler, Andreas ;

Thies, Justus ;

Niessner, Matthias ;

Verdoliva, Luisa .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :15088-15097

[9] On the Detection of Digital Face Manipulation [J].

Dang, Hao ;

Liu, Feng ;

Stehouwer, Joel ;

Liu, Xiaoming ;

Jain, Anil K. .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5780-5789

[10] Towards Solving the DeepFake Problem : An Analysis on Improving DeepFake Detection using Dynamic Face Augmentation [J].

Das, Sowmen ;

Seferbekov, Selim ;

Datta, Arup ;

Islam, Md Saiful ;

Amin, Md Ruhul .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :3769-3778

← 1 2 3 4 5 6 →