Fake visual content detection using two-stream convolutional neural networks

Cited by: 0

Authors:

Bilal Yousaf

Muhammad Usama

Waqas Sultani

Arif Mahmood

Junaid Qadir

Affiliations:

[1] Information Technology University (ITU), Department of Computer Science

[2] Lahore University of Management Sciences (LUMS), Department of Computer Science and Engineering (CSE), College of Engineering

[3] Qatar University, Department of Electrical Engineering

[4] Information Technology University (ITU)

Source:

Neural Computing and Applications | 2022 / Volume 34

Keywords:

Deepfakes; Two-stream network; Frequency stream; Combination of discrete Fourier transform and discrete wavelet

DOI:

Not available

Abstract:

Rapid progress in adversarial learning has enabled the generation of realistic-looking fake visual content. Several detection techniques have been proposed to distinguish fake from real visual content. The performance of most of these techniques, however, drops significantly when the test and training data are sampled from different distributions, which motivates efforts to improve the generalization of fake detectors. Because current fake content generation techniques do not accurately model the frequency spectrum of natural images, we observe that the frequency spectrum of fake visual data contains discriminative characteristics that can be used to detect fake content. We also observe that the information captured in the frequency spectrum differs from that captured in the spatial domain. Using these insights, we propose to combine complementary frequency and spatial domain features in a two-stream convolutional neural network architecture called TwoStreamNet. We demonstrate the improved generalization of the proposed two-stream network to several unseen generation architectures, datasets, and techniques. The proposed detector shows a significant performance improvement over current state-of-the-art fake content detectors, and fusing the frequency and spatial domain streams also improves its generalization.
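The abstract and keywords mention a frequency stream built from a discrete Fourier transform and a discrete wavelet transform, fused with a spatial stream, but this record does not say how the inputs are computed or how the streams are fused. The following is only a minimal sketch under stated assumptions: a naive 2-D DFT log-magnitude spectrum, a one-level Haar wavelet transform, and a weighted average of two per-stream scores. The function names `dft2_log_magnitude`, `haar_dwt2`, and `fuse_scores`, the Haar wavelet, and the averaging rule are all hypothetical choices for illustration, not the authors' implementation.

```python
import cmath
import math


def dft2_log_magnitude(img):
    """Naive 2-D DFT of a small grayscale image (list of rows).

    Returns log(1 + |F(u, v)|). O(N^4), so only suitable for tiny patches;
    a real pipeline would use an FFT library instead.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for u in range(h):
        for v in range(w):
            acc = 0j
            for y in range(h):
                for x in range(w):
                    angle = -2.0 * math.pi * (u * y / h + v * x / w)
                    acc += img[y][x] * cmath.exp(1j * angle)
            out[u][v] = math.log1p(abs(acc))
    return out


def haar_dwt2(img):
    """One-level 2-D orthonormal Haar transform (assumed wavelet choice).

    Returns four half-resolution sub-bands:
    (approximation, row-difference detail, column-difference detail, diagonal detail).
    """
    h, w = len(img), len(img[0])
    approx, row_detail, col_detail, diag_detail = [], [], [], []
    for y in range(0, h - 1, 2):
        bands = ([], [], [], [])
        for x in range(0, w - 1, 2):
            a, b = img[y][x], img[y][x + 1]
            c, d = img[y + 1][x], img[y + 1][x + 1]
            bands[0].append((a + b + c + d) / 2.0)  # local average
            bands[1].append((a + b - c - d) / 2.0)  # top row minus bottom row
            bands[2].append((a - b + c - d) / 2.0)  # left column minus right column
            bands[3].append((a - b - c + d) / 2.0)  # diagonal difference
        approx.append(bands[0])
        row_detail.append(bands[1])
        col_detail.append(bands[2])
        diag_detail.append(bands[3])
    return approx, row_detail, col_detail, diag_detail


def fuse_scores(spatial_score, frequency_score, weight=0.5):
    """Hypothetical late fusion: weighted average of two per-stream fake scores."""
    return weight * spatial_score + (1.0 - weight) * frequency_score
```

In this sketch, the spectrum and wavelet sub-bands would feed a frequency-stream CNN, the raw pixels would feed a spatial-stream CNN, and `fuse_scores` stands in for whatever fusion the paper actually uses.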

Pages: 7991-8004

Page count: 13
