Fighting Fake News: Two Stream Network for Deepfake Detection via Learnable SRM

Cited by: 42

Authors:

Han B. [1]

Han X. [2]

Zhang H. [1]

Li J. [1]

Cao X. [1]

Affiliations:

[1] State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing

[2] Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong at Shenzhen, Shenzhen

Source:

IEEE Transactions on Biometrics, Behavior, and Identity Science | 2021 / Vol. 3 / No. 3

Funding:

National Natural Science Foundation of China

Keywords:

deep learning; Deepfake; fake news; Multimedia forensics; SRM;

DOI:

10.1109/TBIOM.2021.3065735

Abstract:

Benefiting from the development of deep generative networks, modern fake news generation methods known as Deepfake spread rapidly over the Internet, calling for efficient detection methods. Existing Deepfake detection methods mostly use binary classification networks trained on frame-level inputs and do not leverage temporal information in videos. In addition, the accuracy of these methods drops sharply on low-quality data. In this work, we propose a two-stream network that detects Deepfake at the video level and can handle low-quality data. The proposed architecture first divides the input video into segments and then feeds selected frames of each segment into two streams. The first stream takes RGB information as input and tries to learn semantic inconsistency. The second stream, in parallel, leverages noise features extracted by spatial rich model (SRM) filters. Our experiments found that traditional SRM filters with fixed weights contribute only insignificant improvement; we therefore design novel learnable SRM filters, which can better fit the noise inconsistency in tampered regions. Finally, segmental fusion and stream fusion are conducted to combine the information from segments and streams. We evaluate our algorithm on FaceForensics++, the largest existing Deepfake dataset, and the experimental results show that we obtain state-of-the-art performance. © 2019 IEEE.
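The pipeline described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The function names (`sample_segment_frames`, `noise_residual`, `video_score`), the choice of the middle frame of each segment, and the use of plain averaging for both segmental and stream fusion are assumptions made for illustration only. The kernel is one commonly used 5x5 SRM high-pass filter; in a learnable variant, values like these would presumably serve only as the initialization of trainable convolution weights.

```python
# Illustrative sketch only: helper names, middle-frame sampling and
# averaging-based fusion are assumptions, not details taken from the paper.

# One commonly used 5x5 SRM high-pass kernel, scaled by 1/12.
# Its entries sum to zero, so flat image regions produce no response.
_RAW_KERNEL = [
    [-1, 2, -2, 2, -1],
    [2, -6, 8, -6, 2],
    [-2, 8, -12, 8, -2],
    [2, -6, 8, -6, 2],
    [-1, 2, -2, 2, -1],
]
SRM_KERNEL = [[v / 12.0 for v in row] for row in _RAW_KERNEL]


def sample_segment_frames(num_frames, num_segments):
    """Split the video into equal segments and pick each segment's middle frame."""
    if num_segments <= 0 or num_frames < num_segments:
        raise ValueError("need at least one frame per segment")
    seg_len = num_frames / num_segments
    return [int(seg_len * k + seg_len / 2) for k in range(num_segments)]


def noise_residual(gray, kernel=SRM_KERNEL):
    """Valid-mode 2D filtering of a grayscale image given as a list of rows."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(gray) - kh + 1
    out_w = len(gray[0]) - kw + 1
    return [
        [
            sum(kernel[i][j] * gray[y + i][x + j]
                for i in range(kh) for j in range(kw))
            for x in range(out_w)
        ]
        for y in range(out_h)
    ]


def fuse(scores):
    """Average a list of scores (stand-in for a learned fusion step)."""
    return sum(scores) / len(scores)


def video_score(rgb_scores, noise_scores):
    """Segmental fusion within each stream, then stream fusion across both."""
    return fuse([fuse(rgb_scores), fuse(noise_scores)])


if __name__ == "__main__":
    # Frame indices chosen for a 100-frame video split into 4 segments.
    print(sample_segment_frames(100, 4))
    # Hypothetical per-segment fake probabilities from the two streams.
    print(video_score([0.8, 0.6], [0.4, 0.2]))
```

In this sketch a flat image yields a zero residual, while a single altered pixel yields a nonzero one, which is the property that makes high-pass noise features useful for exposing local tampering.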

Pages: 320-331

Number of pages: 11
