Deepfake Video Detection Based on Spatial, Spectral, and Temporal Inconsistencies Using Multimodal Deep Learning

被引:22
|
作者
Lewis, John K. [1 ]
Toubal, Imad Eddine [2 ]
Chen, Helen [3 ]
Sandesera, Vishal [4 ]
Lomnitz, Michael [4 ]
Hampel-Arias, Zigfried [4 ]
Prasad, Calyam [2 ]
Palaniappan, Kannappan [2 ]
机构
[1] Florida Southern Coll, Lakeland, FL 33801 USA
[2] Univ Missouri, Columbia, MO 65211 USA
[3] Univ Maryland, College Pk, MD USA
[4] IQT Labs, Arlington, VA USA
基金
美国国家科学基金会;
关键词
deepfake detection; deep learning; multi-modal; computer vision;
D O I
10.1109/AIPR50011.2020.9425167
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Authentication of digital media has become an everpressing necessity for modern society. Since the introduction of Generative Adversarial Networks (GANs), synthetic media has become increasingly difficult to identify. Synthetic videos that contain altered faces and/or voices of a person are known as deepfakes and threaten trust and privacy in digital media. Deepfakes can be weaponized for political advantage, slander, and to undermine the reputation of public figures. Despite imperfections of deepfakes, people struggle to distinguish between authentic and manipulated images and videos. Consequently, it is important to have automated systems that accurately and efficiently classify the validity of digital content. Many recent deepfake detection methods use single frames of video and focus on the spatial information in the image to infer the authenticity of the video. Some promising approaches exploit the temporal inconsistencies of manipulated videos; however, research primarily focuses on spatial features. We propose a hybrid deep learning approach that uses spatial, spectral, and temporal content that is coupled in a consistent way to differentiate real and fake videos. We show that the Discrete Cosine transform can improve deepfake detection by capturing spectral features of individual frames. In this work, we build a multimodal network that explores new features to detect deepfake videos, achieving 61.95% accuracy on the Facebook Deepfake Detection Challenge (DFDC) dataset.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] AN EFFICIENT DEEP VIDEO MODEL FOR DEEPFAKE DETECTION
    Sun, Ruipeng
    Zhao, Ziyuan
    Shen, Li
    Zeng, Zeng
    Li, Yuxin
    Veeravalli, Bharadwaj
    Yang Xulei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 351 - 355
  • [22] Temporal and spatial feature based approaches in drowsiness detection using deep learning technique
    Pandey, Nageshwar Nath
    Muppalaneni, Naresh Babu
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (06) : 2287 - 2299
  • [23] Temporal and spatial feature based approaches in drowsiness detection using deep learning technique
    Nageshwar Nath Pandey
    Naresh Babu Muppalaneni
    Journal of Real-Time Image Processing, 2021, 18 : 2287 - 2299
  • [24] Deepfake Detection through Deep Learning
    Pan, Deng
    Sun, Lixian
    Wang, Rui
    Zhang, Xingjian
    Sinnott, Richard O.
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT 2020), 2020, : 134 - 143
  • [25] On the consensus of synchronous temporal and spatial views: A novel multimodal deep learning method for social video prediction
    Xiao, Shuaiyong
    Wang, Jianxiong
    Wang, Jiwei
    Chen, Runlin
    Chen, Gang
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [26] Multi-model DeepFake Detection Using Deep and Temporal Features
    John, Jerry
    Sherif, Bismin V.
    THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND CAPSULE NETWORKS (ICIPCN 2022), 2022, 514 : 672 - 684
  • [27] Spatiotemporal Inconsistency Learning for DeepFake Video Detection
    Gu, Zhihao
    Chen, Yang
    Yao, Taiping
    Ding, Shouhong
    Li, Jilin
    Huang, Feiyue
    Ma, Lizhuang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3473 - 3481
  • [28] Video Transformer for Deepfake Detection with Incremental Learning
    Khan, Sohail Ahmed
    Dai, Hang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1821 - 1828
  • [29] Real-Time Deepfake Video Detection Using Eye Movement Analysis with a Hybrid Deep Learning Approach
    Javed, Muhammad
    Zhang, Zhaohui
    Dahri, Fida Hussain
    Laghari, Asif Ali
    ELECTRONICS, 2024, 13 (15)
  • [30] An efficient cybersecurity framework for facial video forensics detection based on multimodal deep learning
    Sedik, Ahmed
    Faragallah, Osama S.
    El-sayed, Hala S.
    El-Banby, Ghada M.
    Abd El-Samie, Fathi E.
    Khalaf, Ashraf A. M.
    El-Shafai, Walid
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 1251 - 1268