Video Transformer for Deepfake Detection with Incremental Learning

Cited by: 41
Authors
Khan, Sohail Ahmed [1]
Dai, Hang [1]
Affiliations
[1] Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
Keywords
Deepfakes detection; face forensics; transformer; video analysis
DOI
10.1145/3474085.3475332
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Face forgery by deepfake is widely spread over the internet and this raises severe societal concerns. In this paper, we propose a novel video transformer with incremental learning for detecting deepfake videos. To better align the input face images, we use a 3D face reconstruction method to generate UV texture from a single input face image. The aligned face image can also provide pose, eyes blink and mouth movement information that cannot be perceived in the UV texture image, so we use both face images and their UV texture maps to extract the image features. We present an incremental learning strategy to fine-tune the proposed model on a smaller amount of data and achieve better deepfake detection performance. The comprehensive experiments on various public deepfake datasets demonstrate that the proposed video transformer model with incremental learning achieves state-of-the-art performance in the deepfake video detection task with enhanced feature learning from the sequenced
Pages: 1821-1828
Page count: 8
Related papers
50 entries in total
  • [1] Deepfake Video Detection with Spatiotemporal Dropout Transformer
    Zhang, Daichi
    Lin, Fanzhao
    Hua, Yingying
    Wang, Pengju
    Zeng, Dan
    Ge, Shiming
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5833 - 5841
  • [2] Cascaded Network Based on EfficientNet and Transformer for Deepfake Video Detection
    Deng, Liwei
    Wang, Jiandong
    Liu, Zhen
    NEURAL PROCESSING LETTERS, 2023, 55 (06) : 7057 - 7076
  • [3] Cascaded Network Based on EfficientNet and Transformer for Deepfake Video Detection
    Liwei Deng
    Jiandong Wang
    Zhen Liu
    Neural Processing Letters, 2023, 55 : 7057 - 7076
  • [4] Improved Deepfake Video Detection Using Convolutional Vision Transformer
    Deressa, Deressa Wodajo
    Lambert, Peter
    Van Wallendael, Glenn
    Atnafu, Solomon
    Mareen, Hannes
    2024 IEEE GAMING, ENTERTAINMENT, AND MEDIA CONFERENCE, GEM 2024, 2024, : 492 - 497
  • [5] MSVT: Multiple Spatiotemporal Views Transformer for DeepFake Video Detection
    Yu, Yang
    Ni, Rongrong
    Zhao, Yao
    Yang, Siyuan
    Xia, Fen
    Jiang, Ning
    Zhao, Guoqing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4462 - 4471
  • [6] Spatiotemporal Inconsistency Learning for DeepFake Video Detection
    Gu, Zhihao
    Chen, Yang
    Yao, Taiping
    Ding, Shouhong
    Li, Jilin
    Huang, Feiyue
    Ma, Lizhuang
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3473 - 3481
  • [7] ISTVT: Interpretable Spatial-Temporal Video Transformer for Deepfake Detection
    Zhao, Cairong
    Wang, Chutian
    Hu, Guosheng
    Chen, Haonan
    Liu, Chun
    Tang, Jinhui
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1335 - 1348
  • [8] Sharp Multiple Instance Learning for DeepFake Video Detection
    Li, Xiaodan
    Lang, Yining
    Chen, Yuefeng
    Mao, Xiaofeng
    He, Yuan
    Wang, Shuhui
    Xue, Hui
    Lu, Quan
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1864 - 1872
  • [9] On the Generalization of Deep Learning Models in Video Deepfake Detection
    Coccomini, Davide Alessandro
    Caldelli, Roberto
    Falchi, Fabrizio
    Gennaro, Claudio
    JOURNAL OF IMAGING, 2023, 9 (05)
  • [10] Deepfake Video Detection via Predictive Representation Learning
    Ge, Shiming
    Lin, Fanzhao
    Li, Chenyu
    Zhang, Daichi
    Wang, Weiping
    Zeng, Dan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (02)