Interactive Two-Stream Network Across Modalities for Deepfake Detection

被引:8
|
作者
Wu, Jianghao [1 ]
Zhang, Baopeng [1 ]
Li, Zhaoyang [1 ]
Pang, Guilin [1 ]
Teng, Zhu [1 ]
Fan, Jianping [2 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Lenovo Res, AI Lab, Beijing 100085, Peoples R China
基金
中国国家自然科学基金;
关键词
Deepfake detection; inconsistency representation; cross-modality learning;
D O I
10.1109/TCSVT.2023.3269841
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As face forgery techniques have become more mature, the proliferation of deepfakes may threaten the security of human society. Although existing deepfake detection methods achieve good performance for in-dataset evaluation, it remains to be improved in the generalization ability, where the representation of the imperceptible artifacts plays a significant role. In this paper, we propose an Interactive Two-Stream Network (ITSNet) to explore the discriminant inconsistency representation from the perspective of cross-modality. In particular, the patch-wise Decomposable Discrete Cosine Transform (DDCT) is adopted to extract fine-grained high-frequency clues, and information from different modalities communicates with each other via a designed interaction module. To perceive the temporal inconsistency, we first develop a Short-term Embedding Module (SEM) to refine subtle local inconsistency representation between adjacent frames, and then a Long-term Embedding Module (LEM) is designed to further refine the erratic temporal inconsistency representation from the long-range perspective. Extensive experimental results conducted on three public datasets show that ITSNet outperforms the state-of-the-art methods both in terms of in-dataset and cross-dataset evaluations.
引用
收藏
页码:6418 / 6430
页数:13
相关论文
共 50 条
  • [1] Deepfake Detection using a Two-Stream Capsule Network
    Joseph, Zane
    Nyirenda, Clement
    2021 IST-AFRICA CONFERENCE (IST-AFRICA), 2021,
  • [2] Hierarchical supervisions with two-stream network for Deepfake detection
    Liang, Yufei
    Wang, Mengmeng
    Jin, Yining
    Pan, Shuwen
    Liu, Yong
    PATTERN RECOGNITION LETTERS, 2023, 172 : 121 - 127
  • [3] Locate and Verify: A Two-Stream Network for Improved Deepfake Detection
    Shuai, Chao
    Zhong, Jieming
    Wu, Shuang
    Lin, Feng
    Wang, Zhibo
    Ba, Zhongjie
    Liu, Zhenguang
    Cavallaro, Lorenzo
    Ren, Kui
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7131 - 7142
  • [4] Frequency Domain Deepfake Detection Based on Two-Stream Neural Network
    Xu Yijia
    Dong, Zhang Dong
    Sun Chengyu
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705
  • [5] Saliency detection network with two-stream encoder and interactive decoder
    Yang, Aiping
    Cheng, Simeng
    Song, Shangyang
    Wang, Jinbin
    Ji, Zhong
    Pang, Yanwei
    Cao, Jiale
    NEUROCOMPUTING, 2022, 509 : 56 - 67
  • [6] Two-Stream Xception Structure Based on Feature Fusion for DeepFake Detection
    Bin Wang
    Liqing Huang
    Tianqiang Huang
    Feng Ye
    International Journal of Computational Intelligence Systems, 16
  • [7] Two-Stream Xception Structure Based on Feature Fusion for DeepFake Detection
    Wang, Bin
    Huang, Liqing
    Huang, Tianqiang
    Ye, Feng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [8] BiFAT: Bilateral Filtering and Attention Mechanisms in a Two-Stream Model for Deepfake Detection
    Zhang, Lei
    Yi, Ceyuan
    Liu, Liang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 231 - 247
  • [9] Potential Attacks of DeepFake on eKYC Systems and Remedy for eKYC with DeepFake Detection Using Two-Stream Network of Facial Appearance and Motion Features
    Do T.-L.
    Tran M.-K.
    Nguyen H.H.
    Tran M.-T.
    SN Computer Science, 3 (6)
  • [10] Interactive Two-Stream Decoder for Accurate and Fast Saliency Detection
    Zhou, Huajun
    Xie, Xiaohua
    Lai, Jian-Huang
    Chen, Zixuan
    Yang, Lingxiao
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, : 9138 - 9147