Generalizable face forgery detection based on adaptive spatial-frequency information mining

被引:0
作者
Qi, Yongfeng [1 ]
Xie, Hongli [1 ,2 ]
Gao, Yajuan [1 ]
Lin, Yuanzhe [1 ]
Zhang, Heng [1 ]
Han, Haixi [1 ]
机构
[1] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Gansu, Peoples R China
[2] Northwest Normal Univ, Key Lab Cryptog & Data Analyt, Lanzhou 730070, Gansu, Peoples R China
基金
中国国家自然科学基金;
关键词
Deepfake detection; Face forgery detection; Frequency-aware learning; Spatial texture enhancement; DCT;
D O I
10.1007/s00530-025-01893-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the current field of face forgery detection, researchers are focused on recognizing forgery cues through the combination of frequency information and convolutional neural networks (CNN). However, existing methods often fail to capture spatial correlations with image content when extracting frequency features, making it difficult to accurately recognize highly simulated forged images. In addition, these methods perform well on homogeneous datasets, but their effectiveness decreases significantly when evaluated on cross-dataset samples. To address these issues, we propose a novel adaptive spatial-frequency information mining (ASFIM) method for generalizable face forgery detection. Specifically, the ASFIM method first processes the original RGB image through a frequency-aware learning module. This module extracts forgery frequency information closely related to the image content, which is subsequently used as input for frequency branching. Next, a spatial texture enhancement module is introduced to enable interaction between spatial and frequency features at an early stage. This approach not only strengthens the expressiveness of forgery features in the spatial domain but also provides an effective guide for recognizing forgery cues in the frequency domain. Finally, we designed the cross-domain interactive attention (CDIA) module to enhance forgery cues by deeply fusing spatial texture and frequency-aware features. Extensive experimental results demonstrate that the proposed ASFIM method outperforms various advanced methods in terms of generalization ability across challenging benchmark tests.
引用
收藏
页数:16
相关论文
共 60 条
[31]   Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain [J].
Liu, Honggu ;
Li, Xiaodan ;
Zhou, Wenbo ;
Chen, Yuefeng ;
He, Yuan ;
Xue, Hui ;
Zhang, Weiming ;
Yu, Nenghai .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :772-781
[32]   Detection of Deepfake Videos Using Long-Distance Attention [J].
Lu, Wei ;
Liu, Lingyi ;
Zhang, Bolin ;
Luo, Junwei ;
Zhao, Xianfeng ;
Zhou, Yicong ;
Huang, Jiwu .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) :9366-9379
[33]   Generalizing Face Forgery Detection with High-frequency Features [J].
Luo, Yuchen ;
Zhang, Yong ;
Yan, Junchi ;
Liu, Wei .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16312-16321
[34]   Two-Branch Recurrent Network for Isolating Deepfakes in Videos [J].
Masi, Iacopo ;
Killekar, Aditya ;
Mascarenhas, Royston Marian ;
Gurudatt, Shenoy Pratik ;
AbdAlmageed, Wael .
COMPUTER VISION - ECCV 2020, PT VII, 2020, 12352 :667-684
[35]   Hierarchical Frequency-Assisted Interactive Networks for Face Manipulation Detection [J].
Miao, Changtao ;
Tan, Zichang ;
Chu, Qi ;
Yu, Nenghai ;
Guo, Guodong .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 :3008-3021
[36]   DeepFake Detection Based on Discrepancies Between Faces and Their Context [J].
Nirkin, Yuval ;
Wolf, Lior ;
Keller, Yosi ;
Hassner, Tal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :6111-6121
[37]   DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms [J].
Qi, Hua ;
Guo, Qing ;
Juefei-Xu, Felix ;
Xie, Xiaofei ;
Ma, Lei ;
Feng, Wei ;
Liu, Yang ;
Zhao, Jianjun .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :4318-4327
[38]   Thinking in Frequency: Face Forgery Detection by Mining Frequency-Aware Clues [J].
Qian, Yuyang ;
Yin, Guojun ;
Sheng, Lu ;
Chen, Zixuan ;
Shao, Jing .
COMPUTER VISION - ECCV 2020, PT XII, 2020, 12357 :86-103
[39]   FaceForensics plus plus : Learning to Detect Manipulated Facial Images [J].
Roessler, Andreas ;
Cozzolino, Davide ;
Verdoliva, Luisa ;
Riess, Christian ;
Thies, Justus ;
Niessner, Matthias .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :1-11
[40]   ImageNet Large Scale Visual Recognition Challenge [J].
Russakovsky, Olga ;
Deng, Jia ;
Su, Hao ;
Krause, Jonathan ;
Satheesh, Sanjeev ;
Ma, Sean ;
Huang, Zhiheng ;
Karpathy, Andrej ;
Khosla, Aditya ;
Bernstein, Michael ;
Berg, Alexander C. ;
Fei-Fei, Li .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252