SiamNet: Exploiting source camera noise discrepancies using Siamese Network for Deepfake Detection

被引:8
作者
Kingra, Staffy [1 ]
Aggarwal, Naveen [1 ]
Kaur, Nirmal [1 ]
机构
[1] Panjab Univ, UIET, Chandigarh, India
关键词
Video forensics; Deepfake detection; Facial manipulation detection; Camera noise; Face-patch;
D O I
10.1016/j.ins.2023.119341
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent advancements in deep neural networks especially GAN (Generative Adversarial Network) have resulted in the creation of more realistic deepfake media. This technology can swap the source person's face or alter facial expressions in an image or video; Media manipulated in such a way is termed deepfake. This type of manipulated media poses potential risks to journalism, politics, court proceedings and various social aspects. While existing approaches concentrate on the use of deep neural networks to directly extract facial artefacts for deepfake detection, they do not examine subtle inconsistencies in/across frame/frames. Moreover, state-of-the-art deepfake detection networks appear more complex and tend to overfit on specific artefacts which limits their generalizability on unseen data. This paper proposed a novel technique that tackles the problem of manipulated face detection in videos and images by exploiting the noise pattern inconsistency amongst face region and rest of the frame. To enable a comparison between the noise patterns of these two regions, we propose a two-stream Siamese-like network called SiamNet. This network can extract the noise patterns of the face region and patch through separate streams without increasing the number of parameters, thereby enhancing its efficiency and effectiveness. Each branch consists of pretrained Inception-v3 architecture for camera noise extraction. Siamese training is utilized to compare both noise patterns computed through different base models. The proposed two-branch network, SiamNet is found efficient for several large-scale deepfake datasets such as FF++, Celeb-DF, DFD and DFDC achieving accuracy rates of 99.7%, 98.3%, 96.08% and 89.2% respectively. Furthermore, the proposed technique exhibits greater generalizability and outperforms state-of-the-art of deepfake detection methods. Performance of the proposed model is also evaluated on FaceForensics benchmark dataset against different approaches.
引用
收藏
页数:18
相关论文
共 50 条
[1]  
Afchar D, 2018, IEEE INT WORKS INFOR
[2]   Generative Adversarial Ensemble Learning for Face Forensics [J].
Baek, Jae-Yong ;
Yoo, Yong-Sang ;
Bae, Seung-Hwan .
IEEE ACCESS, 2020, 8 :45421-45431
[3]  
Bayar B., 2016, SER IH MMSEC 16, P5, DOI 10.1145/2909827.2930786
[4]   Optical Flow based CNN for detection of unlearnt deepfake manipulations [J].
Caldelli, Roberto ;
Galteri, Leonardo ;
Amerini, Irene ;
Del Bimbo, Alberto .
PATTERN RECOGNITION LETTERS, 2021, 146 :31-37
[5]   Detecting deepfake videos based on spatiotemporal attention and convolutional LSTM [J].
Chen, Beijing ;
Li, Tianmu ;
Ding, Weiping .
INFORMATION SCIENCES, 2022, 601 :58-70
[6]   Watching the BiG artifacts: Exposing DeepFake videos via Bi-granularity artifacts [J].
Chen, Han ;
Li, Yuezun ;
Lin, Dongdong ;
Li, Bin ;
Wu, Junqiang .
PATTERN RECOGNITION, 2023, 135
[7]  
Dogonadze N, 2020, Arxiv, DOI arXiv:2004.11804
[8]  
Dolhansky B, 2020, Arxiv, DOI arXiv:2006.07397
[9]  
Dufour Nick, 2019, Google AI Blog
[10]  
Durall M., 2020, P IEEECVF C COMPUTER, P7890