Unmasking Deception: Empowering Deepfake Detection with Vision Transformer Network

被引:7
作者
Arshed, Muhammad Asad [1 ,2 ]
Alwadain, Ayed [3 ]
Ali, Rao Faizan [2 ]
Mumtaz, Shahzad [4 ]
Ibrahim, Muhammad [1 ]
Muneer, Amgad [5 ,6 ]
机构
[1] Islamia Univ Bahawalpur, Dept Comp Sci, Bahawalpur 63100, Pakistan
[2] Univ Management & Technol, Sch Syst & Technol, Lahore 54770, Pakistan
[3] King Saud Univ, Community Coll, Comp Sci Dept, Riyadh 145111, Saudi Arabia
[4] Islamia Univ Bahawalpur, Dept Data Sci, Bahawalpur 63100, Pakistan
[5] Univ Texas MD Anderson Canc Ctr, Dept Imaging Phys, Houston, TX 77030 USA
[6] Univ Teknol Petronas, Dept Comp & Informat Sci, Seri Iskandar 32160, Malaysia
关键词
deepfake; identification; Vision Transformer; pretrained; fine tuning;
D O I
10.3390/math11173710
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
With the development of image-generating technologies, significant progress has been made in the field of facial manipulation techniques. These techniques allow people to easily modify media information, such as videos and images, by substituting the identity or facial expression of one person with the face of another. This has significantly increased the availability and accessibility of such tools and manipulated content termed 'deepfakes'. Developing an accurate method for detecting fake images needs time to prevent their misuse and manipulation. This paper examines the capabilities of the Vision Transformer (ViT), i.e., extracting global features to detect deepfake images effectively. After conducting comprehensive experiments, our method demonstrates a high level of effectiveness, achieving a detection accuracy, precision, recall, and F1 rate of 99.5 to 100% for both the original and mixture data set. According to our existing understanding, this study is a research endeavor incorporating real-world applications, specifically examining Snapchat-filtered images.
引用
收藏
页数:13
相关论文
共 35 条
  • [1] Akhtar Zahid, 2020, 2020 IEEE International Conference on Humanized Computing and Communication with Artificial Intelligence (HCCAI), P55, DOI 10.1109/HCCAI49649.2020.00015
  • [2] Akhtar Z., 2019, PROC IEEE INT S TECH, P1, DOI DOI 10.1109/HST47167.2019.9033005
  • [3] [Anonymous], Media Forensics
  • [4] [Anonymous], IEEE C PUBLICATION
  • [5] [Anonymous], 140 k Real and Fake Faces|Kaggle
  • [6] [Anonymous], Share the Moment|Snapchat
  • [7] Bekci B, 2020, SIG PROCESS COMMUN
  • [8] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [9] Ciftci UA, 2020, IEEE/IAPR INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2020)
  • [10] Cordonnier J.B., 2020, P 8 INT C LEARNING R