Deepfake Video Detection Method Improved by GRU and Involution

被引:1
作者
Liu, Yalin [1 ]
Lu, Tianliang [1 ]
机构
[1] College of Information and Cyber Security, People’s Public Security University of China, Beijing
关键词
capsule network; deepfake; focalloss; gated recurrent unit (GRU); Involution;
D O I
10.3778/j.issn.1002-8331.2206-0510
中图分类号
学科分类号
摘要
In recent years, the wide spread of deepfake video on the network has caused a negative impact. In order to solve the problems of low accuracy of existing detection models and insufficient and comprehensive information extraction, an improved deepfake video detection method based on gated recurrent unit (GRU)and Involution is proposed. Firstly, a feature extraction network is constructed based on Involution operator to extract global feature information, which enhances the spatial modeling ability of face image from spatial and channel information. Then, the temporal features are extracted through the location and inter-frame information of the main capsule layer and GRU concern features. Finally, focalloss is used as the loss function to balance the samples in the training model phase. The method is tested in Deepfakes, FaceSwap and Celeb-DF datasets, and the experimental results show that the method is better than the mainstream detection methods. The improved comparative experiments further prove the effectiveness of the detection method. © The Author(s) 2023.
引用
收藏
页码:276 / 283
页数:7
相关论文
共 33 条
[1]  
STUPP C., Fraudsters used AI to mimic CEO’s voice in unusual cybercrime case[J], The Wall Street Journal, 30, 8, (2019)
[2]  
SUWAJANAKORN S, SEITZ S I, KEMELMACHERSHLIZERMAN I., Synthesizing Obama:learning lip sync from,audio[J], ACM Transactions on Graphics, 36, 4, (2017)
[3]  
KOOPMAN M, RODRIGUEZ A M, GERADTS Z., Detection of deepfake video manipulation[C], Proceedings of the 20th Irish Machine Vision and Image Processing Conference, pp. 133-136, (2018)
[4]  
LI J C, LIU B B, HU Y J,, Et al., Deepfake video detection based on consistency of illumination direction[J], Journal of Nanjing University of Aeronautics & Astronautics, 52, 5, pp. 760-767, (2020)
[5]  
YANG J, XIAO S, LI A, Et al., Detecting fake images by identifying potential texture difference[J], Future Generation Computer Systems, 125, 7553, pp. 127-135, (2021)
[6]  
DEMIR I, CIFTCI U A., Where do deep fakes look? Synthetic face detection via gaze tracking[C], Proceedings of the 2021 ACM Symposium on Eye Tracking Research and Applications, pp. 1-11, (2021)
[7]  
LI X, LANG Y, CHEN Y, Et al., Sharp multiple instance learning for deepfake video detection[C], Proceedings of the 28th ACM International Conference on Multimedia, pp. 1864-1872, (2020)
[8]  
NGUYEN H, ECHIZEN I., Use of a capsule network to detect fake images and videos[J], (2019)
[9]  
HARA K, KATAOK H, SATOH Y., Learning spatio- temporal features with 3D residual networks for action recognition[C], Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, pp. 3154-3160, (2017)
[10]  
CARREIRA J, ZISSERMAN A., Quo vadis,action recognition? A new model and the kinetics dataset[C], Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 4724-4733, (2017)