A facial geometry based detection model for face manipulation using CNN-LSTM architecture

被引:16
作者
Liang, Peifeng [1 ]
Liu, Gang [3 ]
Xiong, Zenggang [2 ]
Fan, Honghui [1 ]
Zhu, Hongjin [1 ]
Zhang, Xuemin [2 ]
机构
[1] Jiangsu Univ Technol, Sch Comp Engn, Changzhou 213001, Jiangsu, Peoples R China
[2] Hubei Engn Univ, Sch Comp & Informat Sci, Xiaogan 432000, Hubei, Peoples R China
[3] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
DeepFake detection; CNN-LSTM; Facial geometry prior module; Resampling; Facial analysis; ENERGY MINIMIZATION;
D O I
10.1016/j.ins.2023.03.079
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This issue of DeepFake technique that may cause great threat to privacy, democracy, and national security has attracted the attention of deep learning researchers. DeepFake detection, therefore, has been a very hot issue in deep learning research. The face landmark feature maps are often used by many DeepFake approaches in generating fake faces. It also provides key information to help to detect manipulated face images. In this paper, we propose a detection approach for manipulated face images. To make full use of face landmark information, we propose a facial geometry prior module (FGPM) to extract facial geometry feature maps. Then the facial geometry feature maps are embedded into upsampling feature maps generated by the CNN-LSTM network. The proposed FGPM exploits facial maps and frequency domain correlation to analyze the discriminative characteristics between manipulated and non-manipulated regions by incorporating the CNN-LSTM network. Finally, a decoder is used to learn the mapping from low -resolution feature maps to pixel-wise to predict manipulation localization. Or a softmax classifier is used to predict real or fake face images. By experimenting on several popular datasets, the proposed detection model has demonstrated the capability of localizing manipulation at the pixel level and with a high prediction on real or fake face images.
引用
收藏
页码:370 / 383
页数:14
相关论文
共 49 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   Towards Open-Set Identity Preserving Face Synthesis [J].
Bao, Jianmin ;
Chen, Dong ;
Wen, Fang ;
Li, Houqiang ;
Hua, Gang .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6713-6722
[3]   Hybrid LSTM and Encoder-Decoder Architecture for Detection of Image Forgeries [J].
Bappy, Jawadul H. ;
Simons, Cody ;
Nataraj, Lakshmanan ;
Manjunath, B. S. ;
Roy-Chowdhury, Amit K. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) :3286-3300
[4]   Face swapping: Automatically replacing faces in photographs [J].
Bitouk, Dmitri ;
Kumar, Neeraj ;
Dhillon, Samreen ;
Belhumeur, Peter ;
Nayar, Shree K. .
ACM TRANSACTIONS ON GRAPHICS, 2008, 27 (03)
[5]   Detection and Localization of Image Forgeries using Resampling Features and Deep Learning [J].
Bunk, Jason ;
Bappy, Jawadul H. ;
Mohammed, Tajuddin Manhar ;
Nataraj, Lakshmanan ;
Flenner, Arjuna ;
Manjunath, B. S. ;
Chandrasekaran, Shivkumar ;
Roy-Chowdhury, Amit K. ;
Peterson, Lawrence .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1881-1889
[6]   Detecting deepfake videos based on spatiotemporal attention and convolutional LSTM [J].
Chen, Beijing ;
Li, Tianmu ;
Ding, Weiping .
INFORMATION SCIENCES, 2022, 601 :58-70
[7]  
Ciftci Umur Aybars, 2020, IEEE Trans Pattern Anal Mach Intell, VPP, DOI 10.1109/TPAMI.2020.3009287
[8]   ArcFace: Additive Angular Margin Loss for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Xue, Niannan ;
Zafeiriou, Stefanos .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4685-4694
[9]  
Dong XY, 2020, Arxiv, DOI arXiv:2012.03930
[10]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672