A Robust GAN-Generated Face Detection Method Based on Dual-Color Spaces and an Improved Xception

Cited by: 74
Authors
Chen, Beijing [1, 2, 3]
Liu, Xin [1, 2, 3]
Zheng, Yuhui [1, 2, 3]
Zhao, Guoying [4]
Shi, Yun-Qing [5]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Engn Res Ctr Digital Forens, Minist Educ, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Comp, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Peoples R China
[4] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu 90014, Finland
[5] New Jersey Inst Technol, Dept Elect & Comp Engn, Newark, NJ 07102 USA
Funding
National Natural Science Foundation of China;
Keywords
Faces; Feature extraction; Image color analysis; Convolution; Face detection; Robustness; Convolutional neural networks; Generated face; generative adversarial network; Xception; color space; NETWORKS;
DOI
10.1109/TCSVT.2021.3116679
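Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes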
0808 ; 0809 ;
Abstract
In recent years, generative adversarial networks (GANs) have been widely used to generate realistic fake face images, which can easily deceive human beings. To detect these images, some methods have been proposed. However, their detection performance will be degraded greatly when the testing samples are post-processed. In this paper, some experimental studies on detecting post-processed GAN-generated face images find that (a) both the luminance component and chrominance components play an important role, and (b) the RGB and YCbCr color spaces achieve better performance than the HSV and Lab color spaces. Therefore, to enhance the robustness, both the luminance component and chrominance components of dual-color spaces (RGB and YCbCr) are considered to utilize color information effectively. In addition, the convolutional block attention module and multilayer feature aggregation module are introduced into the Xception model to enhance its feature representation power and aggregate multilayer features, respectively. Finally, a robust dual-stream network is designed by integrating dual-color spaces RGB and YCbCr and using an improved Xception model. Experimental results demonstrate that our method outperforms some existing methods, especially in its robustness against different types of post-processing operations, such as JPEG compression, Gaussian blurring, gamma correction, and median filtering.
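The dual-color-space input described in the abstract can be illustrated with a minimal sketch. The abstract does not state which RGB-to-YCbCr conversion is used, so the full-range ITU-R BT.601 (JPEG/JFIF) coefficients below are an assumption, and the function names `rgb_to_ycbcr` and `dual_color_inputs` are hypothetical helpers, not part of the published method.

```python
import numpy as np

# Assumption: full-range BT.601 (JPEG/JFIF) coefficients; the abstract
# does not say which YCbCr variant the authors use.
_RGB_TO_YCBCR = np.array([
    [0.299, 0.587, 0.114],           # Y  (luminance)
    [-0.168736, -0.331264, 0.5],     # Cb (blue-difference chrominance)
    [0.5, -0.418688, -0.081312],     # Cr (red-difference chrominance)
])
_OFFSET = np.array([0.0, 128.0, 128.0])


def rgb_to_ycbcr(rgb: np.ndarray) -> np.ndarray:
    """Convert an H x W x 3 uint8 RGB image to uint8 YCbCr."""
    ycbcr = rgb.astype(np.float64) @ _RGB_TO_YCBCR.T + _OFFSET
    return np.clip(np.rint(ycbcr), 0, 255).astype(np.uint8)


def dual_color_inputs(rgb: np.ndarray):
    """Return the (RGB, YCbCr) pair that a two-stream network could consume."""
    return rgb, rgb_to_ycbcr(rgb)
```

Each element of the returned pair would feed one stream of a two-stream classifier; the network itself (the improved Xception with attention and feature aggregation) is outside the scope of this sketch.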
Pages: 3527-3538
Page count: 12