Convolutional long short-term memory-based approach for deepfakes detection from videos

被引:5
作者
Nawaz, Marriam [1 ]
Javed, Ali [1 ]
Irtaza, Aun [2 ]
机构
[1] UET Taxila, Dept Software Engn, Taxila 47050, Punjab, Pakistan
[2] UET Taxila, Dept Comp Sci, Taxila 47050, Punjab, Pakistan
关键词
CNN; Deepfakes; Bi-LSTM; Deep learning; Multimedia forensic; SALIENCY DETECTION; IMAGES;
D O I
10.1007/s11042-023-16196-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The great development in the area of Artificial Intelligence (AI) has introduced tremendous advancements in information technology. Moreover, the introduction of lightweight machine learning (ML) techniques allows the applications to work with limited storage and processing power. Deepfakes is among the most famous type of such applications of this era which generates a large amount of fake and modified audiovisual data. The creation of such fake data has introduced a serious risk to the security and confidentiality of humans all around the globe. Accurate detection and classification of actual and deepfakes content is a challenging task due to the progression of Generative adversarial networks (GANs) which produce such convincing manipulated content that it's impossible for people to recognize it through their naked eyes. In this work, we have presented deep learning (DL)-based approach namely the convolutional long short-term memory (C-LSTM) method for deepfakes detection from videos. More specifically, the spatial information from the input sample is calculated by employing various pre-trained models like VGG16, VGG19, ResNet50, XceptionNet, and GoogleNet, DenseNet. Further, we have proposed a novel feature descriptor called the Dense-Swish-Net121. Whereas the Bi-LSTM model is utilized to compute the temporal information. Lastly, the results are predicted based on both the frame level and temporal level information to make the final decision. A detailed comparison of all CNN models with the Bi-LSTM approach is performed and has confirmed through the reported results that the proposed Dense-Swish-Net121 with Bi-LSTM approach performs well for deepfakes detection.
引用
收藏
页码:16977 / 17000
页数:24
相关论文
共 49 条
[1]  
Agarwal S., 2019, CVPR WORKSH, P38, DOI [10.4108/eai.18-7-2019, DOI 10.4108/EAI.18-7-2019]
[2]  
Agarwal S., 2019, CVPR WORKSHOPS, V1
[3]   DCNet: DenseNet-77-based CornerNet model for the tomato plant leaf disease detection and classification [J].
Albahli, Saleh ;
Nawaz, Marriam .
FRONTIERS IN PLANT SCIENCE, 2022, 13
[4]   Recognition and Detection of Diabetic Retinopathy Using Densenet-65 Based Faster-RCNN [J].
Albahli, Saleh ;
Nazir, Tahira ;
Irtaza, Aun ;
Javed, Ali .
CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (02) :1333-1351
[5]   A novel deep learning method for detection and classification of plant diseases [J].
Albattah, Waleed ;
Nawaz, Marriam ;
Javed, Ali ;
Masood, Momina ;
Albahli, Saleh .
COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (01) :507-524
[6]  
Ballester P, 2016, AAAI CONF ARTIF INTE, P1124
[7]  
Baltrusaitis T, 2016, IEEE WINT CONF APPL
[8]   Exposing Computer Generated Images by Eye's Region Classification via Transfer Learning of VGG19 CNN [J].
Carvalho, Tiago ;
de Rezende, Edmar R. S. ;
Alves, Matheus T. P. ;
Balieiro, Fernanda K. C. ;
Sovat, Ricardo B. .
2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, :866-870
[9]   Exploring Rich and Efficient Spatial Temporal Interactions for Real-Time Video Salient Object Detection [J].
Chen, Chenglizhao ;
Wang, Guotao ;
Peng, Chong ;
Fang, Yuming ;
Zhang, Dingwen ;
Qin, Hong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3995-4007
[10]   Improved Robust Video Saliency Detection Based on Long-Term Spatial-Temporal Information [J].
Chen, Chenglizhao ;
Wang, Guotao ;
Peng, Chong ;
Zhang, Xiaowei ;
Qin, Hong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :1090-1100