Video Deepfake classification using particle swarm optimization-based evolving ensemble models

被引:8
作者
Zhang, Li [1 ]
Zhao, Dezong [2 ]
Lim, Chee Peng [3 ]
Asadi, Houshyar [3 ]
Huang, Haoqian [4 ]
Yu, Yonghong [5 ]
Gao, Rong [6 ]
机构
[1] Univ London, Dept Comp Sci, Royal Holloway, London TW20 0EX, Surrey, England
[2] Univ Glasgow, James Watt Sch Engn, Glasgow City G12 8QQ, Scotland
[3] Deakin Univ, Inst Intelligent Syst Res & Innovat, Geelong, Vic 3216, Australia
[4] Hohai Univ, Coll Energy & Elect Engn, Nanjing 210098, Peoples R China
[5] Nanjing Univ Posts & Telecommun, Coll Tongda, Nanjing 210023, Peoples R China
[6] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
关键词
Video deepfake classification; Hybrid deep neural network; 3d convolutional neural network; Evolutionary algorithm; Evolving ensemble classifier; FIREFLY ALGORITHM; ARCHITECTURES; REGRESSION; NETWORKS; IMAGES;
D O I
10.1016/j.knosys.2024.111461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent breakthrough of deep learning based generative models has led to the escalated generation of photorealistic synthetic videos with significant visual quality. Automated reliable detection of such forged videos requires the extraction of fine-grained discriminative spatial-temporal cues. To tackle such challenges, we propose weighted and evolving ensemble models comprising 3D Convolutional Neural Networks (CNNs) and CNNRecurrent Neural Networks (RNNs) with Particle Swarm Optimization (PSO) based network topology and hyperparameter optimization for video authenticity classification. A new PSO algorithm is proposed, which embeds Muller's method and fixed-point iteration based leader enhancement, reinforcement learning-based optimal search action selection, a petal spiral simulated search mechanism, and cross-breed elite signal generation based on adaptive geometric surfaces. The PSO variant optimizes the RNN topologies in CNN-RNN, as well as key learning configurations of 3D CNNs, with the attempt to extract effective discriminative spatial-temporal cues. Both weighted and evolving ensemble strategies are used for ensemble formulation with aforementioned optimized networks as base classifiers. In particular, the proposed PSO algorithm is used to identify optimal subsets of optimized base networks for dynamic ensemble generation to balance between ensemble complexity and performance. Evaluated using several well-known synthetic video datasets, our approach outperforms existing studies and various ensemble models devised by other search methods with statistical significance for video authenticity classification. The proposed PSO model also illustrates statistical superiority over a number of search methods for solving optimization problems pertaining to a variety of artificial landscapes with diverse geometrical layouts.
引用
收藏
页数:40
相关论文
共 123 条
[1]  
Afchar D, 2018, IEEE INT WORKS INFOR
[2]   A Review of Modern Audio Deepfake Detection Methods: Challenges and Future Directions [J].
Almutairi, Zaynab ;
Elgibreen, Hebah .
ALGORITHMS, 2022, 15 (05)
[3]  
Baker B., 2016, P INT C LEARNING REP
[4]   Deep4SNet: deep learning for fake speech classification [J].
Ballesteros, M. Dora ;
Rodriguez-Ortega, Yohanna ;
Renza, Diego ;
Arce, Gonzalo .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184
[5]  
Bayar B., 2016, P 4 ACM WORKSH INF H, P5, DOI DOI 10.1145/2909827.2930786
[6]   EVOLVING DEEP ENSEMBLES FOR DETECTING COVID-19 IN CHEST X-RAYS [J].
Bosowski, Piotr ;
Bosowska, Joanna ;
Nalepa, Jakub .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :3772-3776
[7]   A survey of evolutionary algorithms for supervised ensemble learning [J].
Cagnini, Henry E. L. ;
Das Dores, Silvia C. N. ;
Freitas, Alex A. ;
Barros, Rodrigo C. .
KNOWLEDGE ENGINEERING REVIEW, 2023, 38 (05)
[8]   Ensemble learning particle swarm optimization for real-time UWB indoor localization [J].
Cai, Xufen ;
Ye, Long ;
Zhang, Qin .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
[9]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[10]   Detecting deepfake videos based on spatiotemporal attention and convolutional LSTM [J].
Chen, Beijing ;
Li, Tianmu ;
Ding, Weiping .
INFORMATION SCIENCES, 2022, 601 :58-70