Video Deepfake classification using particle swarm optimization-based evolving ensemble models

被引:8
作者
Zhang, Li [1 ]
Zhao, Dezong [2 ]
Lim, Chee Peng [3 ]
Asadi, Houshyar [3 ]
Huang, Haoqian [4 ]
Yu, Yonghong [5 ]
Gao, Rong [6 ]
机构
[1] Univ London, Dept Comp Sci, Royal Holloway, London TW20 0EX, Surrey, England
[2] Univ Glasgow, James Watt Sch Engn, Glasgow City G12 8QQ, Scotland
[3] Deakin Univ, Inst Intelligent Syst Res & Innovat, Geelong, Vic 3216, Australia
[4] Hohai Univ, Coll Energy & Elect Engn, Nanjing 210098, Peoples R China
[5] Nanjing Univ Posts & Telecommun, Coll Tongda, Nanjing 210023, Peoples R China
[6] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
关键词
Video deepfake classification; Hybrid deep neural network; 3d convolutional neural network; Evolutionary algorithm; Evolving ensemble classifier; FIREFLY ALGORITHM; ARCHITECTURES; REGRESSION; NETWORKS; IMAGES;
D O I
10.1016/j.knosys.2024.111461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent breakthrough of deep learning based generative models has led to the escalated generation of photorealistic synthetic videos with significant visual quality. Automated reliable detection of such forged videos requires the extraction of fine-grained discriminative spatial-temporal cues. To tackle such challenges, we propose weighted and evolving ensemble models comprising 3D Convolutional Neural Networks (CNNs) and CNNRecurrent Neural Networks (RNNs) with Particle Swarm Optimization (PSO) based network topology and hyperparameter optimization for video authenticity classification. A new PSO algorithm is proposed, which embeds Muller's method and fixed-point iteration based leader enhancement, reinforcement learning-based optimal search action selection, a petal spiral simulated search mechanism, and cross-breed elite signal generation based on adaptive geometric surfaces. The PSO variant optimizes the RNN topologies in CNN-RNN, as well as key learning configurations of 3D CNNs, with the attempt to extract effective discriminative spatial-temporal cues. Both weighted and evolving ensemble strategies are used for ensemble formulation with aforementioned optimized networks as base classifiers. In particular, the proposed PSO algorithm is used to identify optimal subsets of optimized base networks for dynamic ensemble generation to balance between ensemble complexity and performance. Evaluated using several well-known synthetic video datasets, our approach outperforms existing studies and various ensemble models devised by other search methods with statistical significance for video authenticity classification. The proposed PSO model also illustrates statistical superiority over a number of search methods for solving optimization problems pertaining to a variety of artificial landscapes with diverse geometrical layouts.
引用
收藏
页数:40
相关论文
共 123 条
[41]   Evolving Deep Architecture Generation with Residual Connections for Image Classification Using Particle Swarm Optimization [J].
Lawrence, Tom ;
Zhang, Li ;
Rogage, Kay ;
Lim, Chee Peng .
SENSORS, 2021, 21 (23)
[42]   A ranking-system-based switching particle swarm optimizer with dynamic learning strategies [J].
Li, Han ;
Li, Juan ;
Wu, Peishu ;
You, Yancheng ;
Zeng, Nianyin .
NEUROCOMPUTING, 2022, 494 :356-367
[43]   Face X-ray for More General Face Forgery Detection [J].
Li, Lingzhi ;
Bao, Jianmin ;
Zhang, Ting ;
Yang, Hao ;
Chen, Dong ;
Wen, Fang ;
Guo, Baining .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5000-5009
[44]   Sharp Multiple Instance Learning for DeepFake Video Detection [J].
Li, Xiaodan ;
Lang, Yining ;
Chen, Yuefeng ;
Mao, Xiaofeng ;
He, Yuan ;
Wang, Shuhui ;
Xue, Hui ;
Lu, Quan .
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, :1864-1872
[45]  
[李艳歌 Li Yange], 2018, [高分子通报, Polymer Bulletin], P46
[46]  
Li YZ, 2018, IEEE INT WORKS INFOR
[47]   Celeb-DF: A Large-scale Challenging Dataset for DeepFake Forensics [J].
Li, Yuezun ;
Yang, Xin ;
Sun, Pu ;
Qi, Honggang ;
Lyu, Siwei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :3204-3213
[48]   Towards improved multifactorial particle swarm optimization learning of fuzzy cognitive maps: A case study on air quality prediction [J].
Liang, Weiling ;
Zhang, Yingjun ;
Liu, Xiaoqian ;
Yin, Hui ;
Wang, Jingping ;
Yang, Yanyan .
APPLIED SOFT COMPUTING, 2022, 130
[49]   A hybrid approach for high-dimensional optimization: Combining particle swarm optimization with mechanisms in neuro-endocrine-immune systems [J].
Liu, Bao ;
Xu, Mei ;
Gao, Lei ;
Yang, Jinying ;
Di, Xin .
KNOWLEDGE-BASED SYSTEMS, 2022, 253
[50]   I3D-Shufflenet Based Human Action Recognition [J].
Liu, Guocheng ;
Zhang, Caixia ;
Xu, Qingyang ;
Cheng, Ruoshi ;
Song, Yong ;
Yuan, Xianfeng ;
Sun, Jie .
ALGORITHMS, 2020, 13 (11)