APELID: Enhancing real-time intrusion detection with augmented WGAN and parallel ensemble learning

被引:20
作者
Vo, Hoang V. [1 ]
Du, Hanh P. [1 ]
Nguyen, Hoa N. [1 ]
机构
[1] Vietnam Natl Univ Hanoi, VNU Univ Engn & Technol, Dept Informat Syst, Hanoi 100000, Vietnam
关键词
AI-powered intrusion detection; Traffic deep analysis; Data augmentation; Wasserstein generative adversarial networks; Deep neural network; EXtreme gradient boosting; Gradient boosting on decision trees; Bagging meta-estimator; Parallel ensemble learning; NEURAL-NETWORK; CLASSIFICATION; MODEL;
D O I
10.1016/j.cose.2023.103567
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an AI-powered intrusion detection method that improves intrusion detection performance by increasing the quality of the training set and employing numerous potent AI models. Composed of the Augmented Wasserstein Generative Adversarial Networks (AWGAN) and Parallel Ensemble Learning-based Intrusion Detection (PELID) algorithms, it is referred to as APELID. First, to augment the training set quality, AWGAN combines a clustering algorithm to select representative samples from the majority classes and WGAN to generate more realistic samples from the minority classes. Second, PELID employs a weighted ensemble of multiple efficient AI models in parallel to improve the efficacy of AI-powered intrusion detection. In addition, APELID also incorporates a sandbox-based malware analyzer. It aims to enrich the indicators of compromise for preventing malicious files that have been transferred over the network. Rigorous experiments utilizing well-known datasets, such as CSE-CIC-IDS2018 and NSL-KDD, are conducted in order to evaluate APELID. Hence, it achieves an outstanding F1-score of 99.99% and 99.65% and a remarkably low false negative rate of 0.00% and 0.34%, respectively, which is superior to state-of-the-art methods. In addition, the average PELID-based detection time (i.e., 22.29 mu s/flow) for a single network traffic flow is fast enough to detect intrusions in real-time.
引用
收藏
页数:14
相关论文
共 58 条
[1]  
Adam-Bourdarios C., 2014, P 2014 INT C HIGH EN, P19
[2]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[3]   STL-HDL: A new hybrid network intrusion detection system for imbalanced dataset on big data environment [J].
Al, Samed ;
Dener, Murat .
COMPUTERS & SECURITY, 2021, 110
[4]   The sound of intrusion: A novel network intrusion detection system [J].
Aldarwbi, Mohammed Y. ;
Lashkari, Arash H. ;
Ghorbani, Ali A. .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
[5]  
Alrawashdeh K, 2016, 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), P195, DOI [10.1109/ICMLA.2016.167, 10.1109/ICMLA.2016.0040]
[6]  
Alvarez-Horcajo J, 2017, IEEE INT CONF CL NET, P65
[7]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[8]   Discovering Parametric Activation Functions [J].
Bingham, Garrett ;
Miikkulainen, Risto .
NEURAL NETWORKS, 2022, 148 :48-65
[9]  
Bontemps L, 2017, Arxiv, DOI [arXiv:1703.09752, 10.48550/arXiv.1703.09752, DOI 10.48550/ARXIV.1703.09752]
[10]   A Hierarchical Hybrid Intrusion Detection Approach in IoT Scenarios [J].
Bovenzi, Giampaolo ;
Aceto, Giuseppe ;
Ciuonzo, Domenico ;
Persico, Valerio ;
Pescape, Antonio .
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,