Machine Learning Algorithms for Raw and Unbalanced Intrusion Detection Data in a Multi-Class Classification Problem

被引:12
作者
Bacevicius, Mantas [1 ]
Paulauskaite-Taraseviciene, Agne [1 ]
机构
[1] Kaunas Univ Technol, Fac Informat, Studentu 50, LT-51368 Kaunas, Lithuania
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 12期
关键词
intrusion; machine learning; XAI; imbalanced dataset; multi-class classification; DETECTION SYSTEM; IN-VEHICLE; NETWORK;
D O I
10.3390/app13127328
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Various machine learning algorithms have been applied to network intrusion classification problems, including both binary and multi-class classifications. Despite the existence of numerous studies involving unbalanced network intrusion datasets, such as CIC-IDS2017, a prevalent approach is to address the issue by either merging the classes to optimize their numbers or retaining only the most dominant ones. However, there is no consistent trend showing that accuracy always decreases as the number of classes increases. Furthermore, it is essential for cybersecurity practitioners to recognize the specific type of attack and comprehend the causal factors that contribute to the resulting outcomes. This study focuses on tackling the challenges associated with evaluating the performance of multi-class classification for network intrusions using highly imbalanced raw data that encompasses the CIC-IDS2017 and CSE-CIC-IDS2018 datasets. The research concentrates on investigating diverse machine learning (ML) models, including Logistic Regression, Random Forest, Decision Trees, CNNs, and Artificial Neural Networks. Additionally, it explores the utilization of explainable AI (XAI) methods to interpret the obtained results. The results obtained indicated that decision trees using the CART algorithm performed best on the 28-class classification task, with an average macro F1-score of 0.96878.
引用
收藏
页数:21
相关论文
共 52 条
[1]   Features Dimensionality Reduction Approaches for Machine Learning Based Network Intrusion Detection [J].
Abdulhammed, Razan ;
Musafer, Hassan ;
Alessa, Ali ;
Faezipour, Miad ;
Abuzneid, Abdelshakour .
ELECTRONICS, 2019, 8 (03)
[2]  
Abdulrahman A. A., 2020, Samarra J. Pure Appl. Sci, V2, P132, DOI DOI 10.54153/SJPAS.2020.V2I3.86
[3]  
Alshamy Reem, 2021, Advances in Cyber Security: Third International Conference, ACeS 2021, Penang, Malaysia, August 24-25, 2021, Revised Selected Papers. Communications in Computer and Information Science (1487), P361, DOI 10.1007/978-981-16-8059-5_22
[4]   A Cloud Intrusion Detection Systems Based on DNN Using Backpropagation and PSO on the CSE-CIC-IDS2018 Dataset [J].
Alzughaibi, Saud ;
El Khediri, Salim .
APPLIED SCIENCES-BASEL, 2023, 13 (04)
[5]   Network intrusion detection using multi-architectural modular deep neural network [J].
Atefinia, Ramin ;
Ahmadi, Mahmood .
JOURNAL OF SUPERCOMPUTING, 2021, 77 (04) :3571-3593
[6]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[7]   Decision-forest voting scheme for classification of rare classes in network intrusion detection [J].
Brabec, Jan ;
Machlica, Lukas .
2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, :3325-3330
[8]   Study of Multi-Class Classification Algorithms' Performance on Highly Imbalanced Network Intrusion Datasets [J].
Bulavas, Viktoras ;
Marcinkevicius, Virginijus ;
Ruminski, Jacek .
INFORMATICA, 2021, 32 (03) :441-475
[9]  
Erlacher F, 2018, IEEE IFIP NETW OPER
[10]  
Farhan BI., 2022, Indonesian J Electric Eng Comput Sci, V26, P1165, DOI DOI 10.11591/IJEECS.V26.I2.PP1165-1172