Using Markov Chain Monte Carlo Algorithm for Sampling Imbalance Binary IDS Datasets

被引:0
作者
Abedzadeh, Najmeh [1 ]
Jacobs, Dr. Matthew [1 ]
机构
[1] Catholic Univ Amer, EECS Dept, Sch Engn, Washington, DC USA
来源
2022 31ST INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS (ICCCN 2022) | 2022年
关键词
Imbalanced binary IDS Datasets; machine learning; CSE-CIC-IDS2018; Markov Chain Monte Carlo; Generative Adversarial Networks; NETWORK;
D O I
10.1109/ICCCN54977.2022.9868900
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Malicious cyber-attacks can hide themselves in large imbalanced datasets which makes it difficult for Intrusion Detection Systems (IDS) to detect them. Much research has been done in using machine learning and deep learning algorithms for Intrusion Detection Systems, but only a small number of them address imbalanced datasets. In this paper, we apply Markov Chain Monte Carlo (MCMC) Algorithm, Generative Adversarial Networks (GANs) algorithm, and oversampling to the CSE-CIC-IDS2018 dataset to balance the dataset. Then, we compare different Machine Learning algorithms with original dataset, balanced dataset with oversampling, balanced dataset with GAN, and balanced dataset with MCMC. The results show that neither MCMC Algorithm nor GANs is a good algorithm for balancing imbalance binary IDS datasets. The results show that Logistic Regression with original imbalanced dataset provided the best performance in predicting the attacks in imbalance binary IDS datasets with accuracy of 0.88, precision of 0.88, recall of 0.99, Mean Error of 0.11, Root Mean Square Error of 0.33, and Mean Absolute Error of 0.11. Neural Network and GANs performance were close to Logistic Regression specifically regarding accuracy, but Logistic Regression was faster.
引用
收藏
页数:7
相关论文
共 16 条
[1]   Deep and Machine Learning Approaches for Anomaly-Based Intrusion Detection of Imbalanced Network Traffic [J].
Abdulhammed, Razan ;
Faezipour, Miad ;
Abuzneid, Abdelshakour ;
AbuMallouh, Arafat .
IEEE SENSORS LETTERS, 2019, 3 (01)
[2]  
Ali A., 2015, INT J ADV SOFT COMPU, V7, P176
[3]  
Bailey Alexander, 2020, INTRO MCMC
[4]  
Bedi P., 2020, Procedia Comput. Sci, V171, p780_789
[5]  
Brownlee Jason, 2019, DEV CONDITIONAL GAN
[6]   Classification of Imbalanced Data by Combining the Complementary Neural Network and SMOTE Algorithm [J].
Jeatrakul, Piyasak ;
Wong, Kok Wai ;
Fung, Chun Che .
NEURAL INFORMATION PROCESSING: MODELS AND APPLICATIONS, PT II, 2010, 6444 :152-159
[7]  
Koehrsen Will, 2018, MARKOV CHAIN MONTE C
[8]  
Linekar Dino, 2021, 2020 INT C COMMUNICA
[9]   Intrusion Detection of Imbalanced Network Traffic Based on Machine Learning and Deep Learning [J].
Liu, Lan ;
Wang, Pengcheng ;
Lin, Jun ;
Liu, Langzhou .
IEEE ACCESS, 2021, 9 :7550-7563
[10]  
MASEER ZIADOON KAMIL, 2021, IEEE, V9