SMMO-CoFS: Synthetic Multi-minority Oversampling with Collaborative Feature Selection for Network Intrusion Detection System

被引:0
作者
Yeshalem Gezahegn Damtew
Hongmei Chen
机构
[1] Southwest Jiaotong University,School of Computing and Artificial Intelligence
[2] Debre Berhan University,College of Computing Science
来源
International Journal of Computational Intelligence Systems | / 16卷
关键词
Multi-class balancing; Multi-minority over-sampling; Feature selection; Machine learning; Network intrusion detection system;
D O I
暂无
中图分类号
学科分类号
摘要
Researchers publish various studies to improve the performance of network intrusion detection systems. However, there is still a high false alarm rate and missing intrusions due to class imbalance in the multi-class dataset. This imbalanced distribution of classes results in low detection accuracy for the minority classes. This paper proposes a Synthetic Multi-minority Oversampling (SMMO) framework by integrating with a collaborative feature selection (CoFS) approach in network intrusion detection systems. Our framework aims to increase the detection accuracy of the extreme minority classes (i.e., user-to-root and remote-to-local attacks) by improving the dataset’s class distribution and selecting relevant features. In our framework, SMMO generates synthetic data and iteratively over-samples multi-minority classes. And the collaboration of correlation-based feature selection with an evolutionary algorithm selects essential features. We evaluate our framework with a random forest, J48, BayesNet, and AdaBoostM1. In a multi-class NSL-KDD dataset, the experimental results show that the proposed framework significantly improves the detection accuracy of the extreme minority classes compared with other approaches.
引用
收藏
相关论文
共 96 条
[1]  
Abd Rahman R(2016)Evolutionary algorithm with roulette-tournament selection for solving aquaculture diet formulation Math. Probl. Eng. Math. Probl. Eng. 2016 1-10
[2]  
Ramli R(2018)A hybrid intrusion detection system: Integrating hybrid feature selection approach with heterogeneous ensemble of intelligent classifiers Int. J. Netw. Secur. 20 41-55
[3]  
Jamari Z(2020)A review of intrusion detection systems using machine and deep learning in internet of things: Challenges, solutions and future directions Electron 9 1177-1176
[4]  
Amrita KKR(2016)A survey of data mining and machine learning methods for cyber security intrusion detection IEEE Commun. Surv. Tuts 18 1153-163
[5]  
Asharf J(2012)Balancing distribution of intrusion detection data using sample selection J. Inf. Secur. Res. 3 153-357
[6]  
Moustafa N(2002)Smote: synthetic minority over-sampling technique J. Artif. Intell. Res. 16 321-1642
[7]  
Khurshid H(2010)Ramoboost: ranked minority oversampling in boosting IEEE Trans. Neural Netw. 21 1624-30
[8]  
Buczak AL(2006)Statistical comparisons of classifiers over multiple data sets J. ML Res. 7 1-12
[9]  
Guven E(2021)Dgm: a data generative model to improve minority class presence in anomaly detection domain Neural Comput. Appl. 2021 1-162
[10]  
Chaïri I(2021)Gga: a modified genetic algorithm with gradient-based local search for solving constrained optimization problems Inf. Sci. 547 136-36