A Hybrid Intrusion Detection System Based on Feature Selection and Weighted Stacking Classifier

被引:26
作者
Zhao, Ruizhe [1 ]
Mu, Yingxue [2 ,3 ]
Zou, Long [1 ]
Wen, Xiumei [2 ,3 ]
机构
[1] Hebei Univ Architecture, Dept Informat Engn, Zhangjiakou 075000, Peoples R China
[2] Hebei Univ Architecture, Dept Informat Management, Zhangjiakou 075000, Peoples R China
[3] Big Data Technol Innovat Ctr Zhangjiakou, Zhangjiakou 075000, Peoples R China
关键词
Classification algorithms; Feature extraction; Stacking; Intrusion detection; Approximation algorithms; Sociology; Correlation; Intrusion detection system; feature selection; weighted Stacking; CFS-DE; cyber security; ENSEMBLE; OPTIMIZATION; INTERNET; MODEL;
D O I
10.1109/ACCESS.2022.3186975
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyber-attacks occur more frequently with the rapid growth in the Internet. Intrusion detection systems (IDS) have become an important part of protecting system security. There are still some challenges preventing IDS from further improving its classification performance. Firstly, the complexity of high-dimensional features challenges the speed and the performance of the classification for IDS. Secondly, the classification performance of traditional Stacking algorithm can be easily affected by the base classifiers. Tackling both challenges above, we propose a hybrid intrusion detection system based on a CFS-DE feature selection algorithm and a weighted Stacking classification algorithm. To limit the dimension of the features, we deployed the CFS-DE algorithm, which searches for the optimal feature subset. Afterwards, a weighted Stacking algorithm is proposed, which increases the weights of the base classifiers with good training results and drops those base classifiers with bad ones to improve the classification performance. As such, the model enhances the classification efficiency and yielding better accuracy. All experiments in this study were conducted on the NSL-KDD and CSE-CIC-IDS2018 data sets. The results based on KDDTest+ show that our proposed model has accuracy of 87.44%, precision of 89.09%, recall of 87.44% and F1-score of 88.25%. The results based on CSE-CIC-IDS2018 show that our proposed model has accuracy of 99.87%, precision of 99.88%, recall of 99.87% and F1-score of 99.88%. Compared with traditional machine learning models and models mentioned in other papers, out proposed CFS-DE-weighted-Stacking IDS has the best classification performance.
引用
收藏
页码:71414 / 71426
页数:13
相关论文
共 54 条
[1]   A survey of intrusion detection systems based on ensemble and hybrid classifiers [J].
Aburomman, Abdulla Amin ;
Reaz, Mamun Bin Ibne .
COMPUTERS & SECURITY, 2017, 65 :135-152
[2]  
Abusitta A, 2018, 2018 21ST CONFERENCE ON INNOVATION IN CLOUDS, INTERNET AND NETWORKS AND WORKSHOPS (ICIN)
[3]   Network intrusion detection system: A systematic study of machine learning and deep learning approaches [J].
Ahmad, Zeeshan ;
Shahid Khan, Adnan ;
Wai Shiang, Cheah ;
Abdullah, Johari ;
Ahmad, Farhan .
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (01)
[4]   A Survey of Random Forest Based Methods for Intrusion Detection Systems [J].
Alves Resende, Paulo Angelo ;
Drummond, Andre Costa .
ACM COMPUTING SURVEYS, 2018, 51 (03)
[5]   Building an Intrusion Detection System Using a Filter-Based Feature Selection Algorithm [J].
Ambusaidi, Mohammed A. ;
He, Xiangjian ;
Nanda, Priyadarsi ;
Tan, Zhiyuan .
IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (10) :2986-2998
[6]  
[Anonymous], 2003, Correlation-based feature selection for machine learning
[7]  
Catillo Marta, 2020, Web, Artificial Intelligence and Network Applications. Proceedings of the Workshops of the 34th International Conference on Advanced Information Networking and Applications (WAINA-2020). Advances in Intelligent Systems and Computing (AISC 1150), P687, DOI 10.1007/978-3-030-44038-1_63
[8]  
Chowdhury R., 2021, DATA DRIVEN APPROACH, P337
[9]   Inter-dataset generalization strength of supervised machine learning methods for intrusion detection [J].
D'hooge, Laurens ;
Wauters, Tim ;
Volckaert, Bruno ;
De Turck, Filip .
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2020, 54
[10]   A survey on ensemble learning [J].
Dong, Xibin ;
Yu, Zhiwen ;
Cao, Wenming ;
Shi, Yifan ;
Ma, Qianli .
FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (02) :241-258