Identifying and Benchmarking Key Features for Cyber Intrusion Detection: An Ensemble Approach

被引:71
作者
Binbusayyis, Adel [1 ]
Vaiyapuri, Thavavel [1 ]
机构
[1] Prince Sattam Bin Abdulaziz Univ, Coll Comp Sci & Engn, Al Kharj 11942, Saudi Arabia
关键词
Anomaly intrusion detection; correlation; consistency; data analytic lifecycle; diversity measure; ensemble learning; feature selection; information gain; ReliefF; stability measure; FEATURE-SELECTION; DETECTION SYSTEM; ALGORITHM;
D O I
10.1109/ACCESS.2019.2929487
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In today's interconnected era, intrusion detection system (IDS) has the potential to be the frontier of defense against cyberattacks and plays an essential role in achieving security of networking resources and infrastructures. The performance of IDS depends highly on data features. Selecting the most informative features eliminating the redundant and irrelevant features from network traffic data for IDS is still an open research issue. The key impetus of this paper is to identify and benchmark the potential set of features that can characterize network traffic for intrusion detection. In this correspondence, an ensemble approach is proposed. As a first step, the approach applies four different feature evaluation measures, such as correlation, consistency, information, and distance, to select the more crucial features for intrusion detection. Second, it applies the subset combination strategy to merge the output of the four measures and achieve the potential feature set. Along with this, a new framework that adopts the data analytic lifecycle practices is explored to employ the proposed ensemble for building an effective IDS. The effectiveness of the proposed approach is demonstrated by conducting several experiments on four intrusion detection evaluation datasets, namely KDDCup' 99, NSL-KDD, UNSW-NB15, and CICIDS2017. The obtained results prove that the proposed approach contributes more potential features compared to the state-of-the-art approaches, leading to achieve a promising performance gain in the detection rate of 3.2%, the false alarm rate of 38%, and the detection time of 12%. Furthermore, ROC and statistical significance are analyzed for the identified feature subset to strongly conform its acceptability as a future benchmark for building an effective IDS.
引用
收藏
页码:106495 / 106513
页数:19
相关论文
共 63 条
[41]   Enhanced Network Anomaly Detection Based on Deep Neural Networks [J].
Naseer, Sheraz ;
Saleem, Yasir ;
Khalid, Shehzad ;
Bashir, Muhammad Khawar ;
Han, Jihun ;
Iqbal, Muhammad Munwar ;
Han, Kijun .
IEEE ACCESS, 2018, 6 :48231-48246
[42]   Ensemble-based multi-filter feature selection method for DDoS detection in cloud computing [J].
Osanaiye, Opeyemi ;
Cai, Haibin ;
Choo, Kim-Kwang Raymond ;
Dehghantanha, Ali ;
Xu, Zheng ;
Dlodlo, Mqhele .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2016,
[43]   Measuring Diversity and Accuracy in ANN Ensembles [J].
Paz Sesmero, M. ;
Manuel Alonso-Weber, Juan ;
Giuliani, Alessandro ;
Armano, Giuliano ;
Sanchis, Araceli .
ADVANCES IN ARTIFICIAL INTELLIGENCE, CAEPIA 2018, 2018, 11160 :108-117
[44]  
Quinlan J. R., 1986, Machine Learning, V1, P81, DOI 10.1023/A:1022643204877
[45]  
Sadiku M.N. O., 2019, Emerging Internet-Based Technologies
[46]  
Sarstedt M., 2019, Concise guide to market research, V1, P151, DOI DOI 10.1007/978-3-662-56707-46
[47]   On developing an automatic threshold applied to feature selection ensembles [J].
Seijo-Pardo, B. ;
Bolon-Canedo, V ;
Alonso-Betanzos, A. .
INFORMATION FUSION, 2019, 45 :227-245
[48]   Firefly algorithm based feature selection for network intrusion detection [J].
Selvakumar, B. ;
Muneeswaran, K. .
COMPUTERS & SECURITY, 2019, 81 :148-155
[49]   Intelligent temporal classification and fuzzy rough set-based feature selection algorithm for intrusion detection system in WSNs [J].
Selvakumar, K. ;
Karuppiah, Marimuthu ;
SaiRamesh, L. ;
Islam, S. K. Hafizul ;
Hassan, Mohammad Mehedi ;
Fortino, Giancarlo ;
Choo, Kim-Kwang Raymond .
INFORMATION SCIENCES, 2019, 497 :77-90
[50]   Toward Generating a New Intrusion Detection Dataset and Intrusion Traffic Characterization [J].
Sharafaldin, Iman ;
Lashkari, Arash Habibi ;
Ghorbani, Ali A. .
ICISSP: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2018, :108-116