Improving AdaBoost-based Intrusion Detection System (IDS) Performance on CIC IDS 2017 Dataset

Cited by: 121
Authors
Yulianto, Arif [1]
Sukarno, Parman [1]
Suwastika, Novian Anggis [1]
Affiliations
[1] Telkom Univ, Sch Comp, Bandung, West Java, Indonesia
Source
2ND INTERNATIONAL CONFERENCE ON DATA AND INFORMATION SCIENCE | 2019 / Vol. 1192
Keywords
DOI
10.1088/1742-6596/1192/1/012018
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
This paper considers the use of the Synthetic Minority Oversampling Technique (SMOTE), Principal Component Analysis (PCA), and Ensemble Feature Selection (EFS) to improve the performance of an AdaBoost-based Intrusion Detection System (IDS) on the recent and challenging CIC IDS 2017 dataset [1]. Previous research [1] proposed an AdaBoost classifier for the new dataset. However, due to several problems, such as imbalanced training data and inappropriate selection of classification methods, its performance remains inferior. In this research, we aim to construct an intrusion detection approach with improved performance. SMOTE is selected to handle the imbalance of the training data. Moreover, PCA and EFS are applied as feature selection methods to select important attributes from the new dataset. The evaluation results show that the proposed AdaBoost classifier using PCA and SMOTE yields an Area Under the Receiver Operating Characteristic curve (AUROC) of 92%, and the AdaBoost classifier using EFS and SMOTE produces an accuracy, precision, recall, and F1 score of 81.83%, 81.83%, 100%, and 90.01%, respectively.
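As a quick consistency check on the figures reported in the abstract, the sketch below recomputes the F1 score from the stated precision and recall. It assumes the standard definition of F1 as the harmonic mean of precision and recall; the function name `f1_score` is illustrative and not taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract for AdaBoost with EFS and SMOTE.
precision = 0.8183
recall = 1.0

f1 = f1_score(precision, recall)
print(f"F1 = {f1:.2%}")  # prints "F1 = 90.01%"
```

Under that definition, the reported precision and recall reproduce the reported F1 score of 90.01%.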
Pages: 9
Related papers
23 in total
[1]  
Aburomman AA, 2016, 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL, ELECTRONIC AND SYSTEMS ENGINEERING (ICAEES), P95, DOI 10.1109/ICAEES.2016.7888016
[2]  
Amudha P, 2015, ScientificWorldJournal, V2015, P574589, DOI 10.1155/2015/574589
[3]  
[Anonymous], 2006, PAPER PRESENTED AT T
[4]  
[Anonymous], 2009, BUSINESS INTELLIGENC, P1, DOI 10.1002/9780470753866.ch1
[5]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[6]  
Gharsellaoui AE, 2016, ADV SAT MULTMED SYS
[7]  
Gorunescu F, 2011, INTEL SYST REF LIBR, P1, DOI 10.1007/978-3-642-19721-5
[8]  
Li KW, 2017, 2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), P30, DOI 10.1109/ICBDA.2017.8078849
[9]   Intrusion Detection using Naive Bayes Classifier with Feature Reduction [J].
Mukherjee, Saurabh ;
Sharma, Neelam .
2ND INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION, CONTROL AND INFORMATION TECHNOLOGY (C3IT-2012), 2012, 4 :119-128
[10]   EFS: an ensemble feature selection tool implemented as R-package and web-application [J].
Neumann, Ursula ;
Genze, Nikita ;
Heider, Dominik .
BIODATA MINING, 2017, 10