Intrusion Detection Using Big Data and Deep Learning Techniques

被引:118
作者
Faker, Osama [1 ]
Dogdu, Erdogan [1 ,2 ]
机构
[1] Cankaya Univ, Ankara, Turkey
[2] Georgia State Univ, Atlanta, GA 30303 USA
来源
PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019) | 2019年
关键词
Intrusion detection system; big data; machine learning; artificial neural networks; deep learning; ensemble techniques; feature selection; NETWORK; ALGORITHM; SYSTEM;
D O I
10.1145/3299815.3314439
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, Big Data and Deep Learning Techniques are integrated to improve the performance of intrusion detection systems. Three classifiers are used to classify network traffic datasets, and these are Deep Feed-Forward Neural Network (DNN) and two ensemble techniques, Random Forest and Gradient Boosting Tree (GBT). To select the most relevant attributes from the datasets, we use a homogeneity metric to evaluate features. Two recently published datasets UNSW NB15 and CICIDS2017 are used to evaluate the proposed method. 5-fold cross validation is used in this work to evaluate the machine learning models. We implemented the method using the distributed computing environment Apache Spark, integrated with Keras Deep Learning Library to implement the deep learning technique while the ensemble techniques are implemented using Apache Spark Machine Learning Library. The results show a high accuracy with DNN for binary and multiclass classification on UNSW NB15 dataset with accuracies at 99.16% for binary classification and 97.01% for multiclass classification. While GBT classifier achieved the best accuracy for binary classification with the CICIDS2017 dataset at 99.99%, for multiclass classification DNN has the highest accuracy with 99.56%.
引用
收藏
页码:86 / 93
页数:8
相关论文
共 36 条
[1]   Experimental Evaluation of a Multi-Layer Feed-Forward Artificial Neural Network Classifier for Network Intrusion Detection System [J].
Al-Zewairi, Malek ;
Almajali, Sufyan ;
Awajan, Arafat .
2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, :167-172
[2]  
[Anonymous], THESIS
[3]  
[Anonymous], EUR S ART NEUR NETW
[4]  
[Anonymous], 2011, 6 INT C
[5]  
[Anonymous], INFORM SECURITY J GL
[6]  
[Anonymous], 2018, SECUR PRIV
[7]  
[Anonymous], 2017, 2017 INT C DAT SOFTW
[8]  
[Anonymous], MACH LEARN MACH LEARN
[9]   Performance evaluation of intrusion detection based on machine learning using Apache Spark [J].
Belouch, Mustapha ;
El Hadaj, Salah ;
Idhammad, Mohamed .
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017), 2018, 127 :1-6
[10]  
Belouch M, 2017, INT J ADV COMPUT SC, V8, P389