Reducing Data Complexity in Feature Extraction and Feature Selection for Big Data Security Analytics

被引:9
作者
Sisiaridis, Dimitrios [1 ]
Markowitch, Olivier [1 ]
机构
[1] Univ Libre Bruxelles, QualSec Grp, Dept Informat, Brussels, Belgium
来源
2018 1ST INTERNATIONAL CONFERENCE ON DATA INTELLIGENCE AND SECURITY (ICDIS 2018) | 2018年
关键词
feature selection; feature extraction; machine learning; artificial intelligence;
D O I
10.1109/ICDIS.2018.00014
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Feature extraction and feature selection are the first tasks in pre-processing of input logs in order to detect cybersecurity threats and attacks by utilizing data mining techniques in the field of Artificial Intelligence. When it comes to the analysis of heterogeneous data derived from different sources, these tasks are found to be time-consuming and difficult to be managed efficiently. In this paper, we present an approach for handling feature extraction and feature selection utilizing machine learning algorithms for security analytics of heterogeneous data derived from different network sensors. The approach is implemented in Apache Spark, using its python API, named pyspark.
引用
收藏
页码:43 / 48
页数:6
相关论文
共 16 条
[1]  
Aggarwal CC, 2014, CH CRC DATA MIN KNOW, P1
[2]  
[Anonymous], INT J INFORM SECURIT
[3]  
[Anonymous], 2016, ICML ANOMALY DETECTI
[4]  
[Anonymous], 2009, NATURAL LANGUAGE PRO, DOI DOI 10.1007/S10579-010-9124-X
[5]   Unsupervised Network Intrusion Detection Systems: Detecting the Unknown without Knowledge [J].
Casas, Pedro ;
Mazel, Johan ;
Owezarski, Philippe .
COMPUTER COMMUNICATIONS, 2012, 35 (07) :772-783
[6]  
Danilov D., 1997, Principal components of time series: the caterpillarmethod, P1
[7]  
Inukollu V.N., 2014, International Journal of Network Security its Applications, V6, P45, DOI [10.5121/ijnsa.2014.6304, DOI 10.5121/IJNSA.2014.6304]
[8]  
Johnson T, 2014, IEEE GLOB COMM CONF, P544, DOI 10.1109/GLOCOM.2014.7036864
[9]  
Kasiviswanathan S.P., 2011, CIKM, P745
[10]  
Lange M., 2016, INT J NETWORK SECURI