Conversion methods for symbolic features: A comparison applied to an intrusion detection problem

被引:21
作者
Hernandez-Pereira, E. [1 ]
Suarez-Romero, J. A. [1 ]
Fontenla-Romero, O. [1 ]
Alonso-Betanzos, A. [1 ]
机构
[1] Univ A Coruna, Dept Comp Sci, Lab Res & Dev Artificial Intelligence, La Coruna 15071, Spain
关键词
Symbolic features; Intrusion detection systems; Machine learning; Anomaly detection;
D O I
10.1016/j.eswa.2009.02.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of any Intrusion Detection System (IDS) lies in the selection of a set of significant features, that can be quantitative or qualitative, taken out from a network traffic data stream. The machine learning methods provide potential solutions for the IDS problem. However, most of these methods used for classification are not able to handle symbolic attributes directly. In this paper, three methods for symbolic features conversion - indicator variables, conditional probabilities and the Separability Split Value method - are contrasted with the arbitrary conversion method, all of them applied to an intrusion detection problem, the KDD Cup 99 data set. In particular, three classification methods were subsequently applied to the dataset: a one-layer feedforward neural network, a support vector machine and a multilayer feedforward neural network. The results obtained demonstrate that the three conversion methods improve the prediction ability of the classifiers utilized, with respect to the arbitrary and commonly used assignment of numerical values. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:10612 / 10617
页数:6
相关论文
共 20 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]  
ALONSOBETANZOS A, 2007, P 15 EUR S ART NEUR
[3]  
[Anonymous], P KDD 2001 KNOWL DIS
[4]  
[Anonymous], P 5 C NEUR NETW THEI
[5]  
[Anonymous], 1999, KDD CUP 1999 DAT
[6]  
Bishop Christopher M, 1995, Neural networks for pattern recognition
[7]   A global optimum approach for one-layer neural networks [J].
Castillo, E ;
Fontenla-Romero, O ;
Guijarro-Berdiñas, B ;
Alonso-Betanzos, A .
NEURAL COMPUTATION, 2002, 14 (06) :1429-1449
[8]  
CLARK MC, 1994, INT C PATT RECOG, P245, DOI 10.1109/ICPR.1994.576912
[9]  
DUCH W, 2000, 5 C NEUR NETW SOFT C, P180
[10]  
ESKIN E, 2002, AGEOMETRIC FRAMEWORK