A study on the feature selection of network traffic for intrusion detection purpose

被引:0
作者
Ma, Wanli [1 ]
Tran, Dat [1 ]
Sharma, Dharmendra [1 ]
机构
[1] Univ Canberra, Fac Informat Sci & Engn, Canberra, ACT 2601, Australia
来源
ISI 2008: 2008 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS | 2008年
关键词
intrusion detection; clustering methods; feature extraction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The 3 most important issues for anomaly detection based intrusion detection systems by using data mining methods are: feature selection, data value normalization, and the choice of data mining algorithms. In this paper, we study primarily the feature selection of network traffic and its impact on the detection rates. We use KDD CUP 1999 dataset as the sample for the study. We group the features of the dataset into 4 groups: Group I contains the basic network traffic features; Group 11 is actually not network traffic related, but the features collected from hosts; Group III and IV are temporally aggregated features. In this paper, we demonstrate the different detection rates of choosing the different combinations of these groups. We also demonstrate the effectiveness and the ineffectiveness in finding anomalies by looking at the network data alone. In addition, we also briefly investigate the effectiveness of data normalization. To validate our findings, we conducted the same experiments with 3 different clustering algorithms - K-means clustering, fuzzy C means clustering (FCM), and fuzz), entropy clustering (FE).
引用
收藏
页码:245 / +
页数:2
相关论文
共 11 条
  • [1] CARUSO C, 2004, 5 INT C DAT MIN TEXT
  • [2] Chan P.K., 2003, A machine learning approach to anomaly detection
  • [3] DARPA, 1999, DARPA INTR DET EV DA
  • [4] LI X, 2004, 2 S INT SEC INF TUCS
  • [5] MCHUGH J, 2000, RECENT ADV INTRUSION
  • [6] PORTNOY L, 2001, P ACM CCS WORKSH DAT
  • [7] Stolfo S., 2000, P 2000 DARPA INF SUR
  • [8] Taylor C., 2001, P NEW SEC PAR WORKSH
  • [9] Taylor C., 2002, 10 NEW SEC PAR WORKS
  • [10] WANG Q, 2005, SPIE C DAT MIN INTR