Correlation-based feature selection for intrusion detection design

被引:0
作者
Chou, Te-Shun [1 ]
Yen, Kang K. [1 ]
Luo, Jun [1 ]
Pissinou, Niki [2 ]
Makki, Kia [2 ]
机构
[1] Florida Int Univ, Dept Elect & Comp Engn, Miami, FL 33199 USA
[2] Florida Int Univ, Telecommun & Informat Technol Inst, Miami, FL USA
来源
2007 IEEE MILITARY COMMUNICATIONS CONFERENCE, VOLS 1-8 | 2007年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In a large amount of monitoring network traffic data, not every feature of the data is relevant to the intrusion detection task. In this paper, we aim to reduce the dimensionality of the original feature space by removing irrelevant and redundant features. A correlation-based feature selection algorithm is proposed for selecting a subset of most informative features. Six data sets retrieved from UCI databases and an intrusion detection benchmark data set, DARPA KDD99, are used to train and to test C4.5 and naive bayes machine learning algorithms. We compare our proposed approach with two correlation-based feature selection algorithms, CFS and FCBF and the results indicate that our approach achieves the highest averaged accuracies. Our feature selection algorithm could effectively reduce the size of data set.
引用
收藏
页码:2300 / +
页数:2
相关论文
共 17 条
  • [1] ALMUALLIM H, 1991, PROCEEDINGS : NINTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, P547
  • [2] [Anonymous], 1997, Machine Learning
  • [3] BASHAH N, 2005, T ENG COMPUTING TECH, V6, P291
  • [4] BIESIADA J, 2005, P 4 INT C COMP REC S
  • [5] BLACKE CL, 1998, UCI REPOSITORY MACH
  • [6] Faraoun K.M., 2007, INT J COMPUT INTELL, V3, P161
  • [7] Research on collaborative negotiation for e-commerce.
    Feng, YQ
    Lei, Y
    Li, Y
    Cao, RZ
    [J]. 2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2085 - 2088
  • [8] Guyon I, 2003, J MACH LEARN RES, P1157, DOI [10.1016/j.aca.2011.07.027, DOI 10.1016/J.ACA.2011.07.027]
  • [9] Hall MA, 1998, Correlation-based feature subset selection for machine learning
  • [10] John GH, 1994, P 11 INT C MACH LEAR, P121, DOI 10.1016/B978-1-55860-335-6.50023-4