Research of Clustering Algorithm based on Information Entropy and Frequency Sensitive Discrepancy Metric in Anomaly Detection

被引:6
作者
Li, Han [1 ]
Wu, Qiuxin [1 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Appl Sci, Beijing 100000, Peoples R China
来源
PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CLOUD COMPUTING COMPANION (ISCC-C) | 2014年
关键词
k-menas; Clustering; Anomaly Detection; Information Entropy; Discrepancy Metric;
D O I
10.1109/ISCC-C.2013.108
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Anomaly detection is an active branch of intrusion detection technology which can detect intrusion behaviors including system or users' non-normal behavior and unauthorized use of computer resources. Clustering analysis is an unsupervised method to group data set into multiple clusters. Using clustering algorithm to detect anomaly behavior has good scalability and adaptability. This paper mainly focuses on improving k-means clustering algorithm, and uses it to detect the abnormal records. Our goal is to increase the DR value and decrease the FAR value in anomaly detection by calculating appropriate value of parameters and improve the clustering algorithm. In our IE&FSDM algorithm, we use network records' minimum standard information entropy to compute the initial cluster centers. In testing phase, discrepancy metric is introduced to help calculate exact number of clusters in testing data set. Using the results of initial cluster centers calculated in the pre-phase, IE&FSDM compute the actual clusters by converging cluster centers and obtains the actual cluster centers according to the frequency sensitive discrepancy metric. Then comply with the improved k-means algorithm, iterative calculate until divide all network data into corresponding clusters, and according to the results of cluster we can classify the normal and abnormal network behaviors. At last, we use KDD CUP1999 dataset to implement IE&FSDM algorithm. Test results show that comparing with previous clustering methods, IE&FSDM algorithm improve the detection rate of anomaly behavior and reduce the false alarm rate.
引用
收藏
页码:799 / 805
页数:7
相关论文
共 12 条
  • [1] Chang A., 2013, Computer Communication and Informatics (ICCCI), 2013 International Conference on, P1, DOI DOI 10.1080/21681724.2013.853370
  • [2] Anomaly Detection Based on Enhanced DBScan Algorithm
    Chen, Zhenguo
    Li, YongFei
    [J]. CEIS 2011, 2011, 15
  • [3] Denatious D.K., 2012, Computer Communication and Informatics (ICCCI), 2012 International Conference on, P1
  • [4] Intrusion and misuse detection in large-scale systems
    Erbacher, RF
    Walker, KL
    Frincke, DA
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2002, 22 (01) : 38 - 47
  • [5] k′-Means algorithms for clustering analysis with frequency sensitive discrepancy metrics
    Fang, Chonglun
    Jin, Wei
    Ma, Jinwen
    [J]. PATTERN RECOGNITION LETTERS, 2013, 34 (05) : 580 - 586
  • [6] Fang Chonglun, 2009, 2 INT C BIOM ENG INF, P17
  • [7] Fast global k-means clustering using cluster membership and inequality
    Lai, Jim Z. C.
    Huang, Tsung-Jen
    [J]. PATTERN RECOGNITION, 2010, 43 (05) : 1954 - 1963
  • [8] Li Han, 2010, Proceedings 2010 International Symposium on Intelligence Information Processing and Trusted Computing (IPTC 2010), P458, DOI 10.1109/IPTC.2010.94
  • [9] University of California, 2010, KDD CUP 1999 DAT
  • [10] Wang Xiangbin, 2012, Computer Measurement & Control, V20, P321