Clustering ellipses for anomaly detection

被引:74
作者
Moshtaghi, Masud [1 ]
Havens, Timothy C. [2 ]
Bezdek, James C. [1 ,2 ]
Park, Laurence [3 ]
Leckie, Christopher [1 ]
Rajasegarar, Sutharshan [4 ]
Keller, James M. [2 ]
Palaniswami, Marimuthu [4 ]
机构
[1] Univ Melbourne, Dept Comp Sci & Software Engn, Melbourne, Vic, Australia
[2] Univ Missouri, Dept Elect & Comp Engn, Columbia, MO 65211 USA
[3] Univ Western Sydney, Sch Comp & Math, Penrith, NSW 1797, Australia
[4] Univ Melbourne, Dept Elect & Elect Engn, Melbourne, Vic, Australia
基金
美国国家科学基金会; 美国国家航空航天局; 美国国家卫生研究院;
关键词
Cluster analysis; Elliptical anomalies in wireless sensor networks; Reordered dissimilarity images; Similarity of ellipsoids; Single linkage clustering; Visual assessment; VISUAL ASSESSMENT; TENDENCY;
D O I
10.1016/j.patcog.2010.07.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Comparing, clustering and merging ellipsoids are problems that arise in various applications, e.g., anomaly detection in wireless sensor networks and motif-based patterned fabrics. We develop a theory underlying three measures of similarity that can be used to find groups of similar ellipsoids in p-space. Clusters of ellipsoids are suggested by dark blocks along the diagonal of a reordered dissimilarity image (RDI). The RDI is built with the recursive iVAT algorithm using any of the three (dis) similarity measures as input and performs two functions: (i) it is used to visually assess and estimate the number of possible clusters in the data; and (ii) it offers a means for comparing the three similarity measures. Finally, we apply the single linkage and CLODD clustering algorithms to three two-dimensional data sets using each of the three dissimilarity matrices as input. Two data sets are synthetic, and the third is a set of real WSN data that has one known second order node anomaly. We conclude that focal distance is the best measure of elliptical similarity, iVAT images are a reliable basis for estimating cluster structures in sets of ellipsoids, and single linkage can successfully extract the indicated clusters. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:55 / 69
页数:15
相关论文
共 37 条
[1]  
[Anonymous], 2000, MPS SIAM SERIES OPTI
[2]  
[Anonymous], 2007, Applied multivariate statistical analysis, sixth edition M
[3]  
[Anonymous], 2009, Clustering
[4]  
Bezdek J., 1999, FUZZY MODELS ALGORIT
[5]   Visual assessment of clustering tendency for rectangular dissimilarity matrices [J].
Bezdek, James C. ;
Hathaway, Richard J. ;
Huband, Jacalyn M. .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2007, 15 (05) :890-903
[6]  
BORG I., 1987, Multidimensional Similarity Structure Analysis
[7]   A NOTE ON CORRELATION CLUSTERS AND CLUSTER SEARCH METHODS [J].
Cattell, Raymond .
PSYCHOMETRIKA, 1944, 9 (03) :169-184
[8]  
DICKERSON JA, 1993, P IEEE INT C NEUR NE, P1162
[9]  
Duda R. O., 1973, Pattern Classification and Scene Analysis, V3
[10]  
EVERITT B.S., 1978, GRAPHICAL TECHNIQUES