A Top-Down Approach for Hierarchical Cluster Exploration by Visualization

被引:0
作者
Zhang, Ke-Bing [1 ]
Orgun, Mehmet A. [1 ]
Busch, Peter A. [1 ]
Nayak, Abhaya C. [1 ]
机构
[1] Macquarie Univ, Dept Comp, Sydney, NSW 2109, Australia
来源
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2010, PT I | 2010年 / 6440卷
关键词
Top-down data analysis; hierarchical cluster exploration; visualization;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the much increased capability of data collection and storage in the past decade, data miners have to deal with much larger datasets in knowledge discovery tasks. Very large observations may cause traditional clustering methods to break down and not be able to cope with such large volumes of data. To enable data miners effectively detect the hierarchical cluster structure of a very large dataset, we introduce a visualization technique HOV3 to plot the dataset into clear and meaningful subsets by using its statistical summaries. Therefore, data miners can focus on investigating a relatively smaller-sized subset and its nested clusters. In such a way, data miners can explore clusters of any subset and its offspring subsets in a top-down fashion. As a consequence, HOV3 provides data miners an effective method on the exploration of clusters in a hierarchy by visualization.
引用
收藏
页码:497 / 508
页数:12
相关论文
共 21 条
[1]  
Andritsos P, 2004, LECT NOTES COMPUT SC, V2992, P123
[2]  
[Anonymous], 2009, Clustering
[3]  
Berkhin P, 2006, GROUPING MULTIDIMENSIONAL DATA: RECENT ADVANCES IN CLUSTERING, P25
[4]   A hierarchical latent variable model for data visualization [J].
Bishop, CM ;
Tipping, ME .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (03) :281-293
[5]  
Chen C., 2007, HDB COMPUTATIONAL ST, VIII
[6]  
Eisen M., 2007, P ACM SIGMOD C 1998
[7]  
Guha S., 1998, CURE, P73, DOI DOI 10.1145/276305.276312
[8]  
Kandogan E., 2001, KDD-2001. Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P107, DOI 10.1145/502512.502530
[9]   Chameleon: Hierarchical clustering using dynamic modeling [J].
Karypis, G ;
Han, EH ;
Kumar, V .
COMPUTER, 1999, 32 (08) :68-+
[10]   Visual exploration of large relational data sets through 3D projections and footprint splatting [J].
Li, Y .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (06) :1460-1471