Circluster: Storing Cluster Shapes for Clustering

被引:0
作者
Shirali-Shahreza, Sajad [1 ]
Yeganeh, Soheil Hassas [1 ]
Abolhassani, Hassan [1 ]
Habibi, Jafar [1 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, Tehran, Iran
来源
2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2 | 2008年
关键词
Circluster; Cluster Shape; Clustering; Data Mining;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the important problems in knowledge discovery from data is clustering. Clustering is the problem of partitioning a set of data using unsupervised techniques. An important characteristic of a clustering technique is the shape of the cluster it can find. Clustering methods which are capable to find simple cluster shapes are usually fast but inaccurate for complex data sets. Ones capable to find complex cluster shapes are usually not fast but accurate. In this paper, we propose a simple clustering technique named circlusters. Circlusters are circles partitioned into different radius sectors. Circlusters can be used to create hybrid approaches with density based or partitioning based methods. We also propose a naive clustering method that is capable to find complex clusters in O(n). This method operates in two phases. In the first phase, circlusters are created to approximate the shape of the data set. In the second phase, connected circlusters are found to form the final clusters.
引用
收藏
页码:477 / 482
页数:6
相关论文
共 10 条
  • [1] Ankerst M., 1999, SIGMOD Record, V28, P49, DOI 10.1145/304181.304187
  • [2] Bradley P. S., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P9
  • [3] K-modes clustering
    Chaturvedi, A
    Green, PE
    Carroll, JD
    [J]. JOURNAL OF CLASSIFICATION, 2001, 18 (01) : 35 - 55
  • [4] Ester M., 1996, Proc. Second Int. Conf. Knowl. Discov. Data Min, P226, DOI DOI 10.5555/3001460.3001507
  • [5] Han J., 2006, DATA MINING CONCEPTS
  • [6] Hinneburg A., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P58
  • [7] Extensions to the k-means algorithm for clustering large data sets with categorical values
    Huang, ZX
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (03) : 283 - 304
  • [8] Kaufman L., 1990, Finding Groups in Data: An Introduction to Cluster Analysis, DOI DOI 10.1002/9780470316801
  • [9] MacQueen J., 1967, Proc fifth Berkeley Symp Math Stat Probab, V1, P281
  • [10] Ng R.T., 1994, P 20 VLDB C SANT CHI, P144