Incremental Clustering for Categorical Data Using Clustering Ensemble

被引:0
作者
Li Taoying [1 ]
Chne Yan [1 ]
Qu Lili [1 ]
Mu Xiangwei [1 ]
机构
[1] Dalian Maritime Univ, Transportat Management Coll, Dalian 116026, Peoples R China
来源
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE | 2010年
关键词
DataMining; Clustering; Incremental Clustering; Clustering Ensemble; K-MEANS ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
More and more data in practice is changing every minute and been collected in incremental mode, and incremental clustering has attracted much of researchers' attention. However, little research now focuses on partitioning categorical data in incremental mode. How to design incremental clustering for categorical data is an urgent problem. We propose an incremental clustering for categorical data using clustering ensemble in this paper. We firstly prune redundant attributes if needed, and then make use of true values of different attributes to form clustering memberships, and next use clustering ensemble to merge or divide clusters to gain optimal clustering. Finally, the proposed algorithm is applied in Yellow- Small dataset, Diagnosis dataset and Zoo dataset and results show that it is effective.
引用
收藏
页码:2519 / 2524
页数:6
相关论文
共 50 条
  • [31] A data labeling method for clustering categorical data
    Cao, Fuyuan
    Liang, Jiye
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 2381 - 2385
  • [32] Apply clustering to analyze categorical data in longitudinal studies
    Hassan, Mohammad Mahdi
    Blom, Martin
    Ansari, Gufran Ahmad
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (04): : 10 - 19
  • [33] Clustering Categorical Data Using a Swarm-based Method
    Izakian, Hesam
    Abraham, Ajith
    Snasel, Vaclav
    2009 WORLD CONGRESS ON NATURE & BIOLOGICALLY INSPIRED COMPUTING (NABIC 2009), 2009, : 1719 - +
  • [34] Clustering categorical data sets using tabu search techniques
    Ng, MK
    Wong, JC
    PATTERN RECOGNITION, 2002, 35 (12) : 2783 - 2790
  • [35] Categorical data clustering using tine combinations of attribute values
    Do, Hee-Jung
    Kim, Jae-Yearn
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 2, PROCEEDINGS, 2008, 5073 : 220 - 231
  • [36] <bold>Clustering Categorical Data using Silhouette Coefficient as a Relocating Measure</bold>
    Aranganayagi, S.
    Thangavel, K.
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL III, PROCEEDINGS, 2007, : 13 - +
  • [37] DYNAMIC CLUSTERING FOR TIME INCREMENTAL DATA
    CHAUDHURI, BB
    PATTERN RECOGNITION LETTERS, 1994, 15 (01) : 27 - 34
  • [38] Incremental Clustering for Hierarchical Clustering
    Narita, Kakeru
    Hochin, Teruhisa
    Nomiya, Hiroki
    2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 102 - 107
  • [39] A SCALABLE CLUSTERING METHOD FOR CATEGORICAL SEQUENCE DATA
    Oh, Seung-Joon
    Kim, Jae-Yearn
    INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2005, 2 (02) : 167 - 180
  • [40] Kernel Subspace Clustering Algorithm for Categorical Data
    Xu K.-P.
    Chen L.-F.
    Sun H.-J.
    Wang B.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (11): : 3492 - 3505