Incremental Clustering for Categorical Data Using Clustering Ensemble

被引:0
作者
Li Taoying [1 ]
Chne Yan [1 ]
Qu Lili [1 ]
Mu Xiangwei [1 ]
机构
[1] Dalian Maritime Univ, Transportat Management Coll, Dalian 116026, Peoples R China
来源
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE | 2010年
关键词
DataMining; Clustering; Incremental Clustering; Clustering Ensemble; K-MEANS ALGORITHM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
More and more data in practice is changing every minute and been collected in incremental mode, and incremental clustering has attracted much of researchers' attention. However, little research now focuses on partitioning categorical data in incremental mode. How to design incremental clustering for categorical data is an urgent problem. We propose an incremental clustering for categorical data using clustering ensemble in this paper. We firstly prune redundant attributes if needed, and then make use of true values of different attributes to form clustering memberships, and next use clustering ensemble to merge or divide clusters to gain optimal clustering. Finally, the proposed algorithm is applied in Yellow- Small dataset, Diagnosis dataset and Zoo dataset and results show that it is effective.
引用
收藏
页码:2519 / 2524
页数:6
相关论文
共 50 条
  • [1] Clustering Categorical Data:A Cluster Ensemble Approach
    何增友
    High Technology Letters, 2003, (04) : 8 - 12
  • [2] An Incremental Clustering with Attribute Unbalance Considered for Categorical Data
    Chen, Jize
    Yang, Zhimin
    Yin, Jian
    Yang, Xiaobo
    Huang, Li
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 : 433 - +
  • [3] Fuzzy Clustering Ensemble Algorithm for Partitioning Categorical Data
    Li, Taoying
    Chen, Yan
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 170 - 174
  • [4] A Link-Based Cluster Ensemble Approach for Categorical Data Clustering
    Iam-On, Natthakan
    Boongoen, Tossapon
    Garrett, Simon
    Price, Chris
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (03) : 413 - 425
  • [5] Clustering categorical data streams
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    Huang, Joshua Zhexue
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2011, 11 (04) : 185 - 192
  • [6] Weighted Delta Factor Cluster Ensemble Algorithm for Categorical Data Clustering in Data Mining
    Sengottaian, Sarumathi
    Natesan, Shanthi
    Mathivanan, Sharmila
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (03) : 275 - 284
  • [7] Rough subspace-based clustering ensemble for categorical data
    Gao, Can
    Pedrycz, Witold
    Miao, Duoqian
    SOFT COMPUTING, 2013, 17 (09) : 1643 - 1658
  • [8] Rough subspace-based clustering ensemble for categorical data
    Can Gao
    Witold Pedrycz
    Duoqian Miao
    Soft Computing, 2013, 17 : 1643 - 1658
  • [9] Space Structure and Clustering of Categorical Data
    Qian, Yuhua
    Li, Feijiang
    Liang, Jiye
    Liu, Bing
    Dang, Chuangyin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (10) : 2047 - 2059
  • [10] Clustering Categorical Data Based on Representatives
    Aranganayagi, S.
    Thangavel, K.
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 599 - +