Attribute weights-based clustering centres algorithm for initialising K-modes clustering

被引:5
|
作者
Peng, Liwen [1 ]
Liu, Yongguo [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Knowledge & Data Engn Lab Chinese Med, Chengdu, Peoples R China
关键词
Clustering centers; Weight; Density; Distance;
D O I
10.1007/s10586-018-1889-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The K-modes algorithm based on partitional clustering technology is a very popular and effective clustering method; moreover, it handles categorical data. However, the performance of the K-modes method is largely affected by the initial clustering centres. Random selection of the initial clustering centres commonly leads to non-repeatable clustering result. Hence, suitable choice of the initial clustering centres is crucial to realizing high-performance K-modes clustering. The present article develops an initialisation algorithm for K-modes. At initialisation, the distance between two instances calculated after weighting the attributes of the instances. Many studies have shown that if clustering is based only on distances or density between the instances, the clustering revolves around one centre or the outliers. Therefore, based on the attribute weights, we combine the distance and density measures to select the clustering centres. In experiments on several UCI machine learning repository benchmark datasets, the new initialisation method outperformed the existing K-modes clustering methods.
引用
收藏
页码:S6171 / S6179
页数:9
相关论文
共 50 条
  • [1] Attribute weights-based clustering centres algorithm for initialising K-modes clustering
    Liwen Peng
    Yongguo Liu
    Cluster Computing, 2019, 22 : 6171 - 6179
  • [2] Attribute value weighting in k-modes clustering
    He, Zengyou
    Xu, Xiaofei
    Deng, Shengchun
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (12) : 15365 - 15369
  • [3] K-modes clustering
    Chaturvedi, A
    Green, PE
    Carroll, JD
    JOURNAL OF CLASSIFICATION, 2001, 18 (01) : 35 - 55
  • [4] K-modes Clustering
    Anil Chaturvedi
    Paul E. Green
    J. Douglas Caroll
    Journal of Classification, 2001, 18 : 35 - 55
  • [5] A dissimilarity measure for the k-Modes clustering algorithm
    Cao, Fuyuan
    Liang, Jiye
    Li, Deyu
    Bai, Liang
    Dang, Chuangyin
    KNOWLEDGE-BASED SYSTEMS, 2012, 26 : 120 - 127
  • [6] CLEKMODES: a modified k-modes clustering algorithm
    Mastrogiannis, N.
    Giannikos, I.
    Boutsinas, B.
    Antzoulatos, G.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2009, 60 (08) : 1085 - 1095
  • [7] Block Fuzzy K-modes Clustering Algorithm
    Yang, Miin-Shen
    Lin, Chih-Ying
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 384 - 389
  • [8] K-Modes clustering algorithm based on a new distance measure
    Liang, Jiye
    Bai, Liang
    Cao, Fuyuan
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2010, 47 (10): : 1749 - 1755
  • [9] DP- k-modes: A self-tuning k-modes clustering algorithm
    Xie, Juanying
    Wang, Mingzhao
    Lu, Xiaoxiao
    Liu, Xinglin
    Grant, Philip W.
    PATTERN RECOGNITION LETTERS, 2022, 158 : 117 - 124
  • [10] A note on K-modes clustering
    Huang, ZX
    Ng, MK
    JOURNAL OF CLASSIFICATION, 2003, 20 (02) : 257 - 261