Finding reproducible cluster partitions for the k-means algorithm

被引:0
|
作者
Paulo JG Lisboa
Terence A Etchells
Ian H Jarman
Simon J Chambers
机构
[1] Liverpool John Moores University,School of Computing and Mathematical Sciences
来源
BMC Bioinformatics | / 14卷
关键词
Cluster Solution; Adjust Rand Index; Cluster Partition; Point Dataset; Dual Measure;
D O I
暂无
中图分类号
学科分类号
摘要
K-means clustering is widely used for exploratory data analysis. While its dependence on initialisation is well-known, it is common practice to assume that the partition with lowest sum-of-squares (SSQ) total i.e. within cluster variance, is both reproducible under repeated initialisations and also the closest that k-means can provide to true structure, when applied to synthetic data. We show that this is generally the case for small numbers of clusters, but for values of k that are still of theoretical and practical interest, similar values of SSQ can correspond to markedly different cluster partitions.
引用
收藏
相关论文
共 50 条
  • [1] Finding reproducible cluster partitions for the k-means algorithm
    Lisboa, Paulo J. G.
    Etchells, Terence A.
    Jarman, Ian H.
    Chambers, Simon J.
    BMC BIOINFORMATICS, 2013, 14
  • [2] A new algorithm for initial cluster centers in k-means algorithm
    Erisoglu, Murat
    Calis, Nazif
    Sakallioglu, Sadullah
    PATTERN RECOGNITION LETTERS, 2011, 32 (14) : 1701 - 1705
  • [3] k*-means -: A generalized k-means clustering algorithm with unknown cluster number
    Cheung, YM
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 307 - 317
  • [4] Enhancing the K-means Algorithm Using Cluster Adjustment
    Yamout, Fadi
    2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 307 - 311
  • [5] On selecting the Initial Cluster Centers in the K-means Algorithm
    Tanir, Deniz
    Nuriyeva, Fidan
    2017 11TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2017), 2017, : 131 - 135
  • [6] Optimization and improvement based on K-Means Cluster algorithm
    Wu, Jieming
    Yu, Wenhu
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 335 - 339
  • [7] Cluster center initialization algorithm for K-means clustering
    Khan, SS
    Ahmad, A
    PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302
  • [8] ANR: An algorithm to recommend initial cluster centers for k-means algorithm
    Delavar, Arash Ghorbannia
    Mohebpour, Gholam Hasan
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2014, 11 (04): : 277 - 290
  • [9] A Novel Genetic Algorithm Based k-means Algorithm for Cluster Analysis
    El-Shorbagy, M. A.
    Ayoub, A. Y.
    El-Desoky, I. M.
    Mousa, A. A.
    INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 92 - 101
  • [10] Algorithm for the k-means clustering based on minimum cluster size
    Wang, Shou-Qiang
    Zhu, Da-Ming
    Tongxin Xuebao/Journal on Communications, 2010, 31 (07): : 46 - 52