Finding reproducible cluster partitions for the k-means algorithm

被引：0

作者：

Paulo JG Lisboa

Terence A Etchells

Ian H Jarman

Simon J Chambers

机构：

[1] Liverpool John Moores University,School of Computing and Mathematical Sciences

来源：

BMC Bioinformatics | / 14卷

关键词：

Cluster Solution; Adjust Rand Index; Cluster Partition; Point Dataset; Dual Measure;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

K-means clustering is widely used for exploratory data analysis. While its dependence on initialisation is well-known, it is common practice to assume that the partition with lowest sum-of-squares (SSQ) total i.e. within cluster variance, is both reproducible under repeated initialisations and also the closest that k-means can provide to true structure, when applied to synthetic data. We show that this is generally the case for small numbers of clusters, but for values of k that are still of theoretical and practical interest, similar values of SSQ can correspond to markedly different cluster partitions.

引用

共 50 条

[1] Finding reproducible cluster partitions for the k-means algorithm
Lisboa, Paulo J. G.
Etchells, Terence A.
Jarman, Ian H.
Chambers, Simon J.
BMC BIOINFORMATICS, 2013, 14
[2] A new algorithm for initial cluster centers in k-means algorithm
Erisoglu, Murat
Calis, Nazif
Sakallioglu, Sadullah
PATTERN RECOGNITION LETTERS, 2011, 32 (14) : 1701 - 1705
[3] k*-means -: A generalized k-means clustering algorithm with unknown cluster number
Cheung, YM
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2002, 2002, 2412 : 307 - 317
[4] Enhancing the K-means Algorithm Using Cluster Adjustment
Yamout, Fadi
2023 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE, CSCI 2023, 2023, : 307 - 311
[5] On selecting the Initial Cluster Centers in the K-means Algorithm
Tanir, Deniz
Nuriyeva, Fidan
2017 11TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT 2017), 2017, : 131 - 135
[6] Optimization and improvement based on K-Means Cluster algorithm
Wu, Jieming
Yu, Wenhu
2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 335 - 339
[7] Cluster center initialization algorithm for K-means clustering
Khan, SS
Ahmad, A
PATTERN RECOGNITION LETTERS, 2004, 25 (11) : 1293 - 1302
[8] ANR: An algorithm to recommend initial cluster centers for k-means algorithm
Delavar, Arash Ghorbannia
Mohebpour, Gholam Hasan
JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2014, 11 (04): : 277 - 290
[9] A Novel Genetic Algorithm Based k-means Algorithm for Cluster Analysis
El-Shorbagy, M. A.
Ayoub, A. Y.
El-Desoky, I. M.
Mousa, A. A.
INTERNATIONAL CONFERENCE ON ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS (AMLTA2018), 2018, 723 : 92 - 101
[10] Algorithm for the k-means clustering based on minimum cluster size
Wang, Shou-Qiang
Zhu, Da-Ming
Tongxin Xuebao/Journal on Communications, 2010, 31 (07): : 46 - 52

← 1 2 3 4 5 →