Subspace K-means clustering

被引：0

作者：

Marieke E. Timmerman

Eva Ceulemans

Kim De Roover

Karla Van Leeuwen

机构：

[1] University of Groningen,Heymans Institute for Psychology, Psychometrics & Statistics

[2] K.U. Leuven,Educational Sciences

[3] K.U. Leuven,Parenting and Special Education

来源：

Behavior Research Methods | 2013年 / 45卷

关键词：

Cluster analysis; Cluster recovery; Multivariate data; Reduced ; -means; means; Factorial ; -means; Mixtures of factor analyzers; MCLUST;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

To achieve an insightful clustering of multivariate data, we propose subspace K-means. Its central idea is to model the centroids and cluster residuals in reduced spaces, which allows for dealing with a wide range of cluster types and yields rich interpretations of the clusters. We review the existing related clustering methods, including deterministic, stochastic, and unsupervised learning approaches. To evaluate subspace K-means, we performed a comparative simulation study, in which we manipulated the overlap of subspaces, the between-cluster variance, and the error variance. The study shows that the subspace K-means algorithm is sensitive to local minima but that the problem can be reasonably dealt with by using partitions of various cluster procedures as a starting point for the algorithm. Subspace K-means performs very well in recovering the true clustering across all conditions considered and appears to be superior to its competitor methods: K-means, reduced K-means, factorial K-means, mixtures of factor analyzers (MFA), and MCLUST. The best competitor method, MFA, showed a performance similar to that of subspace K-means in easy conditions but deteriorated in more difficult ones. Using data from a study on parental behavior, we show that subspace K-means analysis provides a rich insight into the cluster characteristics, in terms of both the relative positions of the clusters (via the centroids) and the shape of the clusters (via the within-cluster residuals).

引用

页码：1011 / 1023

页数：12

共 50 条

[1] Subspace K-means clustering
Timmerman, Marieke E.
Ceulemans, Eva
De Roover, Kim
Van Leeuwen, Karla
BEHAVIOR RESEARCH METHODS, 2013, 45 (04) : 1011 - 1023
[2] Adapting k-means for graph clustering
Sami Sieranoja
Pasi Fränti
Knowledge and Information Systems, 2022, 64 : 115 - 142
[3] Adapting k-means for supervised clustering
S. H. Al-Harbi
V. J. Rayward-Smith
Applied Intelligence, 2006, 24 : 219 - 226
[4] On the Behaviour of K-Means Clustering of a Dissimilarity Matrix by Means of Full Multidimensional Scaling
J. Fernando Vera
Rodrigo Macías
Psychometrika, 2021, 86 : 489 - 513
[5] Data clustering using K-Means based on Crow Search Algorithm
K Lakshmi
N Karthikeyani Visalakshi
S Shanthi
Sādhanā, 2018, 43
[6] Adapting k-means for graph clustering
Sieranoja, Sami
Franti, Pasi
KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (01) : 115 - 142
[7] Large scale K-means clustering using GPUs
Mi Li
Eibe Frank
Bernhard Pfahringer
Data Mining and Knowledge Discovery, 2023, 37 : 67 - 109
[8] Shallow decision trees for explainable k-means clustering
Laber, Eduardo
Murtinho, Lucas
Oliveira, Felipe
PATTERN RECOGNITION, 2023, 137
[9] A MapReduce-based K-means clustering algorithm
YiMin Mao
DeJin Gan
D. S. Mwakapesa
Y. A. Nanehkaran
Tao Tao
XueYu Huang
The Journal of Supercomputing, 2022, 78 : 5181 - 5202
[10] Clustering Large Datasets by Merging K-Means Solutions
Volodymyr Melnykov
Semhar Michael
Journal of Classification, 2020, 37 : 97 - 123

← 1 2 3 4 5 →