Subspace K-means clustering

被引:0
|
作者
Marieke E. Timmerman
Eva Ceulemans
Kim De Roover
Karla Van Leeuwen
机构
[1] University of Groningen,Heymans Institute for Psychology, Psychometrics & Statistics
[2] K.U. Leuven,Educational Sciences
[3] K.U. Leuven,Parenting and Special Education
来源
Behavior Research Methods | 2013年 / 45卷
关键词
Cluster analysis; Cluster recovery; Multivariate data; Reduced ; -means; means; Factorial ; -means; Mixtures of factor analyzers; MCLUST;
D O I
暂无
中图分类号
学科分类号
摘要
To achieve an insightful clustering of multivariate data, we propose subspace K-means. Its central idea is to model the centroids and cluster residuals in reduced spaces, which allows for dealing with a wide range of cluster types and yields rich interpretations of the clusters. We review the existing related clustering methods, including deterministic, stochastic, and unsupervised learning approaches. To evaluate subspace K-means, we performed a comparative simulation study, in which we manipulated the overlap of subspaces, the between-cluster variance, and the error variance. The study shows that the subspace K-means algorithm is sensitive to local minima but that the problem can be reasonably dealt with by using partitions of various cluster procedures as a starting point for the algorithm. Subspace K-means performs very well in recovering the true clustering across all conditions considered and appears to be superior to its competitor methods: K-means, reduced K-means, factorial K-means, mixtures of factor analyzers (MFA), and MCLUST. The best competitor method, MFA, showed a performance similar to that of subspace K-means in easy conditions but deteriorated in more difficult ones. Using data from a study on parental behavior, we show that subspace K-means analysis provides a rich insight into the cluster characteristics, in terms of both the relative positions of the clusters (via the centroids) and the shape of the clusters (via the within-cluster residuals).
引用
收藏
页码:1011 / 1023
页数:12
相关论文
共 50 条
  • [1] Subspace K-means clustering
    Timmerman, Marieke E.
    Ceulemans, Eva
    De Roover, Kim
    Van Leeuwen, Karla
    BEHAVIOR RESEARCH METHODS, 2013, 45 (04) : 1011 - 1023
  • [2] Adapting k-means for graph clustering
    Sami Sieranoja
    Pasi Fränti
    Knowledge and Information Systems, 2022, 64 : 115 - 142
  • [3] Adapting k-means for supervised clustering
    S. H. Al-Harbi
    V. J. Rayward-Smith
    Applied Intelligence, 2006, 24 : 219 - 226
  • [4] On the Behaviour of K-Means Clustering of a Dissimilarity Matrix by Means of Full Multidimensional Scaling
    J. Fernando Vera
    Rodrigo Macías
    Psychometrika, 2021, 86 : 489 - 513
  • [5] Data clustering using K-Means based on Crow Search Algorithm
    K Lakshmi
    N Karthikeyani Visalakshi
    S Shanthi
    Sādhanā, 2018, 43
  • [6] Adapting k-means for graph clustering
    Sieranoja, Sami
    Franti, Pasi
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (01) : 115 - 142
  • [7] Large scale K-means clustering using GPUs
    Mi Li
    Eibe Frank
    Bernhard Pfahringer
    Data Mining and Knowledge Discovery, 2023, 37 : 67 - 109
  • [8] Shallow decision trees for explainable k-means clustering
    Laber, Eduardo
    Murtinho, Lucas
    Oliveira, Felipe
    PATTERN RECOGNITION, 2023, 137
  • [9] A MapReduce-based K-means clustering algorithm
    YiMin Mao
    DeJin Gan
    D. S. Mwakapesa
    Y. A. Nanehkaran
    Tao Tao
    XueYu Huang
    The Journal of Supercomputing, 2022, 78 : 5181 - 5202
  • [10] Clustering Large Datasets by Merging K-Means Solutions
    Volodymyr Melnykov
    Semhar Michael
    Journal of Classification, 2020, 37 : 97 - 123