Subspace K-means clustering

被引:0
|
作者
Marieke E. Timmerman
Eva Ceulemans
Kim De Roover
Karla Van Leeuwen
机构
[1] University of Groningen,Heymans Institute for Psychology, Psychometrics & Statistics
[2] K.U. Leuven,Educational Sciences
[3] K.U. Leuven,Parenting and Special Education
来源
Behavior Research Methods | 2013年 / 45卷
关键词
Cluster analysis; Cluster recovery; Multivariate data; Reduced ; -means; means; Factorial ; -means; Mixtures of factor analyzers; MCLUST;
D O I
暂无
中图分类号
学科分类号
摘要
To achieve an insightful clustering of multivariate data, we propose subspace K-means. Its central idea is to model the centroids and cluster residuals in reduced spaces, which allows for dealing with a wide range of cluster types and yields rich interpretations of the clusters. We review the existing related clustering methods, including deterministic, stochastic, and unsupervised learning approaches. To evaluate subspace K-means, we performed a comparative simulation study, in which we manipulated the overlap of subspaces, the between-cluster variance, and the error variance. The study shows that the subspace K-means algorithm is sensitive to local minima but that the problem can be reasonably dealt with by using partitions of various cluster procedures as a starting point for the algorithm. Subspace K-means performs very well in recovering the true clustering across all conditions considered and appears to be superior to its competitor methods: K-means, reduced K-means, factorial K-means, mixtures of factor analyzers (MFA), and MCLUST. The best competitor method, MFA, showed a performance similar to that of subspace K-means in easy conditions but deteriorated in more difficult ones. Using data from a study on parental behavior, we show that subspace K-means analysis provides a rich insight into the cluster characteristics, in terms of both the relative positions of the clusters (via the centroids) and the shape of the clusters (via the within-cluster residuals).
引用
收藏
页码:1011 / 1023
页数:12
相关论文
共 50 条
  • [21] ECKM: An improved K-means clustering based on computational geometry
    Biswas, Tuhin Kr
    Giri, Kinsuk
    Roy, Samir
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [22] Multiple kernel k-means clustering with block diagonal property
    Cuiling Chen
    Jian Wei
    Zhi Li
    Pattern Analysis and Applications, 2023, 26 (3) : 1515 - 1526
  • [23] An efficient enhanced k-means clustering algorithm
    FAHIM A.M
    SALEM A.M
    TORKEY F.A
    RAMADAN M.A
    Journal of Zhejiang University Science A(Science in Engineering), 2006, (10) : 1626 - 1633
  • [24] Application of k-means clustering in psychological studies
    Zakharov, Kyrylo
    QUANTITATIVE METHODS FOR PSYCHOLOGY, 2016, 12 (02): : 87 - 100
  • [25] Improvement and Parallelism of k-Means Clustering Algorithm
    田金兰
    朱林
    张素琴
    刘璐
    Tsinghua Science and Technology, 2005, (03) : 277 - 281
  • [26] K-means Clustering for Periodogram of Logging Curve
    He Yan
    CONTEMPORARY INNOVATION AND DEVELOPMENT IN STATISTICAL SCIENCE, 2012, : 11 - 14
  • [27] Strong Consistency of Reduced K-means Clustering
    Terada, Yoshikazu
    SCANDINAVIAN JOURNAL OF STATISTICS, 2014, 41 (04) : 913 - 931
  • [28] Far Efficient K-Means Clustering Algorithm
    Mishra, Bikram Keshari
    Nayak, Nihar Ranjan
    Rath, Amiya
    Swain, Sagarika
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI'12), 2012, : 106 - 110
  • [29] K-Means Extensions for Clustering Categorical Data
    Alwersh, Mohammed
    Kovacs, Laszlo
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 492 - 507
  • [30] Choosing the Number of Clusters in K-Means Clustering
    Steinley, Douglas
    Brusco, Michael J.
    PSYCHOLOGICAL METHODS, 2011, 16 (03) : 285 - 297