On Probabilistic k-Richness of the k-Means Algorithms

被引:2
作者
Klopotek, Robert A. [1 ]
Klopotek, Mieczyslaw A. [2 ]
机构
[1] Cardinal Stefan Wyszynski Univ Warsaw, Fac Math & Nat Sci, Sch Exact Sci, Warsaw, Poland
[2] Polish Acad Sci, Comp Sci Fundamental Res Inst, Warsaw, Poland
来源
MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE | 2019年 / 11943卷
关键词
k-means; k-means plus; k-richness; Probabilistic k-richness; Weak probabilistic k-richness;
D O I
10.1007/978-3-030-37599-7_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With Kleinberg's axiomatic system for clustering, a discussion has been initiated, what kind of properties clustering algorithms should have and have not. As Ackerman et al. pointed out, the static properties studied by Kleinberg and other are not appropriate for clustering algorithms with elements of randomness. Therefore they introduced the property of probabilistic k-richness and claimed, without a proof that the versions of k-means both with random initialisation and k-means++ initialization have this property. We prove that k-means++ has the property of probabilistic k-richness, while k-means with random initialisation for well separated clusters does not. To characterize the latter, we introduce the notion of weak probabilistic k-richness and prove it for this algorithm. For completeness, we provide with a constructive proof that the theoretical k-means has the (deterministic) k-richness property.
引用
收藏
页码:259 / 271
页数:13
相关论文
共 50 条
  • [31] Website Intelligent Recommendation Based on K-means and Apriori Algorithms
    Zhang, Shaohua
    Liu, Changhua
    Li, Qiaodan
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON ADVANCED CONTROL, AUTOMATION AND ARTIFICIAL INTELLIGENCE (ACAAI 2018), 2018, 155 : 188 - 190
  • [32] Comparative Analysis for k-Means Algorithms in Network Community Detection
    Liu, Jian
    ADVANCES IN COMPUTATION AND INTELLIGENCE, 2010, 6382 : 158 - 169
  • [33] A bad instance for k-means plus
    Brunsch, Tobias
    Roeglin, Heiko
    THEORETICAL COMPUTER SCIENCE, 2013, 505 : 19 - 26
  • [34] K*-Means: An Effective and Efficient K-means Clustering Algorithm
    Qi, Jianpeng
    Yu, Yanwei
    Wang, Lihong
    Liu, Jinglei
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCES ON BIG DATA AND CLOUD COMPUTING (BDCLOUD 2016) SOCIAL COMPUTING AND NETWORKING (SOCIALCOM 2016) SUSTAINABLE COMPUTING AND COMMUNICATIONS (SUSTAINCOM 2016) (BDCLOUD-SOCIALCOM-SUSTAINCOM 2016), 2016, : 242 - 249
  • [35] Better Guarantees for k-Means and Euclidean k-Median by Primal-Dual Algorithms
    Ahmadian, Sara
    Norouzi-Fard, Ashkan
    Svensson, Ola
    Ward, Justin
    2017 IEEE 58TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2017, : 61 - 72
  • [36] BETTER GUARANTEES FOR k-MEANS AND EUCLIDEAN k-MEDIAN BY PRIMAL-DUAL ALGORITHMS
    Ahmadian, Sara
    Norouzi-Fard, Ashkan
    Svensson, Ola
    Ward, Justin
    SIAM JOURNAL ON COMPUTING, 2020, 49 (04) : 97 - 156
  • [37] Applicability of K-medoids and K-means algorithms for segmenting students based on their scholastic performance
    Badhera, Usha
    Verma, Apoorva
    Nahar, Pooja
    JOURNAL OF STATISTICS AND MANAGEMENT SYSTEMS, 2022, 25 (07) : 1621 - 1632
  • [38] Cuckoo, Bat and Krill Herd based k-means plus plus clustering algorithms
    Aggarwal, Shruti
    Singh, Paramvir
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 14169 - 14180
  • [39] An Efficient K-Means Algorithm and its Benchmarking against other Algorithms
    Chadha, Anupama
    Kumar, Suresh
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2016, 9 (11): : 119 - 132
  • [40] Clustering performance comparison using K-means and expectation maximization algorithms
    Jung, Yong Gyu
    Kang, Min Soo
    Heo, Jun
    BIOTECHNOLOGY & BIOTECHNOLOGICAL EQUIPMENT, 2014, 28 : S44 - S48