On Probabilistic k-Richness of the k-Means Algorithms

被引:2
|
作者
Klopotek, Robert A. [1 ]
Klopotek, Mieczyslaw A. [2 ]
机构
[1] Cardinal Stefan Wyszynski Univ Warsaw, Fac Math & Nat Sci, Sch Exact Sci, Warsaw, Poland
[2] Polish Acad Sci, Comp Sci Fundamental Res Inst, Warsaw, Poland
来源
MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE | 2019年 / 11943卷
关键词
k-means; k-means plus; k-richness; Probabilistic k-richness; Weak probabilistic k-richness;
D O I
10.1007/978-3-030-37599-7_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With Kleinberg's axiomatic system for clustering, a discussion has been initiated, what kind of properties clustering algorithms should have and have not. As Ackerman et al. pointed out, the static properties studied by Kleinberg and other are not appropriate for clustering algorithms with elements of randomness. Therefore they introduced the property of probabilistic k-richness and claimed, without a proof that the versions of k-means both with random initialisation and k-means++ initialization have this property. We prove that k-means++ has the property of probabilistic k-richness, while k-means with random initialisation for well separated clusters does not. To characterize the latter, we introduce the notion of weak probabilistic k-richness and prove it for this algorithm. For completeness, we provide with a constructive proof that the theoretical k-means has the (deterministic) k-richness property.
引用
收藏
页码:259 / 271
页数:13
相关论文
共 50 条
  • [21] PSO Aided k-Means Clustering: Introducing Connectivity in k-Means
    Breaban, Mihaela Elena
    Luchian, Henri
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 1227 - 1234
  • [22] COMPRESSIVE K-MEANS
    Keriven, Nicolas
    Tremblay, Nicolas
    Traonmilin, Yann
    Gribonval, Remi
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 6369 - 6373
  • [23] k-means: A revisit
    Zhao, Wan-Lei
    Deng, Cheng-Hao
    Ngo, Chong-Wah
    NEUROCOMPUTING, 2018, 291 : 195 - 206
  • [24] Balanced k-Means
    Tai, Chen-Ling
    Wang, Chen-Shu
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2017), PT II, 2017, 10192 : 75 - 82
  • [25] k-Means-MIND: An Efficient Alternative to Repetitive k-Means Runs
    Olukanmi, Peter O.
    Nelwamondo, Fuluffielo
    Marwala, Tshilidzi
    2020 7TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2020), 2020, : 172 - 176
  • [26] Comparision of k-means and PAM algorithms using cancer datasets
    Kumar, Parvesh
    Wasan, Siri Krishan
    ICSOFT 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL ISDM/ABF, 2008, : 255 - 258
  • [27] Applied Comparison of DBSCAN, OPTICS and K-Means Clustering Algorithms
    Bilgin, Turgay Tugay
    Camurcu, Yilmaz
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2005, 8 (02): : 139 - 145
  • [28] Local search approximation algorithms for the k-means problem with penalties
    Zhang, Dongmei
    Hao, Chunlin
    Wu, Chenchen
    Xu, Dachuan
    Zhang, Zhenning
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2019, 37 (02) : 439 - 453
  • [29] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [30] Sorted K-Means Towards the Enhancement of K-Means to Form Stable Clusters
    Arora, Preeti
    Virmani, Deepali
    Jindal, Himanshu
    Sharma, Mritunjaya
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORKS, 2017, 508 : 479 - 486