On Application of a Probabilistic K-Nearest Neighbors Model for Cluster Validation Problem

被引:2
|
作者
Volkovich, Zeev [1 ]
Barzily, Zeev [1 ]
Avros, Renata [1 ]
Toledano-Kitai, Dvora [1 ]
机构
[1] ORT Braude Coll Engn, Software Engn Dept, IL-21982 Karmiel, Israel
关键词
Clustering; Cluster stability; Data mining; K-Nearest neighbors; RESAMPLING METHOD; NUMBER;
D O I
10.1080/03610926.2011.562786
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
K-Nearest Neighbors is a widely used technique for classifying and clustering data. In the current article, we address the cluster stability problem based upon probabilistic characteristics of this approach. We estimate the stability of partitions obtained from clustering pairs of samples. Partitions are presumed to be consistent if their clusters are stable. Clusters validity is quantified through the amount of K-Nearest Neighbors belonging to the point's sample. The null-hypothesis, of the well-mixed samples within the clusters, suggests Binomial Distribution of this quantity with K trials and the success probability 0.5. A cluster is represented by a summarizing index, of the p-values calculated over all cluster objects, under the null hypothesis for the alternative, and the partition quality is evaluated via the worst partition cluster. The true number of clusters is attained by the empirical index distribution having maximal suitable asymmetry. The proposed methodology offers to produce the index distributions sequentially and to assess their asymmetry. Numerical experiments exhibit a good capability of the methodology to expose the true number of clusters.
引用
收藏
页码:2997 / 3010
页数:14
相关论文
共 50 条
  • [41] Exploring Target Identification for Drug Design with K-Nearest Neighbors' Algorithm
    Jimenes-Vargas, Karina
    Perez-Castillo, Yunierkis
    Tejera, Eduardo
    Munteanu, Cristian R.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2023, PT II, 2023, 14126 : 219 - 227
  • [42] Kinetic Reverse k-Nearest Neighbor Problem
    Rahmati, Zahed
    King, Valerie
    Whitesides, Sue
    COMBINATORIAL ALGORITHMS, IWOCA 2014, 2015, 8986 : 307 - 317
  • [43] Weather Prediction and Classification Using Neural Networks and k-Nearest Neighbors
    Mantri, Rhea
    Raghavendra, Kulkarni Rakshit
    Puri, Harshita
    Chaudhary, Jhanavi
    Bingi, Kishore
    2021 8TH INTERNATIONAL CONFERENCE ON SMART COMPUTING AND COMMUNICATIONS (ICSCC), 2021, : 263 - 268
  • [44] A quantum k-nearest neighbors algorithm based on the Euclidean distance estimation
    Zardini, Enrico
    Blanzieri, Enrico
    Pastorello, Davide
    QUANTUM MACHINE INTELLIGENCE, 2024, 6 (01)
  • [45] Incremental k-Nearest Neighbors Using Reservoir Sampling for Data Streams
    Bahri, Maroua
    Bifet, Albert
    DISCOVERY SCIENCE (DS 2021), 2021, 12986 : 122 - 137
  • [46] K-Nearest Neighbors Classifier for Field Bit Error Rate Data
    Allogba, Stephanie
    Tremblay, Christine
    2018 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2018,
  • [47] The Comparison of Linear Regression Method and K-Nearest Neighbors in Scholarship Recipient
    Okfalisa
    Fitriani, Ratika
    Vitriani, Yelfi
    2018 19TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2018, : 194 - 199
  • [48] Density peaks clustering algorithm with K-nearest neighbors and weighted similarity
    Zhao J.
    Chen L.
    Wu R.-X.
    Zhang B.
    Han L.-Z.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (12): : 2349 - 2357
  • [49] Research on the Humanlike Trajectories Control of Robots Based on the K-Nearest Neighbors
    Wang Lei
    Liu Zhaowei
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 7746 - 7751
  • [50] Weighted K-nearest neighbors classification based on Whale optimization algorithm
    Anvari, S.
    Azgomi, M. Abdollahi
    Dishabi, M. R. Ebrahimi
    Maheri, M.
    IRANIAN JOURNAL OF FUZZY SYSTEMS, 2023, 20 (03): : 61 - 74