A new interpoint distance-based clustering algorithm using kernel density estimation

被引:6
|
作者
Modak, Soumita [1 ]
机构
[1] Univ Calcutta, Fac Stat, Dept Stat, Basanti Devi Coll, 147B Rash Behari Ave, Kolkata 700029, India
关键词
Clustering algorithm; Interpoint distance; Nonparametric method; Kernel density estimator; High-dimensional applicability;
D O I
10.1080/03610918.2023.2179071
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A novel nonparametric clustering algorithm is proposed using the interpoint distances between the members of the data to reveal the inherent clustering structure existing in the given set of data, where we apply the classical nonparametric univariate kernel density estimation method to the interpoint distances to estimate the density around a data member. Our clustering algorithm is simple in its formation and easy to apply resulting in well-defined clusters. The algorithm starts with objective selection of the initial cluster representative and always converges independently of this choice. The method finds the number of clusters itself and can be used irrespective of the nature of underlying data by using an appropriate interpoint distance measure. The cluster analysis can be carried out in any dimensional space with viability to high-dimensional use. The distributions of the data or their interpoint distances are not required to be known due to the design of our procedure, except the assumption that the interpoint distances possess a density function. Data study shows its effectiveness and superiority over the widely used clustering algorithms.
引用
收藏
页码:5323 / 5341
页数:19
相关论文
共 50 条
  • [1] A new nonparametric interpoint distance-based measure for assessment of clustering
    Modak, Soumita
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (05) : 1062 - 1077
  • [2] A new algorithm for clustering based on kernel density estimation
    Matioli, L. C.
    Santos, S. R.
    Kleina, M.
    Leite, E. A.
    JOURNAL OF APPLIED STATISTICS, 2018, 45 (02) : 347 - 366
  • [3] Distance and density based clustering algorithm using Gaussian kernel
    Gungor, Emre
    Ozmen, Ahmet
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 69 : 10 - 20
  • [4] MulticlusterKDE: a new algorithm for clustering based on multivariate kernel density estimation
    Scaldelai, D.
    Matioli, L. C.
    Santos, S. R.
    Kleina, M.
    JOURNAL OF APPLIED STATISTICS, 2022, 49 (01) : 98 - 121
  • [5] Low Density Separation Density Sensitive Distance-based Spectral Clustering Algorithm
    Tao X.-M.
    Wang R.-T.
    Chang R.
    Li C.-X.
    Liu Y.-C.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (07): : 1479 - 1495
  • [6] Distance-Based Ensemble Online Classifier with Kernel Clustering
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    INTELLIGENT DECISION TECHNOLOGIES, 2015, 39 : 279 - 289
  • [7] A new clustering algorithm based on distance and density
    Yu, XP
    Zhou, DY
    Zhou, Y
    2005 INTERNATIONAL CONFERENCE ON SERVICES SYSTEMS AND SERVICES MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1016 - 1021
  • [8] SmallSteps: An adaptive distance-based clustering algorithm
    Koch, Gy.
    Dombi, J.
    Acta Cybernetica, 2001, 15 (02): : 241 - 256
  • [9] SmallSteps: An adaptive distance-based clustering algorithm
    2001, University of Szeged, Arpad ter 2., Szeged, H-6720, Hungary (15):
  • [10] Density-based clustering algorithm using kernel density estimation and hill-down strategy
    Xie, Conghua
    Song, Yuqing
    Liu, Zhe
    Journal of Information and Computational Science, 2010, 7 (01): : 135 - 142