Study on density peaks clustering based on k-nearest neighbors and principal component analysis

被引:371
|
作者
Du, Mingjing [1 ,2 ]
Ding, Shifei [1 ,2 ]
Jia, Hongjie [1 ,2 ]
机构
[1] China Univ Min & Technol, Sch Comp Sci & Technol, Xuzhou 221116, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100090, Peoples R China
基金
中国国家自然科学基金;
关键词
Data clustering; Density peaks; k Nearest neighbors (KNN); Principal component analysis (PCA); ALGORITHM; SEARCH;
D O I
10.1016/j.knosys.2016.02.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Density peaks clustering (DPC) algorithm published in the US journal Science in 2014 is a novel clustering algorithm based on density. It needs neither iterative process nor more parameters. However, original algorithm only has taken into account the global structure of data, which leads to missing many clusters. In addition, DPC does not perform well when data sets have relatively high dimension. Especially, DPC generates wrong number of clusters of real-world data sets. In order to overcome the first problem, we propose a density peaks clustering based on k nearest neighbors (DPC-KNN) which introduces the idea of k nearest neighbors (KNN) into DPC and has another option for the local density computation. In order to overcome the second problem, we introduce principal component analysis (PCA) into the model of DPC-KNN and further bring forward a method based on PCA (DPC-KNN-PCA), which preprocesses high dimensional data. By experiments on synthetic data sets, we demonstrate the feasibility of our algorithms. By experiments on real-world data sets, we compared this algorithm with k-means algorithm and spectral clustering (SC) algorithm in accuracy. Experimental results show that our algorithms are feasible and effective. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:135 / 145
页数:11
相关论文
共 50 条
  • [41] A UNIMODAL CLUSTERING-ALGORITHM BASED ON THE K-NEAREST NEIGHBORS METHOD
    KOVALENKO, AP
    AUTOMATION AND REMOTE CONTROL, 1993, 54 (05) : 794 - 798
  • [42] Fault recognition based on principal component analysis and k-nearest neighbor algorithm
    Zou G.
    Ren K.
    Ji Y.
    Ding J.
    Zhang S.
    Meitiandizhi Yu Kantan/Coal Geology and Exploration, 2021, 49 (04): : 15 - 23
  • [43] TWO-PHASES CLUSTERING ALGORITHM BASED ON SUBTRACTIVE CLUSTERING AND K-NEAREST NEIGHBORS
    Shieh, Horng-Lin
    Kuo, Cheng-Chien
    Chen, Fu-Hsien
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 1802 - 1806
  • [44] Fault Detection in the Tennessee Eastman Benchmark Process Using Principal Component Difference Based on K-Nearest Neighbors
    Zhang, Cheng
    Guo, Qingxiu
    Li, Yuan
    IEEE ACCESS, 2020, 8 : 49999 - 50009
  • [45] A Novel Density Peaks Clustering Halo Node Assignment Method Based on K-Nearest Neighbor Theory
    Wang, Limin
    Zhou, Wei
    Wang, Honghuan
    Parmar, Milan
    Han, Xuming
    IEEE ACCESS, 2019, 7 (174380-174390) : 174380 - 174390
  • [46] Unsupervised image clustering algorithm based on contrastive learning and K-nearest neighbors
    Zhang, Xiuling
    Wang, Shuo
    Wu, Ziyun
    Tan, Xiaofei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (09) : 2415 - 2423
  • [47] A grid resource discovery method based on adaptive k-nearest neighbors clustering
    Zhang, Yan
    Jia, Yan
    Huang, Xiaobin
    Zhou, Bin
    Gu, Jian
    COMBINATORIAL OPTIMIZATION AND APPLICATIONS, PROCEEDINGS, 2007, 4616 : 171 - +
  • [48] Unsupervised image clustering algorithm based on contrastive learning and K-nearest neighbors
    Xiuling Zhang
    Shuo Wang
    Ziyun Wu
    Xiaofei Tan
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 2415 - 2423
  • [49] Fast agglomerative clustering using information of k-nearest neighbors
    Chang, Chih-Tang
    Lai, Jim Z. C.
    Jeng, M. D.
    PATTERN RECOGNITION, 2010, 43 (12) : 3958 - 3968
  • [50] Information theoretic clustering using a k-nearest neighbors approach
    Vikjord, Vidar V.
    Jenssen, Robert
    PATTERN RECOGNITION, 2014, 47 (09) : 3070 - 3081