Clustering with Local Density Peaks-Based Minimum Spanning Tree

被引:87
作者
Cheng, Dongdong [1 ]
Zhu, Qingsheng [2 ]
Huang, Jinlong [1 ]
Wu, Quanwang [2 ]
Yang, Lijun [3 ]
机构
[1] Yangtze Normal Univ, Coll Comp Engn, Chongqing 408100, Peoples R China
[2] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
[3] Southwest Minzu Univ, Sch Comp Sci & Technol, Chengdu 610041, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
minimum spanning tree; clustering; local density peaks; shared neighbors-based distance; NEIGHBOR; ALGORITHM;
D O I
10.1109/TKDE.2019.2930056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering analysis has been widely used in statistics, machine learning, pattern recognition, image processing, and so on. It is a great challenge for most existing clustering algorithms to discover clusters with arbitrary shapes. Clustering algorithms based on Minimum spanning tree (MST) are able to discover clusters with arbitrary shapes, but they are time consuming and susceptible to noise points. In this paper, we employ local density peaks (LDP) to represent the whole data set and define a shared neighbors-based distance between local density peaks to better measure the dissimilarity between objects on manifold data. On the basis of local density peaks and the new distance, we propose a novel MST-based clustering algorithm called LDP-MST. It first uses local density peaks to construct MST and then repeatedly cuts the longest edge until a given number of clusters are found. The experimental results on synthetic data sets and real data sets show that our algorithm is competent with state-of-the-art methods when discovering clusters with complex structures.
引用
收藏
页码:374 / 387
页数:14
相关论文
共 46 条
[1]  
[Anonymous], 2007, ACM Transactions on Knowledge Discovery from Data (TKDD), DOI DOI 10.1145/1297332.1297338
[2]  
[Anonymous], 1956, P AM MATH SOC, DOI [10.1090/S0002-9939-1956-0078686-7, DOI 10.1090/S0002-9939-1956-0078686-7]
[3]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[4]  
Caccetta L., 2015, NETW, V37, P74
[5]   Parallel Spectral Clustering in Distributed Systems [J].
Chen, Wen-Yen ;
Song, Yangqiu ;
Bai, Hongjie ;
Lin, Chih-Jen ;
Chang, Edward Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :568-586
[6]   Decentralized Clustering by Finding Loose and Distributed Density Cores [J].
Chen, Yewang ;
Tang, Shengyu ;
Zhou, Lida ;
Wang, Cheng ;
Du, Jixiang ;
Wang, Tian ;
Pei, Songwen .
INFORMATION SCIENCES, 2018, 433 :510-526
[7]   A Novel Cluster Validity Index Based on Local Cores [J].
Cheng, Dongdong ;
Zhu, Qingsheng ;
Huang, Jinlong ;
Wu, Quanwang ;
Yang, Lijun .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (04) :985-999
[8]   Natural neighbor-based clustering algorithm with local representatives [J].
Cheng, Dongdong ;
Zhu, Qingsheng ;
Huang, Jinlong ;
Yang, Lijun ;
Wu, Quanwang .
KNOWLEDGE-BASED SYSTEMS, 2017, 123 :238-253
[9]   Minimal spanning tree based clustering technique: Relationship with Bayes Classifier [J].
Chowdhury, N ;
Murthy, CA .
PATTERN RECOGNITION, 1997, 30 (11) :1919-1929
[10]   Looking for natural patterns in data - Part 1. Density-based approach [J].
Daszykowski, M ;
Walczak, B ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 56 (02) :83-92