A self-adaptive graph-based clustering method with noise identification

被引:5
作者
Li, Lin [1 ]
Chen, Xiang [2 ]
Song, Chengyun [1 ]
机构
[1] Chongqing Univ Technol, Coll Comp Sci & Engn, Chongqing 400054, Peoples R China
[2] Chongqing Univ, Coll Comp Sci, Chongqing 400044, Peoples R China
关键词
Directed natural neighbor graph; Graph-based clustering; Unsupervised learning; Nonlinear dataset; Noise cutting; NEIGHBOR;
D O I
10.1007/s10044-023-01160-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph-based clustering methods offer competitive performance in dealing with complex and nonlinear data patterns. The outstanding characteristic of such methods is the capability to mine the internal topological structure of a dataset. However, most graph-based clustering algorithms are vulnerable to parameters. In this paper, we propose a self-adaptive graph-based clustering method (SAGC) with noise identification based on directed natural neighbor graph to auto identify the desired number of clusters and simultaneously obtain reliable clustering results without prior knowledge and parameter setting. This method adopts parameter adaptive process to deal with specific data patterns and can identify clusters with diverse shapes and detect noises. We use synthetic and UCI real-world datasets to prove the validity of the innovatory method by comparing it to k-means, DBSCAN, OPTICS, AP, SC, CutPC, and WC algorithms in terms of clustering Accuracy, Adjusted Rand index, Normalized Mutual Information and Fowlkes-Mallows index. The experimental results confirm that the proposed method contributes to the progress of graph-based clustering algorithms.
引用
收藏
页码:907 / 916
页数:10
相关论文
共 20 条
[1]  
Ankerst M., 1999, SIGMOD Record, V28, P49, DOI 10.1145/304181.304187
[2]  
[Anonymous], 1951, Handbook of experimental psychology
[3]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[4]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[5]  
Ester Martin, 1996, P 2 INT C KNOWLEDGE, DOI DOI 10.5555/3001460.3001507
[6]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[7]  
Guo GD, 2003, LECT NOTES COMPUT SC, V2888, P986
[8]   Outer-Points shaver: Robust graph-based clustering via node cutting [J].
Kim, Younghoon ;
Do, Hyungrok ;
Kim, Seoung Bum .
PATTERN RECOGNITION, 2020, 97
[9]   A novel graph-based clustering method using noise cutting [J].
Li, Lin-Tao ;
Xiong, Zhong-Yang ;
Dai, Qi-Zhu ;
Zha, Yong-Fang ;
Zhang, Yu-Fang ;
Dan, Jing-Pei .
INFORMATION SYSTEMS, 2020, 91
[10]  
MacQueen J., 1967, P 5 BERK S MATH STAT, P281, DOI DOI 10.1007/S11665-016-2173-6