A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters

被引:36
作者
Ren, Min [1 ,2 ,3 ]
Liu, Peiyu [1 ,3 ]
Wang, Zhihao [1 ,3 ]
Yi, Jing [1 ,3 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Shandong, Peoples R China
[2] Shandong Univ Finance & Econ, Sch Math & Quantitat Econ, Jinan, Shandong, Peoples R China
[3] Shandong Prov Key Lab Distributed Comp Software N, Jinan, Shandong, Peoples R China
基金
中国国家自然科学基金; 国家教育部科学基金资助;
关键词
VALIDITY INDEX;
D O I
10.1155/2016/2647389
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
For the shortcoming of fuzzy c-means algorithm (FCM) needing to know the number of clusters in advance, this paper proposed a new self-adaptive method to determine the optimal number of clusters. Firstly, a density-based algorithm was put forward. The algorithm, according to the characteristics of the dataset, automatically determined the possible maximum number of clusters instead of using the empirical rule root n and obtained the optimal initial cluster centroids, improving the limitation of FCM that randomly selected cluster centroids lead the convergence result to the local minimum. Secondly, this paper, by introducing a penalty function, proposed a new fuzzy clustering validity index based on fuzzy compactness and separation, which ensured that when the number of clusters verged on that of objects in the dataset, the value of clustering validity index did not monotonically decrease and was close to zero, so that the optimal number of clusters lost robustness and decision function. Then, based on these studies, a self-adaptive FCM algorithm was put forward to estimate the optimal number of clusters by the iterative trial-and-error process. At last, experiments were done on the UCI, KDD Cup 1999, and synthetic datasets, which showed that the method not only effectively determined the optimal number of clusters, but also reduced the iteration of FCM with the stable clustering result.
引用
收藏
页数:12
相关论文
共 37 条
[1]  
[Anonymous], 2000, Icml
[2]  
[Anonymous], 2013, PATTERN RECOGN, DOI DOI 10.1007/978-1-4757-0450-1
[3]   Validity-guided (re)clustering with applications to image segmentation [J].
Bensaid, AM ;
Hall, LO ;
Bezdek, JC ;
Clarke, LP ;
Silbiger, ML ;
Arrington, JA ;
Murtagh, RF .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1996, 4 (02) :112-123
[4]  
Bezdek J. C., 1973, Journal of Cybernetics, V3, P58, DOI 10.1080/01969727308546047
[5]   NUMERICAL TAXONOMY WITH FUZZY SETS [J].
BEZDEK, JC .
JOURNAL OF MATHEMATICAL BIOLOGY, 1974, 1 (01) :57-71
[6]   Multi-objective nature-inspired clustering and classification techniques for image segmentation [J].
Bong, Chin-Wei ;
Rajeswari, Mandava .
APPLIED SOFT COMPUTING, 2011, 11 (04) :3271-3282
[7]  
Dunn J. C., 1974, Journal of Cybernetics, V4, P95, DOI 10.1080/01969727408546059
[8]   Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[9]   A robust competitive clustering algorithm with applications in computer vision [J].
Frigui, H ;
Krishnapuram, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1999, 21 (05) :450-465
[10]   Mercer kernel-based clustering in feature space [J].
Girolami, M .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (03) :780-784