Optimal Fuzzy Clustering in Overlapping Clusters

被引:0
作者
Ammor, Ouafa
Lachkar, Abdelmonaime
Slaoui, Khadija
Rais, Noureddine
机构
关键词
Unsupervised clustering; cluster validity index; optimal clusters number; overlapping clusters; maximum entropy principle;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The fuzzy c-means clustering algorithm has been widely used to obtain the fuzzy k-partitions. This algorithm requires that the user gives the number of clusters k. To find automatically the "right" number of clusters, k, for a given data set, many validity indexes algorithms have been proposed in the literature. Most of these indexes do not work well for clusters with different overlapping degree. They usually have a tendency to fails in selecting the correct optimal clusters number when dealing with some data sets containing overlapping clusters. To overcome this limitation, we propose in this paper, a new and efficient clusters validity measure for determination of the optimal number of clusters which can deal successfully with or without situation of overlapping. This measure is based on maximum entropy principle. Our approach does not require any parameter adjustment, it is then completely automatic. Many simulated and real examples are presented, showing the superiority of our measure to the existing ones.
引用
收藏
页码:402 / 408
页数:7
相关论文
共 17 条
[1]  
AMMOR O, 2006, P IEEE AICS, P26
[2]  
ANDERSON E, 1959, B AM IRIS SOC, V59, P2
[3]  
BEZDEK J, 1975, CYBERNET SYST, V3, P58
[4]  
CEMBRZYNSKI T, 1986, RR0784 INRIA
[5]  
Fukuyama Y., 1989, P 5 FUZZ SYST S, V5, P247
[6]  
Halkidi M, 2000, LECT NOTES COMPUT<D>, V1910, P265
[7]   Clustering validity assessment: Finding the optimal partitioning of a data set [J].
Halkidi, M ;
Vazirgiannis, M .
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, :187-194
[8]  
Halkidi M, 2002, SIGMOD REC, V31, P19, DOI 10.1145/601858.601862
[9]   Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323
[10]  
Kim DJ, 2001, IEICE T INF SYST, VE84D, P281