A geometric approach to cluster validity for normal mixtures

被引:60
作者
J. C. Bezdek
W. Q. Li
Y. Attikiouzel
M. Windham
机构
[1] Department of Computer Science,
[2] University of West Florida Pensacola,undefined
[3] FL 32514 USA,undefined
[4] Department of Electrical and Electronic Engineering,undefined
[5] University of Western Australia Nedlands,undefined
[6] Perth Western Australia,undefined
[7] 6009,undefined
[8] Australia,undefined
[9] Department of Mathematics and Statistics,undefined
[10] University of South Alabama Mobile,undefined
[11] AL 36688 USA,undefined
关键词
Keywords cluster validity; EM algorithm; generalized Dunn’s index; mixture decomposition; normal mixtures; Xie-Beni index;
D O I
10.1007/s005000050019
中图分类号
学科分类号
摘要
 We study indices for choosing the correct number of components in a mixture of normal distributions. Previous studies have been confined to indices based wholly on probabilistic models. Viewing mixture decomposition as probabilistic clustering (where the emphasis is on partitioning for geometric substructure) as opposed to parametric estimation enables us to introduce both fuzzy and crisp measures of cluster validity for this problem. We presume the underlying samples to be unlabeled, and use the expectation-maximization (EM) algorithm to find clusters in the data. We test 16 probabilistic, 3 fuzzy and 4 crisp indices on 12 data sets that are samples from bivariate normal mixtures having either 3 or 6 components. Over three run averages based on different initializations of EM, 10 of the 23 indices tested for choosing the right number of mixture components were correct in at least 9 of the 12 trials. Among these were the fuzzy index of Xie-Beni, the crisp Davies-Bouldin index, and two crisp indices that are recent generalizations of Dunn’s index.
引用
收藏
页码:166 / 179
页数:13
相关论文
empty
未找到相关数据