On similarity indices and correction for chance agreement

被引:100
作者
Albatineh, Ahmed N. [1 ]
Niewiadomska-Bugaj, Magdalena
Mihalko, Daniel
机构
[1] Nova SE Univ, Div Mat Sci & Technol, Ft Lauderdale, FL 33314 USA
[2] Western Michigan Univ, Dept Stat, Kalamazoo, MI 49008 USA
关键词
similarity indices; equivalence of similarity indices; correction for chance agreement; comparison of clusterings; Cohen's kappa;
D O I
10.1007/s00357-006-0017-z
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Similarity indices can be used to compare partitions (clusterings) of a data set. Many such indices were introduced in the literature over the years. We are showing that out of 28 indices we were able to track, there are 22 different ones. Even though their values differ for the same clusterings compared, after correcting for agreement attributed to chance only, their values become similar and some of them even become equivalent. Consequently, the problem of choice of the index to be used for comparing different clusterings becomes less important.
引用
收藏
页码:301 / 313
页数:13
相关论文
共 33 条