Chance-corrected measures of reliability and validity in K x K tables

Cited by: 25
Authors
Andrés, AM [1]
Marzo, PF [1]
Affiliations
[1] Fac Med, Granada 1807, Spain
DOI
10.1191/0962280205sm412oa
Chinese Library Classification
R19 [Health care organization and services (health services administration)];
Abstract
When studying the degree of overall agreement between the nominal responses of two raters, it is customary to use the coefficient kappa. A more detailed analysis requires the evaluation of the degree of agreement category by category, and this is carried out in two different ways: using the value of kappa in the collapsed table for each category or using the agreement index for each category (proportion of agreements observed). Both indices have disadvantages: the former is sensitive to marginal totals; the latter is not chance corrected; and neither distinguishes the case where one of the two raters is a gold standard (an expert) from the case where neither rater is a gold standard. This article suggests five chance-corrected indices which are not sensitive to marginal totals and which differ depending on whether there is a standard rater. The article also justifies the reason for poor performance of kappa when the two marginal totals are unbalanced (especially if they are so in opposite directions) and the reason for its good performance when analysing the various 2 x 2 tables obtained by the collapse of a wider table.
Pages: 473-492
Number of pages: 20