Testing the Difference of Correlated Agreement Coefficients for Statistical Significance

被引:90
作者
Gwet, Kilem L. [1 ]
机构
[1] Adv Analyt LLC, POB 2696, Gaithersburg, MD 20886 USA
关键词
testing correlated kappas; Gwet's AC(1); agreement coefficients; kappa significance test; correlated agreement coefficients; raters' agreement; correlated kappas; INTERRATER RELIABILITY; KAPPA; VARIANCE; RATERS;
D O I
10.1177/0013164415596420
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
This article addresses the problem of testing the difference between two correlated agreement coefficients for statistical significance. A number of authors have proposed methods for testing the difference between two correlated kappa coefficients, which require either the use of resampling methods or the use of advanced statistical modeling techniques. In this article, we propose a technique similar to the classical pairwise t test for means, which is based on a large-sample linear approximation of the agreement coefficient. We illustrate the use of this technique with several known agreement coefficients including Cohen's kappa, Gwet's AC(1), Fleiss's generalized kappa, Conger's generalized kappa, Krippendorff's alpha, and the Brenann-Prediger coefficient. The proposed method is very flexible, can accommodate several types of correlation structures between coefficients, and requires neither advanced statistical modeling skills nor considerable computer programming experience. The validity of this method is tested with a Monte Carlo simulation.
引用
收藏
页码:609 / 637
页数:29
相关论文
共 22 条
[1]  
[Anonymous], 2014, HDB INTERRATER RELIA
[2]   USING REPLICATE OBSERVATIONS IN OBSERVER AGREEMENT STUDIES WITH BINARY ASSESSMENTS [J].
BAKER, SG ;
FREEDMAN, LS ;
PARMAR, MKB .
BIOMETRICS, 1991, 47 (04) :1327-1338
[3]   Weighted least-squares approach for comparing correlated kappa [J].
Barnhart, HX ;
Williamson, JM .
BIOMETRICS, 2002, 58 (04) :1012-1019
[4]  
Bennett FM, 1954, PUBLIC OPIN QUART, V18, P303
[5]   COEFFICIENT KAPPA - SOME USES, MISUSES, AND ALTERNATIVES [J].
BRENNAN, RL ;
PREDIGER, DJ .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1981, 41 (03) :687-699
[6]   HIGH AGREEMENT BUT LOW KAPPA .2. RESOLVING THE PARADOXES [J].
CICCHETTI, DV ;
FEINSTEIN, AR .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1990, 43 (06) :551-558
[8]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[9]   INTEGRATION AND GENERALIZATION OF KAPPAS FOR MULTIPLE RATERS [J].
CONGER, AJ .
PSYCHOLOGICAL BULLETIN, 1980, 88 (02) :322-328
[10]  
Cousineau D., 2015, EDUC PSYCHOL MEAS, V88, P1