Understanding the common interrater reliability measures

被引:8
作者
Lim, Sok Mui [1 ]
Palethorpe, Natasha [2 ]
Rodger, Sylvia [3 ]
机构
[1] Singapore Inst Technol, Acad Programmes Div, Singapore, Singapore
[2] Downer EDI Min, South Brisbane, Qld, Australia
[3] Univ Queensland, Sch Hlth & Rehabil Sci, Div Occupat Therapy, Brisbane, Qld, Australia
关键词
inter-rater reliability; consensus estimates; consistency estimates; measurement estimates;
D O I
10.12968/ijtr.2012.19.9.488
中图分类号
R49 [康复医学];
学科分类号
100215 ;
摘要
Health and rehabilitation professionals use a range of outcome instruments to evaluate the effectiveness of their interventions. In order to be evidenced-based practitioners, we need to understand the psychometric properties of these instruments and to be able to interpret the statistics used to test their psychometric properties. This paper focuses on inter-rater reliability. Different statistical methods for computing inter-rater reliability can be classified into one of three categories: consensus estimates, consistency estimates, and measurement estimates. The common statistical methods such as Kappa, intraclass correlation and Many-Facets Rasch Model are described along with the advantages and disadvantages of each approach. For each category of estimates, one paper has been chosen from the therapy and rehabilitation literature to illustrate the use of a number of commonly utilised inter-rater reliability measures. It is hoped that this overview will provide practitioners, students and/or new researchers with a ready reference of key measurements used for determining inter-rater reliability.
引用
收藏
页码:488 / 496
页数:9
相关论文
共 41 条
[1]  
[Anonymous], USING INTERPRETING S
[2]   How to read and critically appraise a reliability article [J].
Bialocerkowski, Andrea ;
Klupp, Nerida ;
Bragge, Peter .
INTERNATIONAL JOURNAL OF THERAPY AND REHABILITATION, 2010, 17 (03) :114-120
[3]   Measurement error and reliability testing: Application to rehabilitation [J].
Bialocerkowski, Andrea E. ;
Bragge, Peter .
INTERNATIONAL JOURNAL OF THERAPY AND REHABILITATION, 2008, 15 (10) :422-427
[4]   GRAPHICAL JUDGMENTAL AID WHICH SUMMARIZES OBTAINED AND CHANCE RELIABILITY DATA AND HELPS ASSESS THE BELIEVABILITY OF EXPERIMENTAL EFFECTS [J].
BIRKIMER, JC ;
BROWN, JH .
JOURNAL OF APPLIED BEHAVIOR ANALYSIS, 1979, 12 (04) :523-533
[5]   Observing washing and dressing of stroke patients: nursing intervention compared with occupational therapists. What is the difference? [J].
Booth, J ;
Davidson, I ;
Winstanley, J ;
Waters, K .
JOURNAL OF ADVANCED NURSING, 2001, 33 (01) :98-105
[6]  
Brennan, 2001, GEN THEORY
[7]  
Brown G., 2004, ASSESS WRIT, V9, P105, DOI [DOI 10.1016/J.ASW.2004.07.001, 10.1016/j.asw.2004.07.001]
[9]  
Consortium for Quality in Identification and Recruitment (ConQIR), 2008, LIT REV INT REL
[10]   A psychometric toolbox for testing validity and reliability [J].
DeVon, Holli A. ;
Block, Michelle E. ;
Moyle-Wright, Patricia ;
Ernst, Diane M. ;
Hayden, Susan J. ;
Lazzara, Deborah J. ;
Savoy, Suzanne M. ;
Kostas-Polston, Elizabeth .
JOURNAL OF NURSING SCHOLARSHIP, 2007, 39 (02) :155-164