Understanding the common interrater reliability measures

被引：8

作者：

Lim, Sok Mui ^{[1
]}

Palethorpe, Natasha ^{[2
]}

Rodger, Sylvia ^{[3
]}

机构：

[1] Singapore Inst Technol, Acad Programmes Div, Singapore, Singapore

[2] Downer EDI Min, South Brisbane, Qld, Australia

[3] Univ Queensland, Sch Hlth & Rehabil Sci, Div Occupat Therapy, Brisbane, Qld, Australia

来源：

INTERNATIONAL JOURNAL OF THERAPY AND REHABILITATION | 2012年 / 19卷 / 09期

关键词：

inter-rater reliability; consensus estimates; consistency estimates; measurement estimates;

D O I：

10.12968/ijtr.2012.19.9.488

中图分类号：

R49 [康复医学];

学科分类号：

100215 ;

摘要：

Health and rehabilitation professionals use a range of outcome instruments to evaluate the effectiveness of their interventions. In order to be evidenced-based practitioners, we need to understand the psychometric properties of these instruments and to be able to interpret the statistics used to test their psychometric properties. This paper focuses on inter-rater reliability. Different statistical methods for computing inter-rater reliability can be classified into one of three categories: consensus estimates, consistency estimates, and measurement estimates. The common statistical methods such as Kappa, intraclass correlation and Many-Facets Rasch Model are described along with the advantages and disadvantages of each approach. For each category of estimates, one paper has been chosen from the therapy and rehabilitation literature to illustrate the use of a number of commonly utilised inter-rater reliability measures. It is hoped that this overview will provide practitioners, students and/or new researchers with a ready reference of key measurements used for determining inter-rater reliability.

引用

页码：488 / 496

页数：9

共 41 条

[1]

[Anonymous], USING INTERPRETING S

[2] How to read and critically appraise a reliability article [J].