An Overview of Interrater Agreement on Likert Scales for Researchers and Practitioners

被引:69
作者
O'Neill, Thomas A. [1 ]
机构
[1] Univ Calgary, Dept Psychol, Individual & Team Performance Lab, Calgary, AB, Canada
来源
FRONTIERS IN PSYCHOLOGY | 2017年 / 8卷
关键词
interrater agreement; rwg; multilevel methods; data aggregation; within-group agreement; reliability; WITHIN-GROUP AGREEMENT; MULTILEVEL RESEARCH; RELIABILITY; R(WG); INDEXES; COEFFICIENTS; PERSONALITY; EXPLORATION; PERFORMANCE; QUESTIONS;
D O I
10.3389/fpsyg.2017.00777
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Applications of interrater agreement (IRA) statistics for Likert scales are plentiful in research and practice. IRA may be implicated in job analysis, performance appraisal, panel interviews, and any other approach to gathering systematic observations. Any rating system involving subject-matter experts can also benefit from IRA as a measure of consensus. Further, IRA is fundamental to aggregation in multilevel research, which is becoming increasingly common in order to address nesting. Although, several technical descriptions of a few specific IRA statistics exist, this paper aims to provide a tractable orientation to common IRA indices to support application. The introductory overview is written with the intent of facilitating contrasts among IRA statistics by critically reviewing equations, interpretations, strengths, and weaknesses. Statistics considered include r(wg), r*(wg), r'(wg), r(wg(p)), average deviation (AD), a(wg), standard deviation (S-wg), and the coefficient of variation (CVwg). Equations support quick calculation and contrasting of different agreement indices. The article also includes a "quick reference" table and three figures in order to help readers identify how IRA statistics differ and how interpretations of IRA will depend strongly on the statistic employed. A brief consideration of recommended practices involving statistical and practical cutoff standards is presented, and conclusions are offered in light of the current literature.
引用
收藏
页数:15
相关论文
共 64 条
[1]   Assessing the impact of nonresponse on work grroup diversity effects [J].
Allen, Natalie J. ;
Stanley, David J. ;
Williams, Helen M. ;
Ross, Sarah J. .
ORGANIZATIONAL RESEARCH METHODS, 2007, 10 (02) :262-286
[2]   MEASURES OF INEQUALITY [J].
ALLISON, PD .
AMERICAN SOCIOLOGICAL REVIEW, 1978, 43 (06) :865-880
[3]   On the Use of the Coefficient of Variation as a Measure of Diversity [J].
Bedeian, Arthur G. ;
Mossholder, Kevin W. .
ORGANIZATIONAL RESEARCH METHODS, 2000, 3 (03) :285-297
[4]   EFFECTS OF RATER TRAINING AND DIARY-KEEPING ON PSYCHOMETRIC ERROR IN RATINGS [J].
BERNARDIN, HJ ;
WALTER, CS .
JOURNAL OF APPLIED PSYCHOLOGY, 1977, 62 (01) :64-69
[5]   Within-group agreement: On the use (and misuse) of rWG and rWG(J) in leadership research and some best practice guidelines [J].
Biemann, Torsten ;
Cole, Michael S. ;
Voelpel, Sven .
LEADERSHIP QUARTERLY, 2012, 23 (01) :66-80
[6]  
Bliese P., 2009, MULTILEVEL MODELING
[7]  
Bliese P. D., 2000, USING RANDOM GROUP R
[8]   Using Random Group Resampling in multilevel research - An example of the buffering effects of leadership climate [J].
Bliese, PD ;
Halverson, RR .
LEADERSHIP QUARTERLY, 2002, 13 (01) :53-68
[9]   Interrater agreement reconsidered:: An alternative to the rwg indices [J].
Brown, RD ;
Hauenstein, NMA .
ORGANIZATIONAL RESEARCH METHODS, 2005, 8 (02) :165-184
[10]  
Brutus S., 1998, J MANAG DEV, V17, P177, DOI [10.1108/EUM0000000004487, DOI 10.1108/EUM0000000004487]