Good reasons for high variability (low inter-rater reliability) in performance assessment: Toward a fuzzy logic model

被引：6

作者：

Roth, Wolff-Michael ^{[1
,3
]}

Mavin, Timothy J. ^{[2
,3
]}

Munro, Ian ^{[4
]}

机构：

[1] Univ Victoria, Fac Educ, Victoria, BC V8W 3N4, Canada

[2] Griffith Univ, Sch Biomol & Phys Sci, Nathan, Qld 4111, Australia

[3] Griffith Univ, Griffith Inst Educ Res, Nathan, Qld 4111, Australia

[4] Mt Cook Airlines, Christchurch, New Zealand

来源：

INTERNATIONAL JOURNAL OF INDUSTRIAL ERGONOMICS | 2014年 / 44卷 / 05期

关键词：

Performance assessment; High-risk industry; Fuzzy logic model; Inter-rater reliability; Think-aloud protocol; Aviation; SKILLS; AUTOMATION; CONTEXT; WORK;

D O I：

10.1016/j.ergon.2014.07.004

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Regular performance assessment is an integral part of (high-) risk industries. Past research shows, however, that in many fields, inter-rater reliabilities tend to be moderate to low. This study was designed to investigate the variability of performance assessment in a naturalistic setting in aviation. A modified think-aloud protocol was used as research design to investigate the reasoning pairs of pilots use to assess the performance of an airline captain in a high-risk situation. Standard protocol analysis and interaction analysis methods were employed in the analysis of transcribed verbal protocols. The analyses confirm high variability in performance assessement and reveal the good, albeit fuzzy, justifications that assessor pairs use to ground their assessments. A fuzzy logic model exhibits a good approximation between predicted and actual ratings. Implications for the practice of performance assessment are provided. Relevance to industry: Many industries aim at achieving consistency in identifying true performance levels. However, if the variability in performance assessment is a real phenomenon, as reported here, then practitioners and researchers might have to test whether it can be used positively, e.g., as opportunity for improving the resilience of crews. (C) 2014 Elsevier B.V. All rights reserved.

引用

页码：685 / 696

页数：12

共 12 条

[11] Inter-rater reliability in performance status assessment among healthcare professionals: an updated systematic review and meta-analysis [J].

Chow, Ronald ;

Bruera, Eduardo ;

Temel, Jennifer S. ;

Krishnan, Monica ;

Im, James ;

Lock, Michael .

SUPPORTIVE CARE IN CANCER, 2020, 28 (05) :2071-2078

[12] Inter-rater reliability and validity of supervision performance assessment and recognition (SPARS) indicators to assess medicines management in public health facilities in Nepal [J].

Adhikari, Santusta ;

Bastola, Anup ;

Shrestha, Reekesh ;

Khanal, Narendra Kumar ;

Trap, Birna .

JOURNAL OF PHARMACEUTICAL POLICY AND PRACTICE, 2025, 18 (01)

← 1 2 →