Good reasons for high variability (low inter-rater reliability) in performance assessment: Toward a fuzzy logic model

被引:6
|
作者
Roth, Wolff-Michael [1 ,3 ]
Mavin, Timothy J. [2 ,3 ]
Munro, Ian [4 ]
机构
[1] Univ Victoria, Fac Educ, Victoria, BC V8W 3N4, Canada
[2] Griffith Univ, Sch Biomol & Phys Sci, Nathan, Qld 4111, Australia
[3] Griffith Univ, Griffith Inst Educ Res, Nathan, Qld 4111, Australia
[4] Mt Cook Airlines, Christchurch, New Zealand
关键词
Performance assessment; High-risk industry; Fuzzy logic model; Inter-rater reliability; Think-aloud protocol; Aviation; SKILLS; AUTOMATION; CONTEXT; WORK;
D O I
10.1016/j.ergon.2014.07.004
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Regular performance assessment is an integral part of (high-) risk industries. Past research shows, however, that in many fields, inter-rater reliabilities tend to be moderate to low. This study was designed to investigate the variability of performance assessment in a naturalistic setting in aviation. A modified think-aloud protocol was used as research design to investigate the reasoning pairs of pilots use to assess the performance of an airline captain in a high-risk situation. Standard protocol analysis and interaction analysis methods were employed in the analysis of transcribed verbal protocols. The analyses confirm high variability in performance assessement and reveal the good, albeit fuzzy, justifications that assessor pairs use to ground their assessments. A fuzzy logic model exhibits a good approximation between predicted and actual ratings. Implications for the practice of performance assessment are provided. Relevance to industry: Many industries aim at achieving consistency in identifying true performance levels. However, if the variability in performance assessment is a real phenomenon, as reported here, then practitioners and researchers might have to test whether it can be used positively, e.g., as opportunity for improving the resilience of crews. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:685 / 696
页数:12
相关论文
共 12 条
  • [1] Low Inter-Rater Reliability of a High Stakes Performance Assessment of Teacher Candidates
    Lyness, Scott A.
    Peterson, Kent
    Yates, Kenneth
    EDUCATION SCIENCES, 2021, 11 (10):
  • [2] More Evidence of Low Inter-Rater Reliability of a High-Stakes Performance Assessment of Teacher Candidates
    Lyness, Scott A.
    EDUCATION SCIENCES, 2024, 14 (03):
  • [3] Comparison of Inter-Rater Reliability Techniques in Performance-Based Assessment
    Mancar, Sinem Arslan
    Gulleroglu, H. Deniz
    INTERNATIONAL JOURNAL OF ASSESSMENT TOOLS IN EDUCATION, 2022, 9 (02): : 515 - 533
  • [4] Inter-rater and intra-rater reliability of the current assessment model and tools for laparoscopic suturing
    Wei, Chin-Hung
    Shen, Shih-Chiang
    Duh, Yih-Cherng
    Tsai, Kuei-Yen
    Chen, Hsin-An
    Huang, Shih-Wei
    SURGICAL ENDOSCOPY AND OTHER INTERVENTIONAL TECHNIQUES, 2022, 36 (09): : 6586 - 6591
  • [5] The inter-rater reliability of the Performance Oriented Mobility Assessment tool after brain surgery
    Galloway, Adam Marco
    Killan, Edward C.
    McHugh, Gretl A.
    INTERNATIONAL JOURNAL OF THERAPY AND REHABILITATION, 2019, 26 (12)
  • [6] Qualitative visual assessment of the J-sign demonstrates high inter-rater reliability
    Walla, Nicholas
    Moore, Toren
    Harangody, Sarah
    Fitzpatrick, Sean
    Flanigan, David C.
    Duerr, Robert A.
    Siston, Robert
    Magnussen, Robert A.
    JOURNAL OF ISAKOS JOINT DISORDERS & ORTHOPAEDIC SPORTS MEDICINE, 2023, 8 (06) : 420 - 424
  • [7] Inter-rater reliability of the Abbreviated Injury Scale scores in patients with severe head injury shows good inter-rater agreement but variability between countries. An inter-country comparison study
    Amy C. Gunning
    Menco J. S. Niemeyer
    Mark van Heijl
    Karlijn J. P. van Wessem
    Ronald V. Maier
    Zsolt J. Balogh
    Luke P. H. Leenen
    European Journal of Trauma and Emergency Surgery, 2023, 49 : 1183 - 1188
  • [8] Inter-rater reliability in performance status assessment among health care professionals: a systematic review
    Chow, Ronald
    Chiu, Nicholas
    Bruera, Eduardo
    Krishnan, Monica
    Chiu, Leonard
    Lam, Henry
    DeAngelis, Carlo
    Pulenzas, Natalie
    Vuong, Sherlyn
    Chow, Edward
    ANNALS OF PALLIATIVE MEDICINE, 2016, 5 (02) : 83 - 92
  • [9] Inter-rater reliability of the Abbreviated Injury Scale scores in patients with severe head injury shows good inter-rater agreement but variability between countries. An inter-country comparison study
    Gunning, Amy C.
    Niemeyer, Menco J. S.
    van Heijl, Mark
    van Wessem, Karlijn J. P.
    Maier, Ronald, V
    Balogh, Zsolt J.
    Leenen, Luke P. H.
    EUROPEAN JOURNAL OF TRAUMA AND EMERGENCY SURGERY, 2023, 49 (03) : 1183 - 1188
  • [10] Inter-rater reliability in performance status assessment among healthcare professionals: an updated systematic review and meta-analysis
    Chow, Ronald
    Bruera, Eduardo
    Temel, Jennifer S.
    Krishnan, Monica
    Im, James
    Lock, Michael
    SUPPORTIVE CARE IN CANCER, 2020, 28 (05) : 2071 - 2078