Comparison of two methods of standard setting: the performance of the three-level Angoff method

Cited by: 22
Authors
Jalili, Mohammad [1]
Hejri, Sara M.
Norcini, John J. [2]
Affiliations
[1] Univ Tehran Med Sci, Educ Dev Ctr, Dept Emergency Med, Ctr Educ Res Med Sci, Tehran 1413843941, Iran
[2] Fdn Adv Int Med Educ & Res, Philadelphia, PA USA
Keywords
OSCE; RELIABILITY; EDUCATION; SCORES
DOI
10.1111/j.1365-2923.2011.04073.x
Chinese Library Classification
G40 [Education]
Discipline classification codes
040101; 120403
Abstract
CONTEXT Cut-scores, reliability and validity vary among standard-setting methods. The modified Angoff method (MA) is a well-known standard-setting procedure, but the three-level Angoff approach (TLA), a recent modification, has not been extensively evaluated.
OBJECTIVES This study aimed to compare standards and pass rates in an objective structured clinical examination (OSCE) obtained using two methods of standard setting with discussion and reality checking, and to assess the reliability and validity of each method.
METHODS A sample of 105 medical students participated in a 14-station OSCE. Fourteen and 10 faculty members took part in the MA and TLA procedures, respectively. In the MA, judges estimated the probability that a borderline student would pass each station. In the TLA, judges estimated whether a borderline examinee would perform the task correctly or not. Having given individual ratings, judges discussed their decisions. One week after the examination, the procedure was repeated using normative data.
RESULTS The mean score for the total test was 54.11% (standard deviation: 8.80%). The MA cut-scores for the total test were 49.66% and 51.52% after discussion and reality checking, respectively (the consequent percentages of passing students were 65.7% and 58.1%, respectively). The TLA yielded mean pass scores of 53.92% and 63.09% after discussion and reality checking, respectively (rates of passing candidates were 44.8% and 12.4%, respectively). Compared with the TLA, the MA showed higher agreement between judges (0.94 versus 0.81) and a narrower 95% confidence interval in standards (3.22 versus 11.29).
CONCLUSIONS The MA seems a more credible and reliable procedure with which to set standards for an OSCE than does the TLA, especially when a reality check is applied.
Pages: 1199-1208
Page count: 10