Standard Setting Methods for Pass/Fail Decisions on High-Stakes Objective Structured Clinical Examinations: A Validity Study

Cited by: 20
Authors
Yousuf, Naveed [1]
Violato, Claudio [2]
Zuberi, Rukhsana W. [1]
Affiliations
[1] Aga Khan Univ, Dept Educ Dev, Karachi 74800, Pakistan
[2] Univ Ambrosiana, Dept Med Educ, Milan, Italy
Keywords
psychometrics; OSCE; clinical skills assessment; validity; standard setting; SMALL-SCALE OSCE; CLUSTER-ANALYSIS; PASSING SCORES; CREDENTIALING EXAMINATIONS; MEDICAL-EDUCATION; ANGOFF METHOD; DENTAL OSCE; PERFORMANCE; COMPETENCE; RELIABILITY
DOI
10.1080/10401334.2015.1044749
Chinese Library Classification (CLC) number
G40 [Education]
Subject classification code
040101; 120403
Abstract
Construct: Authentic standard setting methods will demonstrate high convergent validity evidence for their outcomes, that is, cutoff scores and pass/fail decisions, when compared with most other methods. Background: The objective structured clinical examination (OSCE) was established for valid, reliable, and objective assessment of clinical skills in health professions education. Various standard setting methods have been proposed to identify objective, reliable, and valid cutoff scores on OSCEs. These methods may identify different cutoff scores for the same examinations. Identification of valid and reliable cutoff scores for OSCEs remains an important issue and a challenge. Approach: Thirty OSCE stations administered at least twice in the years 2010-2012 to 393 medical students in Years 2 and 3 at Aga Khan University are included. Psychometric properties of the scores are determined. Cutoff scores and pass/fail decisions of the Wijnen, Cohen, Mean-1.5SD, Mean-1SD, Angoff, borderline group, and borderline regression (BL-R) methods are compared with each other and with three variants of cluster analysis using repeated measures analysis of variance and Cohen's kappa. Results: The mean psychometric indices on the 30 OSCE stations are reliability coefficient = 0.76 (SD = 0.12); standard error of measurement = 5.66 (SD = 1.38); coefficient of determination = 0.47 (SD = 0.19); and intergrade discrimination = 7.19 (SD = 1.89). The BL-R and Wijnen methods show the highest convergent validity evidence among the methods on the defined criteria. The Angoff and Mean-1.5SD methods demonstrated the least convergent validity evidence. The three cluster variants showed substantial convergent validity with the borderline methods. Conclusions: Although the Wijnen method showed a high level of convergent validity, it lacks the theoretical strength to be used for competency-based assessments. The BL-R method is found to show the highest convergent validity evidence for OSCEs with the other standard setting methods used in the present study. We also found that cluster analysis using the mean method can be used for quality assurance of borderline methods. These findings should be further confirmed by studies in other settings.
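As an illustration of the comparison described in the abstract, the following minimal sketch computes two of the norm-referenced cutoffs named there (Mean-1SD and Mean-1.5SD) and measures the agreement of their pass/fail decisions with Cohen's kappa. It is not the authors' analysis code. The function names, the use of the sample standard deviation, the rule that a score at or above the cutoff is a pass, and the station scores are all assumptions made for illustration.

```python
from statistics import mean, stdev


def norm_referenced_cutoff(scores, k):
    """Cutoff = cohort mean - k * SD (k = 1 for Mean-1SD, 1.5 for Mean-1.5SD).

    Assumption: the sample standard deviation is used; the abstract does
    not say which SD estimator the study applied.
    """
    return mean(scores) - k * stdev(scores)


def pass_fail(scores, cutoff):
    """Assumption: a score at or above the cutoff counts as a pass."""
    return [s >= cutoff for s in scores]


def cohens_kappa(a, b):
    """Cohen's kappa for two equal-length lists of binary decisions."""
    n = len(a)
    p_observed = sum(x == y for x, y in zip(a, b)) / n
    p_a = sum(a) / n  # proportion passed by method A
    p_b = sum(b) / n  # proportion passed by method B
    p_chance = p_a * p_b + (1 - p_a) * (1 - p_b)
    if p_chance == 1:
        # Both methods give the same single decision to everyone.
        return 1.0
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical station scores (percent), not data from the study.
scores = [38, 45, 52, 58, 61, 64, 67, 70, 74, 81]

cut_1sd = norm_referenced_cutoff(scores, 1.0)
cut_15sd = norm_referenced_cutoff(scores, 1.5)
kappa = cohens_kappa(pass_fail(scores, cut_1sd), pass_fail(scores, cut_15sd))
print(f"Mean-1SD cutoff: {cut_1sd:.2f}")
print(f"Mean-1.5SD cutoff: {cut_15sd:.2f}")
print(f"Cohen's kappa between decisions: {kappa:.2f}")
```

A kappa near 1 indicates that two methods classify nearly the same students as passing or failing, while a kappa near 0 indicates agreement no better than chance.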
Pages: 280-291
Page count: 12