Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Cited by: 40
Authors
Clauser, Brian E. [1]
Mee, Janet
Baldwin, Su G.
Margolis, Melissa J.
Dillon, Gerard F. [2]
Affiliations
[1] Natl Board Med Examiners, Measurement Consulting Serv, Philadelphia, PA 19104 USA
[2] Natl Board Med Examiners, USMLE, Philadelphia, PA 19104 USA
Keywords
CUTOFF SCORES; INFORMATION; JUDGMENTS; IMPACT;
DOI
10.1111/j.1745-3984.2009.00089.x
Chinese Library Classification number
G44 [Educational Psychology];
Discipline classification code
0402; 040202;
Abstract
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide performance data as a fatal flaw for the procedure; others have considered it appropriate for experts to integrate performance data into their judgments but have been concerned that experts may rely too heavily on the data. There have, however, been relatively few studies examining how experts use the data. This article reports on two studies that examine how experts modify their judgments after reviewing data. In both studies, data for some items were accurate and data for other items had been manipulated. Judges in both studies substantially modified their judgments whether the data were accurate or not.
Pages: 390-407
Page count: 18