Multivariate generalizability analysis of the impact, of training and examinee performance information on judgments made in an Angoff-style standard-setting procedure

被引：29

作者：

Clauser, BE

Swanson, DB

Harik, P

机构：

[1] Natl Board Med Examiners, Measuring Consulting Serv, Philadelphia, PA 19104 USA

[2] Natl Board Med Examiners, Test Dev, Philadelphia, PA 19104 USA

来源：

JOURNAL OF EDUCATIONAL MEASUREMENT | 2002年 / 39卷 / 04期

关键词：

D O I：

10.1111/j.1745-3984.2002.tb01143.x

中图分类号：

G44 [教育心理学];

学科分类号：

0402 ; 040202 ;

摘要：

Cut scores, estimated using the Angoff procedure, are routinely used to make high-stakes classification decisions based on examinee scores. Precision is necessary in estimation of cut scores because of the importance of these decisions. Although much has been written about how these procedures should be implemented, there is relatively little literature providing empirical support for specific approaches to providing training and feedback to standard-setting judges. This article presents a multivariate generalizability analysis designed to examine the impact of training and feedback on various sources of error in estimation of cut scores for a standard-setting procedure in which multiple independent groups completed the judgments. The results indicate that after training, there was little improvement in the ability of judges to rank order items by difficulty but there was a substantial improvement in inter-judge consistency in centering ratings. The results also show a substantial group effect. Consistent with this result, the direction of change for the estimated cut score was shown to be group dependent.

引用

页码：269 / 290

页数：22

共 24 条

[1]

[Anonymous], ED MEASUREMENT ISSUE

[2]

Brennan R. L., 1995, JOINT C STAND SETT L, VII, P269

[3]

Brennan R. L., 2001, GEN THEORY, DOI 10.1007/978-1-0716-1621-5_15

[4] An essay on the history and future of reliability from the perspective of replications [J].