Development of automated scoring algorithms for complex performance assessments: A comparison of two approaches

被引:46
作者
Clauser, BE
Margolis, MJ
Clyman, SG
Ross, LP
机构
[1] National Board of Medical Examiners, Philadelphia, PA 19104
[2] University of Massachusetts, Amherst, MA
[3] University of California, San Francisco, CA
关键词
D O I
10.1111/j.1745-3984.1997.tb00511.x
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Performance assessments are typically scored by having experts rate individual performances. The cast associated with using expert raters may represent a serious limitation in many large-scale testing programs. The use of raters may also introduce an additional source of error into the assessment. These limitations have motivated development of automated scoring systems for performance assessments. Preliminary research has shown these systems to have application across a variety of tasks ranging from simple mathematics to architectural problem solving. This study extends research on automated scoring by comparing alternative automated systems for scoring a computer simulation test of physicians' patient management skills; one system uses regression-derived weights for components of the performance, the other uses complex rules to map performances into score levels. The procedures are evaluated by comparing the resulting scores to expert ratings of the same performances.
引用
收藏
页码:141 / 161
页数:21
相关论文
共 22 条
[1]  
[Anonymous], 1995, EDUC RES-UK, DOI [10.3102/0013189X024005005, DOI 10.3102/0013189X024005005, DOI 10.3102/2F0013189X024005005]
[2]   EVALUATION OF PROCEDURE-BASED SCORING FOR HANDS-ON SCIENCE ASSESSMENT [J].
BAXTER, GP ;
SHAVELSON, RJ ;
GOLDMAN, SR ;
PINE, J .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1992, 29 (01) :1-17
[3]  
BEJAR II, 1991, J APPL PSYCHOL, V76, P522
[4]   The accuracy of expert-system diagnoses of mathematical problem solutions [J].
Bennett, RE ;
Sebrechts, MM .
APPLIED MEASUREMENT IN EDUCATION, 1996, 9 (02) :133-150
[5]  
BRAUN HI, 1990, J EDUC MEAS, V27, P23
[6]  
Brehmer A, 1988, Advances in psychology, V54, P75, DOI DOI 10.1016/S0166-4115(08)62171-8
[7]   The generalizability of scores from a performance assessment of physicians' patient management skills [J].
Clauser, BE ;
Swanson, DB ;
Clyman, SG .
ACADEMIC MEDICINE, 1996, 71 (10) :S109-S111
[8]   Scoring a performance-based assessment by modeling the judgments of experts [J].
Clauser, BE ;
Subhiyah, RG ;
Nungester, RJ ;
Ripkey, DR ;
Clyman, SG ;
McKinley, D .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1995, 32 (04) :397-415
[9]  
CLAUSER BE, IN PRESS APPL MEASUR
[10]  
CLYMAN SG, 1995, ASSESSING CLIN REASO, P139