Maintaining Content Validity in Computerized Adaptive Testing

被引:16
作者
Luecht, Richard M. [1 ]
de Champlain, Andre [1 ]
Nungester, Ronald J. [1 ]
机构
[1] Natl Board Med Examiners, Philadelphia, PA 19104 USA
关键词
computerized-adaptive testing; content validity; item response theory;
D O I
10.1023/A:1009789314011
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
A major advantage of using computerized adaptive testing (CAT) is improved measurement efficiency; better score reliability or mastery decisions can result from targeting item selections to the abilities of examinees. However, this type of engineering solution can result in differential content for different examinees at various levels of ability. This paper empirically demonstrates some of the trade-offs which can occur when content balancing is imposed in CAT forms or conversely, when it is ignored. That is, the content validity of a CAT form can actually change across a score scale when content balancing is ignored. On the other hand, efficiency and score precision can be severely reduced by over specifying content restrictions in a CAT form. The results from two simulation studies are presented as a means of highlighting some of the trade-offs that could occur between content and statistical considerations in CAT form assembly.
引用
收藏
页码:29 / 41
页数:13
相关论文
共 8 条
[1]  
Birnbaum A., 1968, STAT THEORIES MENTAL, P397
[2]  
Hambleton R.K., 1991, ADV ED PSYCHOL TESTI, P341, DOI [10.1007/978-94-009-2195-5_12, DOI 10.1007/978-94-009-2195-5_12]
[3]   A SAMPLING MODEL FOR VALIDITY [J].
KANE, MT .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1982, 6 (02) :125-160
[4]  
Kingsbury G. G., 1991, APPL MEAN EDUC, V4, P241
[5]  
Lord Frederic M, 2012, APPL ITEM RESPONSE T
[6]  
Morrison C.A., 1995, M NAT COUNC MEAS ED
[7]  
Thomasson G.L., 1995, M PSYCH SOC MINN MN
[8]  
Wainer H., 1990, COMPUTERIZED ADAPTIV