A comparative study of on-line pretest item-calibration/scaling methods in computerized adaptive testing

Cited by: 42
Authors
Ban, JC
Hanson, BA
Wang, TY
Yi, Q
Harris, DJ
Affiliations
[1] ACT Inc, Measurement Res Dept, Iowa City, IA 52243 USA
[2] CTB McGraw Hill, Monterey, CA 93940 USA
DOI
10.1111/j.1745-3984.2001.tb01123.x
Chinese Library Classification
G44 [Educational Psychology]
Subject classification codes
0402; 040202
Abstract
The purpose of this study was to compare and evaluate five on-line pretest item-calibration/scaling methods in computerized adaptive testing (CAT): marginal maximum likelihood estimate with one EM cycle (OEM), marginal maximum likelihood estimate with multiple EM cycles (MEM), Stocking's Method A, Stocking's Method B, and BILOG/Prior. The five methods were evaluated in terms of item-parameter recovery, using three different sample sizes (300, 1000 and 3000). The MEM method appeared to be the best choice among these, because it produced the smallest parameter-estimation errors for all sample size conditions. MEM and OEM are mathematically similar although the OEM method produced larger errors. MEM also was preferable to OEM, unless the amount of time involved in iterative computation is a concern. Stocking's Method B also worked very well, but it required anchor items that either would increase test lengths or require larger sample sizes depending on test administration design. Until more appropriate ways of handling sparse data are devised, the BILOG/Prior method may not be a reasonable choice for small sample sizes. Stocking's Method A had the largest weighted total error as well as a theoretical weakness (i.e., treating estimated ability as true ability); thus, there appeared to be little reason to use it.
Pages: 191-212
Page count: 22