Classification Consistency and Accuracy for Mixed-Format Tests

Cited by: 7
Authors
Kim, Stella Y. [1]
Lee, Won-Chan [2]
Affiliations
[1] Univ N Carolina, Educ Leadership, Charlotte, NC 28223 USA
[2] Univ Iowa, Psychol & Quantitat Fdn, Iowa City, IA 52242 USA
Keywords
MULTIPLE-CHOICE; TRUE-SCORE; CONSTRUCTED-RESPONSE; COMPLEX ASSESSMENTS;
DOI
10.1080/08957347.2019.1577246
Chinese Library Classification number
G40 [Education];
Discipline classification code
040101 ; 120403 ;
Abstract
This study explores classification consistency and accuracy for mixed-format tests using real and simulated data. In particular, the current study compares six methods of estimating classification consistency and accuracy for seven mixed-format tests. The relative performance of the estimation methods is evaluated using simulated data. Study results from real data analysis showed that the procedures exhibited similar patterns across various exams, but some tended to produce lower estimates of classification consistency and accuracy than others. As data became more multidimensional, unidimensional and multidimensional item response theory (IRT) methods tended to produce different results, with the unidimensional approach yielding lower estimates than the multidimensional approach. Results from simulated data analysis demonstrated smaller estimation error for the multidimensional IRT methods than for the unidimensional IRT method. The unidimensional approach yielded larger error as tests became more multidimensional, whereas a reverse relationship was observed for the multidimensional IRT approach. Among the non-IRT approaches, the normal approximation and Livingston-Lewis methods performed well, whereas the compound multinomial method tended to produce relatively larger error.
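As a rough illustration of the quantities being estimated, classification consistency is commonly summarized by the proportion of examinees who would receive the same classification on two parallel administrations, often alongside a chance-corrected kappa coefficient. The sketch below computes both from a joint classification table. The function name `consistency_indices` and the 2x2 counts are hypothetical and used only for illustration; this is not one of the six estimation methods compared in the study, which estimate such a table from a single administration.

```python
def consistency_indices(table):
    """Return (agreement, kappa) for a square joint classification table.

    table[i][j] holds the count (or proportion) of examinees placed in
    category i on one form and category j on a parallel form.
    """
    total = sum(sum(row) for row in table)
    k = len(table)
    # Convert counts to joint proportions.
    p = [[cell / total for cell in row] for row in table]
    # Agreement: proportion classified identically on both forms.
    agreement = sum(p[i][i] for i in range(k))
    # Chance agreement from the product of the marginal proportions.
    row_marg = [sum(p[i]) for i in range(k)]
    col_marg = [sum(p[i][j] for i in range(k)) for j in range(k)]
    chance = sum(r * c for r, c in zip(row_marg, col_marg))
    kappa = (agreement - chance) / (1 - chance)
    return agreement, kappa


# Hypothetical pass/fail counts on two parallel forms (illustrative only).
table = [[40, 10],
         [5, 45]]
P, kappa = consistency_indices(table)
print(f"P = {P:.2f}, kappa = {kappa:.2f}")  # P = 0.85, kappa = 0.70
```

Classification accuracy can be summarized in the same way if one dimension of the table is replaced by classifications based on true scores, which are unobserved in practice and must be estimated under a measurement model.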
Pages: 97-115
Number of pages: 19