Comparing the Performance of Eight Item Preknowledge Detection Statistics

被引:33
作者
Belov, Dmitry I. [1 ]
机构
[1] Law Sch Admiss Council, Newtown, PA 18940 USA
关键词
test security; item preknowledge; hypothesis testing; Neyman-Pearson lemma; Kullback-Leibler divergence; ROC; person misfit; person fit; lz; KULLBACK-LEIBLER DIVERGENCE;
D O I
10.1177/0146621615603327
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Item preknowledge describes a situation in which a group of examinees (called aberrant examinees) have had access to some items (called compromised items) from an administered test prior to the exam. Item preknowledge negatively affects both the corresponding testing program and its users (e.g., universities, companies, government organizations) because scores for aberrant examinees are invalid. In general, item preknowledge is hard to detect due to multiple unknowns: unknown groups of aberrant examinees (at unknown test centers or schools) accessing unknown subsets of items prior to the exam. Recently, multiple statistical methods were developed to detect compromised items. However, the detected subset of items (called the suspicious subset) naturally has an uncertainty due to false positives and false negatives. The uncertainty increases when different groups of aberrant examinees had access to different subsets of items; thus, compromised items for one group are uncompromised for another group and vice versa. The impact of uncertainty on the performance of eight statistics (each relying on the suspicious subset) was studied. The measure of performance was based on the receiver operating characteristic curve. Computer simulations demonstrated how uncertainty combined with various independent variables (e.g., type of test, distribution of aberrant examinees) affected the performance of each statistic.
引用
收藏
页码:83 / 97
页数:15
相关论文
共 27 条
[1]  
[Anonymous], 2013, ANN M NAT COUNC MEAS
[2]  
[Anonymous], 1988, Signal Detection Theory and Psychophysics
[3]  
[Anonymous], 2012, Applications of item response theory to practical testing problems
[4]  
Armstrong R., 2007, Practical Assessment, Research Evaluation, V12
[5]  
Belov D.I., 2007, New Trends in Psychometrics, P7
[6]  
Belov D.I., 2014, Journal of Computerized Adaptive Testing, V2, P37, DOI [DOI 10.7333/1410-0203037, 10.7333/jcat.v2i0.36, DOI 10.7333/JCAT.V2I0.36]
[7]   Distributions of the Kullback-Leibler divergence with applications [J].
Belov, Dmitry I. ;
Armstrong, Ronald D. .
BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2011, 64 (02) :291-309
[8]   Automatic Detection of Answer Copying via Kullback-Leibler Divergence and K-Index [J].
Belov, Dmitry I. ;
Armstrong, Ronald D. .
APPLIED PSYCHOLOGICAL MEASUREMENT, 2010, 34 (06) :379-392
[9]   THE ASYMPTOTIC POSTERIOR NORMALITY OF THE LATENT TRAIT IN AN IRT MODEL [J].
CHANG, HH ;
STOUT, W .
PSYCHOMETRIKA, 1993, 58 (01) :37-52
[10]  
Choe E., 2014, ANN M NAT COUNC MEAS