An Analytic Comparison of Effect Sizes for Differential Item Functioning

被引:27
作者
DeMars, Christine E. [1 ]
机构
[1] James Madison Univ, Ctr Assessment & Res Studies, Harrisonburg, VA 22807 USA
关键词
LOGISTIC-REGRESSION; I ERROR; STANDARDIZATION APPROACH; DIF DETECTION; PERFORMANCE; SIBTEST;
D O I
10.1080/08957347.2011.580255
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Three types of effects sizes for DIF are described in this exposition: log of the odds-ratio (differences in log-odds), differences in probability-correct, and proportion of variance accounted for. Using these indices involves conceptualizing the degree of DIF in different ways. This integrative review discusses how these measures are impacted in different ways by item difficulty, item discrimination, and item lower asymptote. For example, for a fixed discrimination, the difference in probabilities decreases as the difference between the item difficulty and the mean ability increases. Under the same conditions, the log of the odds-ratio remains constant if the lower asymptote is zero. A non-zero lower asymptote decreases the absolute value of the probability difference symmetrically for easy and hard items, but it decreases the absolute value of the log-odds difference much more for difficult items. Thus, one cannot set a criterion for defining a large effect size in one metric and find a corresponding criterion in another metric that is equivalent across all items or ability distributions. In choosing an effect size, these differences must be understood and considered.
引用
收藏
页码:189 / 209
页数:21
相关论文
共 29 条
[1]  
[Anonymous], 1988, Test validity
[2]  
Camilli G., 1994, METHODS IDENTIFYING
[3]  
Donoghue J.R., 1993, DIFFERENTIAL ITEM FU, P137
[4]  
Dorans N.J., 1989, Applied Measurement in Education, V2, P217, DOI DOI 10.1207/S15324818AME0203_3
[5]  
Dorans N.J., 1993, DIFFERENTIAL ITEM FU, P35, DOI [DOI 10.1002/J.2333-8504.1992.TB01440.X, 10.1002/j.2333-8504.1992.tb01440.x]
[6]   DEMONSTRATING THE UTILITY OF THE STANDARDIZATION APPROACH TO ASSESSING UNEXPECTED DIFFERENTIAL ITEM PERFORMANCE ON THE SCHOLASTIC APTITUDE-TEST [J].
DORANS, NJ ;
KULICK, E .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1986, 23 (04) :355-368
[7]   Iterative purification and effect size use with logistic regression for differential item functioning detection [J].
French, Brian F. ;
Maller, Susan J. .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2007, 67 (03) :373-393
[8]   Differential item functioning detection and effect size:: A comparison between logistic regression and Mantel-Haenszel procedures [J].
Hidalgo, MD ;
López-Pina, JA .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2004, 64 (06) :903-915
[9]   Improved type I error control and reduced estimation bias for DIF detection using SIBTEST [J].
Jiang, H ;
Stout, W .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 1998, 23 (04) :291-322
[10]   Evaluating type I error and power rates using an effect size measure with the logistic regression procedure for DIF detection [J].
Jodoin, MG ;
Gierl, MJ .
APPLIED MEASUREMENT IN EDUCATION, 2001, 14 (04) :329-349