A New Stopping Criterion for Rasch Trees Based on the Mantel-Haenszel Effect Size Measure for Differential Item Functioning

被引:12
作者
Henninger, Mirka [1 ]
Debelak, Rudolf [1 ]
Strobl, Carolin [1 ]
机构
[1] Univ Zurich, Binzmuehlestr 14,Box 27, CH-8050 Zurich, Switzerland
关键词
differential item functioning; effect size; item response theory; Mantel-Haenszel odds ratio; Rasch tree; LOGISTIC-REGRESSION; SCALE PURIFICATION; DETECTING DIF; ODDS-RATIO; TESTS; MODEL; FRAMEWORK;
D O I
10.1177/00131644221077135
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
To detect differential item functioning (DIF), Rasch trees search for optimal splitpoints in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF effects as significant in larger samples. This leads to larger trees, which split the sample into more subgroups. What would be more desirable is an approach that is driven more by effect size rather than sample size. In order to achieve this, we suggest to implement an additional stopping criterion: the popular Educational Testing Service (ETS) classification scheme based on the Mantel-Haenszel odds ratio. This criterion helps us to evaluate whether a split in a Rasch tree is based on a substantial or an ignorable difference in item parameters, and it allows the Rasch tree to stop growing when DIF between the identified subgroups is small. Furthermore, it supports identifying DIF items and quantifying DIF effect sizes in each split. Based on simulation results, we conclude that the Mantel-Haenszel effect size further reduces unnecessary splits in Rasch trees under the null hypothesis, or when the sample size is large but DIF effects are negligible. To make the stopping criterion easy-to-use for applied researchers, we have implemented the procedure in the statistical software R. Finally, we discuss how DIF effects between different nodes in a Rasch tree can be interpreted and emphasize the importance of purification strategies for the Mantel-Haenszel procedure on tree stopping and DIF item classification.
引用
收藏
页码:181 / 212
页数:32
相关论文
共 60 条
[12]   Iterative purification and effect size use with logistic regression for differential item functioning detection [J].
French, Brian F. ;
Maller, Susan J. .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2007, 67 (03) :373-393
[13]  
GLAS CAW, 1995, RASCH MODELS, P69
[14]   Type I Error and Statistical Power of the Mantel-Haenszel Procedure for Detecting DIF: A Meta-Analysis [J].
Guilera, Georgina ;
Gomez-Benito, Juana ;
Dolores Hidalgo, Maria ;
Sanchez-Meca, Julio .
PSYCHOLOGICAL METHODS, 2013, 18 (04) :553-571
[15]   Test purification and the evaluation of differential item functioning with multinomial logistic regression [J].
Hidalgo-Montesinos, MD ;
Gómez-Benito, J .
EUROPEAN JOURNAL OF PSYCHOLOGICAL ASSESSMENT, 2003, 19 (01) :1-11
[16]  
Holland P.W., 1985, ETS Research Report Series, V1985, P1, DOI DOI 10.1002/J.2330-8516.1985.TB00128.X
[17]  
Holland P W., 1986, ETS Research Report Series, pi, DOI [10.1002/j.2330-8516.1986.tb00186.x, DOI 10.1002/J.2330-8516.1986.TB00186.X]
[18]  
Hothorn T, 2015, J MACH LEARN RES, V16, P3905
[19]  
Khalid M.N., 2011, INT ONLINE J ED SCI, V3, P435
[20]   DETECTING EXPERIMENTALLY INDUCED ITEM BIAS USING THE ITERATIVE LOGIT METHOD [J].
KOK, FG ;
MELLENBERGH, GJ ;
VANDERFLIER, H .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1985, 22 (04) :295-303