Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches

被引:69
作者
Kopf, Julia [1 ]
Zeileis, Achim [2 ]
Strobl, Carolin [3 ]
机构
[1] Univ Munich, Dept Stat, D-80539 Munich, Germany
[2] Univ Innsbruck, A-6020 Innsbruck, Austria
[3] Univ Zurich, Zurich, Switzerland
关键词
Rasch model; differential item functioning (DIF); anchor selection; anchor class; uniform DIF; measurement invariance; LIKELIHOOD RATIO TEST; ITEM FUNCTIONING DETECTION; MANTEL-HAENSZEL PROCEDURES; GRADED RESPONSE MODEL; CHARACTERISTIC CURVES; LOGISTIC-REGRESSION; MEASUREMENT BIAS; MULTIPLE GROUPS; PARAMETERS; AREA;
D O I
10.1177/0013164414529792
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model, therefore, the same linear restriction is imposed in both groups. Items in the restriction are termed the ``anchor items''. Ideally, these items are DIF-free to avoid artificially augmented false alarm rates. However, the question how DIF-free anchor items are selected appropriately is still a major challenge. Furthermore, various authors point out the lack of new anchor selection strategies and the lack of a comprehensive study especially for dichotomous IRT models. This article reviews existing anchor selection strategies that do not require any knowledge prior to DIF analysis, offers a straightforward notation, and proposes three new anchor selection strategies. An extensive simulation study is conducted to compare the performance of the anchor selection strategies. The results show that an appropriate anchor selection is crucial for suitable item-wise DIF analysis. The newly suggested anchor selection strategies outperform the existing strategies and can reliably locate a suitable anchor when the sample sizes are large enough.
引用
收藏
页码:22 / 56
页数:35
相关论文
共 35 条
[1]   Identifying the causes of DIF in translated verbal items [J].
Allalouf, A ;
Hambleton, RK ;
Sireci, SG .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1999, 36 (03) :185-198
[2]  
[Anonymous], 2013, R: A language and environment for statistical computing
[3]   AN ITERATIVE PROCEDURE FOR LINKING METRICS AND ASSESSING ITEM BIAS IN ITEM RESPONSE THEORY [J].
CANDELL, GL ;
DRASGOW, F .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1988, 12 (03) :253-260
[4]   An investigation of the likelihood ratio test for detection of differential item functioning [J].
Cohen, AS ;
Kim, SH ;
Wollack, JA .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1996, 20 (01) :15-26
[5]   STUDY OF THE MEASUREMENT BIAS OF 2 STANDARDIZED PSYCHOLOGICAL-TESTS [J].
DRASGOW, F .
JOURNAL OF APPLIED PSYCHOLOGY, 1987, 72 (01) :19-29
[6]   Loss of information in estimating item parameters in incomplete designs [J].
Eggen, Theo J. H. M. ;
Verrelst, Norman D. .
PSYCHOMETRIKA, 2006, 71 (02) :303-322
[8]  
FISCHER GH, 1995, RASCH MODELS, P15
[9]   RIM: A Random Item Mixture Model to Detect Differential Item Functioning [J].
Frederickx, Sofie ;
Tuerlinckx, Francis ;
De Boeck, Paul ;
Magis, David .
JOURNAL OF EDUCATIONAL MEASUREMENT, 2010, 47 (04) :432-457
[10]  
GLAS CAW, 1995, RASCH MODELS, P69