The Langer-Improved Wald Test for DIF Testing With Multiple Groups: Evaluation and Comparison to Two-Group IRT

Cited by: 123
Authors
Woods, Carol M. [1]
Cai, Li [2]
Wang, Mian [1]
Affiliations
[1] Univ Kansas, Lawrence, KS 66045 USA
[2] Univ Calif Los Angeles, Los Angeles, CA USA
Keywords
differential item functioning; item response theory; Wald test; IRT-LR-DIF; likelihood ratio test; anchor item methods; chi-square; parameters; EM; SEM
DOI
10.1177/0013164412464875
Chinese Library Classification number
G44 [Educational Psychology];
Discipline classification code
0402 ; 040202 ;
Abstract
Differential item functioning (DIF) occurs when the probability of responding in a particular category to an item differs for members of different groups who are matched on the construct being measured. The identification of DIF is important for valid measurement. This research evaluates an improved version of Lord's χ² Wald test for comparing item response model parameter estimates between two groups. The improved version uses better approaches for computation of the covariance matrix and equating the item parameters across groups. There are two equating algorithms implemented in IRTPro and flexMIRT software: Wald-1 (one-stage) and Wald-2 (two-stage), only one of which has been studied in simulations before. The present study evaluates for the first time the Wald-1 algorithm and Wald-1 and Wald-2 for three groups simultaneously. A comparison to two-group IRT-LR-DIF is included. Results indicate that Wald-1 performs very well and is recommended, whereas Type I error is extremely inflated for Wald-2. Performance of IRT-LR-DIF and Wald-1 was similar, even for three groups.
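As a rough illustration of the kind of statistic the abstract refers to, the classic two-group Wald form compares an item's parameter estimates across groups through the quadratic form χ² = d′ (Σ_ref + Σ_foc)⁻¹ d, where d is the difference between the two groups' estimates. The sketch below is not the IRTPro or flexMIRT implementation, and it does not reproduce the improved covariance computation or the Wald-1 and Wald-2 equating steps. The function name `lord_wald_2pl` is hypothetical, and the sketch assumes a two-parameter item (a, b), estimates already placed on a common metric, and independent group samples.

```python
import math


def lord_wald_2pl(est_ref, cov_ref, est_foc, cov_foc):
    """Wald chi-square (df = 2) for one two-parameter item in two groups.

    est_*: (a, b) estimates for the reference and focal groups.
    cov_*: 2x2 covariance matrices of those estimates (nested sequences).
    Assumes the estimates are on a common metric and the samples are
    independent, so the covariance of the difference is the sum.
    """
    # Difference between the groups' parameter estimates.
    d0 = est_ref[0] - est_foc[0]
    d1 = est_ref[1] - est_foc[1]

    # Covariance matrix of the difference (symmetric 2x2).
    s00 = cov_ref[0][0] + cov_foc[0][0]
    s01 = cov_ref[0][1] + cov_foc[0][1]
    s11 = cov_ref[1][1] + cov_foc[1][1]

    det = s00 * s11 - s01 * s01
    if det <= 0:
        raise ValueError("summed covariance matrix is not positive definite")

    # Quadratic form d' S^{-1} d, using the closed-form 2x2 inverse.
    chi2 = (s11 * d0 * d0 - 2.0 * s01 * d0 * d1 + s00 * d1 * d1) / det

    # For 2 degrees of freedom the chi-square upper tail is exp(-x / 2).
    p_value = math.exp(-chi2 / 2.0)
    return chi2, p_value


if __name__ == "__main__":
    # Made-up numbers for illustration only; not results from the study.
    cov = [[0.01, 0.0], [0.0, 0.01]]
    chi2, p = lord_wald_2pl((1.2, 0.0), cov, (1.0, 0.5), cov)
    print(f"chi2 = {chi2:.3f}, p = {p:.5f}")
```

A large statistic relative to the χ² reference distribution flags the item as showing DIF. With three groups, which the study also examines, the comparison would involve more than one contrast per parameter; that extension is not sketched here.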
Pages: 532-547
Page count: 16