The Langer-Improved Wald Test for DIF Testing With Multiple Groups: Evaluation and Comparison to Two-Group IRT

被引:115
作者
Woods, Carol M. [1 ]
Cai, Li [2 ]
Wang, Mian [1 ]
机构
[1] Univ Kansas, Lawrence, KS 66045 USA
[2] Univ Calif Los Angeles, Los Angeles, CA USA
关键词
differential item functioning; item response theory; Wald test; IRT-LR-DIF; LIKELIHOOD RATIO TEST; ANCHOR ITEM METHODS; CHI-SQUARE; PARAMETERS; EM; SEM;
D O I
10.1177/0013164412464875
中图分类号
G44 [教育心理学];
学科分类号
0402 ; 040202 ;
摘要
Differential item functioning (DIF) occurs when the probability of responding in a particular category to an item differs for members of different groups who are matched on the construct being measured. The identification of DIF is important for valid measurement. This research evaluates an improved version of Lord's chi(2) Wald test for comparing item response model parameter estimates between two groups. The improved version uses better approaches for computation of the covariance matrix and equating the item parameters across groups. There are two equating algorithms implemented in IRTPro and flexMIRT software: Wald-1 (one-stage) and Wald-2 (two-stage), only one of which has been studied in simulations before. The present study evaluates for the first time the Wald-1 algorithm and Wald-1 and Wald-2 for three groups simultaneously. A comparison to two-group IRT-LR-DIF is included. Results indicate that Wald-1 performs very well and is recommended, whereas Type I error is extremely inflated for Wald-2. Performance of IRT-LR-DIF and Wald-1 was similar, even for three groups.
引用
收藏
页码:532 / 547
页数:16
相关论文
共 42 条