Likelihood-ratio DIF testing: Effects of nonnormality

被引:23
作者
Woods, Carol M. [1 ]
机构
[1] Washington Univ, Dept Psychol, St Louis, MO 63130 USA
关键词
differential item functioning; LR-DIF; IRT-LR-DIF; item response theory; item bias; measurement invariance;
D O I
10.1177/0146621607310402
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Differential item functioning (DIF) occurs when an item has different measurement properties for members of one group versus another. Likelihood-ratio (LR) tests for DIF based on item response theory (IRT) involve statistically comparing IRT models that vary with respect to their constraints. A simulation study evaluated how violation of the normality assumption about the random latent variable for one or both groups affected IRT-LR-DIF results. Item response data with or without DIF were generated from the two-parameter logistic model and fitted Under the assumption that the latent distribution was normal for both groups. Although the IRT-LR-DIF method performed well when latent distributions were normal for both groups, results were distorted when the distribution was skewed for one or both groups. Specifically, Type I error was inflated, differences between reference- and focal-group item parameter estimates were inaccurate, and group differences in the mean and variance of the latent distribution were overestimated.
引用
收藏
页码:511 / 526
页数:16
相关论文
共 52 条
[1]   An investigation of the power of the likelihood ratio goodness-of-fit statistic in detecting differential item functioning [J].
Ankenmann, RD ;
Witt, EA ;
Dunbar, SB .
JOURNAL OF EDUCATIONAL MEASUREMENT, 1999, 36 (04) :277-300
[2]  
[Anonymous], 2001, IRTLRDIF V 2 0B SOFT
[3]  
[Anonymous], 2006, MPLUS STAT ANAL LATE
[4]   An item response theory analysis of DSM-IV personality disorder criteria across younger and older age groups [J].
Balsis, Steve ;
Gleason, Marci E. J. ;
Woods, Carol M. ;
Oltmanns, Thomas F. .
PSYCHOLOGY AND AGING, 2007, 22 (01) :171-185
[5]   Gender differences by item difficulty interactions in multiple-choice mathematics items [J].
Bielinski, J ;
Davison, ML .
AMERICAN EDUCATIONAL RESEARCH JOURNAL, 1998, 35 (03) :455-476
[6]  
Birnbaum A., 1968, Statistical theories of mental test scores
[7]  
BOCK RD, 1970, PSYCHOMETRIKA, V35, P179
[8]   MARGINAL MAXIMUM-LIKELIHOOD ESTIMATION OF ITEM PARAMETERS - APPLICATION OF AN EM ALGORITHM [J].
BOCK, RD ;
AITKIN, M .
PSYCHOMETRIKA, 1981, 46 (04) :443-459
[9]   Different kinds of DIF: A distinction between absolute and relative forms of measurement invariance and bias [J].
Borsboom, D ;
Mellenbergh, GJ ;
van Heerden, J .
APPLIED PSYCHOLOGICAL MEASUREMENT, 2002, 26 (04) :433-450
[10]   When does measurement invariance matter? Commentary [J].
Borsboom, Denny .
MEDICAL CARE, 2006, 44 (11) :S176-S181