Comparison of Methods for Identifying Differential Step Functioning with Polytomous Item Response Data

Cited by: 1
Authors
Finch, Holmes [1, 2]
Affiliations
[1] Ball State Univ, Dept Educ Psychol, Muncie, IN USA
[2] Ball State Univ, Dept Educ Psychol, Muncie, IN 47306 USA
Keywords
I ERROR INFLATION; MANTEL-HAENSZEL; DETECTING DIF; MIMIC-MODEL; R PACKAGE; SELECTION; POWER;
DOI
10.1080/08957347.2022.2155650
Chinese Library Classification number
G40 [Education];
Discipline classification code
040101 ; 120403 ;
Abstract
Much research has been devoted to the identification of differential item functioning (DIF), which occurs when the item responses of individuals from two groups differ after conditioning on the latent trait measured by the scale. Less work has examined differential step functioning (DSF), which is present for polytomous items when the conditional likelihood of responding in specific categories differs between groups. DSF affects estimation of the measured trait and reduces the effectiveness of standard DIF detection methods. The purpose of this simulation study was to extend earlier work by comparing several methods for detecting DSF in polytomous items, including an approach based on lasso estimation of the generalized partial credit model (GPCM). Results show that the lasso GPCM technique controlled the Type I error rate while yielding power somewhat lower than that of logistic regression and the MIMIC model, which failed to control the Type I error rate in some conditions. An empirical example is also presented, and implications of this study for practice are discussed.
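To make the notion of DSF concrete, the following is a minimal sketch, not the study's lasso procedure, of how a shift in a single step parameter changes category probabilities under a generalized partial credit model. The function name `gpcm_probs`, the parameter values, and the shift `dsf_shift` are all illustrative assumptions and are not taken from the article.

```python
import math

def gpcm_probs(theta, a, steps):
    """Category probabilities for one item under a generalized partial credit model.

    theta: latent trait value; a: discrimination; steps: step parameters b_1..b_m.
    Returns probabilities for categories 0..m.
    """
    # Cumulative sums of a * (theta - b_j); category 0 has an empty sum of 0.
    cum = [0.0]
    for b in steps:
        cum.append(cum[-1] + a * (theta - b))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(cum)
    expo = [math.exp(c - top) for c in cum]
    total = sum(expo)
    return [e / total for e in expo]

# Hypothetical values: two groups matched on the latent trait (same theta).
theta = 0.0
a = 1.0
ref_steps = [-1.0, 0.0, 1.0]          # reference group step parameters
dsf_shift = 0.5                        # assumed shift in the second step only
focal_steps = [-1.0, 0.0 + dsf_shift, 1.0]

ref = gpcm_probs(theta, a, ref_steps)
focal = gpcm_probs(theta, a, focal_steps)

for k, (p_ref, p_foc) in enumerate(zip(ref, focal)):
    print(f"category {k}: reference={p_ref:.3f} focal={p_foc:.3f}")
```

In this sketch the odds of category 1 versus category 0 are identical for the two groups, while the odds of category 2 versus category 1 differ. That is the step-specific pattern DSF refers to, and an item-level DIF test may only partly reflect it.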
Pages: 255-271
Page count: 17