A Regression Discontinuity Design Framework for Controlling Selection Bias in Evaluations of Differential Item Functioning

Cited by: 0
Authors
Koziol, Natalie A. [1]
Goodrich, J. Marc [2]
Yoon, HyeonJin [1]
Affiliations
[1] Univ Nebraska, Lincoln, NE 68588 USA
[2] Texas A&M Univ, College Stn, TX USA
Funding
U.S. National Science Foundation;
Keywords
differential item functioning (DIF); logistic regression; regression discontinuity design; selection bias; LANGUAGE LEARNERS EVIDENCE; I ERROR INFLATION; LOGISTIC-REGRESSION; MANTEL-HAENSZEL; PROPENSITY SCORE; ODDS RATIO; DIF; SIBTEST; IDENTIFICATION; TESTS;
DOI
10.1177/00131644211068440
Chinese Library Classification number
G44 [Educational Psychology];
Discipline classification code
0402 ; 040202 ;
Abstract
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A simulation study was performed to compare the new framework with traditional logistic regression, with respect to Type I error and power rates of the uniform DIF test statistics and bias and root mean square error of the corresponding effect size estimators. The new framework better controlled the Type I error rate and demonstrated minimal bias but suffered from low power and lack of precision. Implications for practice are discussed.
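For orientation, the traditional logistic regression baseline named in the abstract can be sketched as a likelihood-ratio test for uniform DIF: the item response is regressed on a matching variable, then on the matching variable plus a group indicator, and the two fits are compared. This is only an illustrative sketch, not the authors' code and not the proposed regression discontinuity framework. The function names, the simulated data, the sample size, and the randomly assigned group indicator (which by construction carries no selection bias) are all assumptions made for the example.

```python
import numpy as np


def fit_logistic(X, y, n_iter=50, tol=1e-10):
    """Fit a logistic regression by Newton-Raphson.

    Returns the coefficient vector and the maximized log-likelihood.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-(X @ beta)))
        w = p * (1.0 - p)
        grad = X.T @ (y - p)
        hess = X.T @ (X * w[:, None])
        step = np.linalg.solve(hess, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    p = 1.0 / (1.0 + np.exp(-(X @ beta)))
    p = np.clip(p, 1e-12, 1.0 - 1e-12)
    loglik = np.sum(y * np.log(p) + (1.0 - y) * np.log(1.0 - p))
    return beta, loglik


def uniform_dif_lr_test(item, matching, group):
    """Likelihood-ratio test for uniform DIF on one dichotomous item.

    Compares a model with the matching variable only against a model
    that adds the group indicator. Returns the group coefficient
    (a log odds ratio, used here as the effect size) and the LR statistic,
    which is referred to a chi-square distribution with 1 degree of freedom.
    """
    ones = np.ones_like(matching, dtype=float)
    X0 = np.column_stack([ones, matching])
    X1 = np.column_stack([ones, matching, group])
    _, ll0 = fit_logistic(X0, item)
    beta1, ll1 = fit_logistic(X1, item)
    return beta1[2], 2.0 * (ll1 - ll0)


def simulate(n=4000, dif=1.0, seed=0):
    """Hypothetical data: random group membership, normal matching variable."""
    rng = np.random.default_rng(seed)
    group = rng.integers(0, 2, size=n).astype(float)
    matching = rng.normal(size=n)
    logit = -0.2 + 1.0 * matching + dif * group
    item = (rng.random(n) < 1.0 / (1.0 + np.exp(-logit))).astype(float)
    return item, matching, group


CHI2_1_CRIT_05 = 3.841  # chi-square critical value, 1 df, alpha = .05

item, matching, group = simulate(dif=1.0)
log_odds_ratio, lr_stat = uniform_dif_lr_test(item, matching, group)
print("log odds ratio:", round(log_odds_ratio, 3))
print("flag uniform DIF:", lr_stat > CHI2_1_CRIT_05)
```

In practice the matching variable is usually an observed total or rest score rather than a simulated ability value, and group membership is not randomly assigned, which is where the selection bias discussed in the abstract arises.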
Pages: 1247-1277
Page count: 31