The Multidimensionality of Measurement Bias in High-Stakes Testing: Using Machine Learning to Evaluate Complex Sources of Differential Item Functioning

被引:7
作者
Belzak, William C. M. [1 ]
机构
[1] Duolingo Inc, Pittsburgh, PA 15206 USA
关键词
differential item functioning; measurement bias; multidimensionality; psychometrics; regularization; SCIENCE ACHIEVEMENT; GENDER DIFFERENCES; DIF; PREFERENCES; REGULARIZATION; ASSESSMENTS; SELECTION; LANGUAGE; MODELS; TREES;
D O I
10.1111/emip.12486
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Test developers and psychometricians have historically examined measurement bias and differential item functioning (DIF) across a single categorical variable (e.g., gender), independently of other variables (e.g., race, age, etc.). This is problematic when more complex forms of measurement bias may adversely affect test responses and, ultimately, bias test scores. Complex forms of measurement bias include conditional effects, interactions, and mediation of background information on test responses. I propose a multidimensional, person-specific perspective of measurement bias to explain how complex sources of bias can manifest in the assessment of human knowledge, skills, and abilities. I also describe a data-driven approach for identifying key sources of bias among many possibilities-namely, a machine learning method commonly known as regularization.
引用
收藏
页码:24 / 33
页数:10
相关论文
共 67 条
[1]  
Abbott M.L., 2007, LANG TEST, V24, P7, DOI [DOI 10.1177/0066552207071510, 10.1177/0265532207071510, DOI 10.1177/0265532207071510]
[2]   Constructing Better Second Language Assessments Based on Differential Item Functioning Analysis [J].
Allalouf, Avi ;
Abramzon, Andrea .
LANGUAGE ASSESSMENT QUARTERLY, 2008, 5 (02) :120-141
[3]  
ALSHUAIBI ABDULGHANI., 2009, LANGUAGE INDIA, V9, P195
[4]  
Angoff W.H., 1993, Perspectives on differential item functioning methodology
[5]  
[Anonymous], 2014, Iranian Journal of Language Testing
[6]   Simplifying the Assessment of Measurement Invariance over Multiple Background Variables: Using Regularized Moderated Nonlinear Factor Analysis to Detect Differential Item Functioning [J].
Bauer, Daniel J. ;
Belzak, William C. M. ;
Cole, Veronica T. .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2020, 27 (01) :43-55
[7]   A More General Model for Testing Measurement Invariance and Differential Item Functioning [J].
Bauer, Daniel J. .
PSYCHOLOGICAL METHODS, 2017, 22 (03) :507-526
[8]   Psychometric Approaches for Developing Commensurate Measures Across Independent Studies: Traditional and New Models [J].
Bauer, Daniel J. ;
Hussong, Andrea M. .
PSYCHOLOGICAL METHODS, 2009, 14 (02) :101-125
[10]   Improving the Assessment of Measurement Invariance: Using Regularization to Select Anchor Items and Identify Differential Item Functioning [J].
Belzak, William C. M. ;
Bauer, Daniel J. .
PSYCHOLOGICAL METHODS, 2020, 25 (06) :673-690