Modeling Item-Level Heterogeneous Treatment Effects With the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions

Cited by: 14
Authors
Gilbert, Joshua B. [1]
Kim, James S. [1]
Miratrix, Luke W. [1]
Affiliations
[1] Harvard Univ, Grad Sch Educ, Cambridge, MA 02138 USA
Keywords
heterogeneous treatment effects; explanatory item response model; causal inference; simulation; psychometrics; LOGISTIC-REGRESSION; RASCH MODEL; PACKAGE
DOI
10.3102/10769986231171710
Chinese Library Classification (CLC) number
G40 [Education]
Discipline classification codes
040101; 120403
Abstract
Analyses that reveal how treatment effects vary allow researchers, practitioners, and policymakers to better understand the efficacy of educational interventions. In practice, however, standard statistical methods for addressing heterogeneous treatment effects (HTE) fail to address the HTE that may exist within outcome measures. In this study, we present a novel application of the explanatory item response model (EIRM) for assessing what we term "item-level" HTE (IL-HTE), in which a unique treatment effect is estimated for each item in an assessment. Results from data simulation reveal that when IL-HTE is present but ignored in the model, standard errors can be underestimated and false positive rates can increase. We then apply the EIRM to assess the impact of a literacy intervention focused on promoting transfer in reading comprehension on a digital assessment delivered online to approximately 8,000 third-grade students. We demonstrate that allowing for IL-HTE can reveal treatment effects at the item level that are masked by a null average treatment effect, and the EIRM can thus provide fine-grained information for researchers and policymakers on the potentially heterogeneous causal effects of educational interventions.
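To make the idea of IL-HTE concrete, the following is a minimal simulation sketch, not the authors' code. It assumes a Rasch-type response model in which each item's treatment effect is the average effect plus a normally distributed item-specific deviation. The function names (`simulate_il_hte`, `item_level_gaps`), the parameter values, and the alternating treatment assignment are all illustrative assumptions. It only generates data and computes raw per-item differences in proportion correct; it does not fit an EIRM.

```python
import math
import random


def simulate_il_hte(n_persons=2000, n_items=20, avg_effect=0.0,
                    effect_sd=0.5, seed=0):
    """Simulate binary item responses with item-specific treatment effects.

    Assumed model (illustrative only):
        logit P(y = 1) = ability - difficulty_i + treated * item_effect_i
    where item_effect_i = avg_effect + Normal(0, effect_sd).
    """
    rng = random.Random(seed)
    difficulty = [rng.gauss(0.0, 1.0) for _ in range(n_items)]
    item_effect = [avg_effect + rng.gauss(0.0, effect_sd)
                   for _ in range(n_items)]
    rows = []
    for person in range(n_persons):
        # Alternating assignment stands in for randomization here;
        # abilities are drawn independently of it.
        treated = person % 2
        ability = rng.gauss(0.0, 1.0)
        for item in range(n_items):
            logit = ability - difficulty[item] + treated * item_effect[item]
            prob = 1.0 / (1.0 + math.exp(-logit))
            rows.append((person, item, treated, int(rng.random() < prob)))
    return rows, item_effect


def item_level_gaps(rows, n_items):
    """Treated-minus-control difference in proportion correct, per item."""
    correct = [[0, 0] for _ in range(n_items)]
    count = [[0, 0] for _ in range(n_items)]
    for _, item, treated, y in rows:
        correct[item][treated] += y
        count[item][treated] += 1
    return [correct[i][1] / count[i][1] - correct[i][0] / count[i][0]
            for i in range(n_items)]


rows, item_effect = simulate_il_hte()
gaps = item_level_gaps(rows, n_items=20)
overall_gap = sum(gaps) / len(gaps)
```

With `avg_effect=0.0`, the per-item values in `gaps` can differ from zero in both directions while `overall_gap` averages over them, which is the kind of masking by a null average effect that the abstract describes.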
Pages: 889-913
Page count: 25