Causal Inference with Multilevel Data: A Comparison of Different Propensity Score Weighting Approaches

被引:27
作者
Fuentes, Alvaro [1 ]
Luedtke, Oliver [1 ]
Robitzsch, Alexander [1 ]
机构
[1] Leibniz Inst Sci & Math Educ, Ctr Int Student Assessment, Kiel, Germany
关键词
Causal inference; propensity scores; multilevel data; weighting; calibration weights;
D O I
10.1080/00273171.2021.1925521
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Propensity score methods are a widely recommended approach to adjust for confounding and to recover treatment effects with non-experimental, single-level data. This article reviews propensity score weighting estimators for multilevel data in which individuals (level 1) are nested in clusters (level 2) and nonrandomly assigned to either a treatment or control condition at level 1. We address the choice of a weighting strategy (inverse probability weights, trimming, overlap weights, calibration weights) and discuss key issues related to the specification of the propensity score model (fixed-effects model, multilevel random-effects model) in the context of multilevel data. In three simulation studies, we show that estimates based on calibration weights, which prioritize balancing the sample distribution of level-1 and (unmeasured) level-2 covariates, should be preferred under many scenarios (i.e., treatment effect heterogeneity, presence of strong level-2 confounding) and can accommodate covariate-by-cluster interactions. However, when level-1 covariate effects vary strongly across clusters (i.e., under random slopes), and this variation is present in both the treatment and outcome data-generating mechanisms, large cluster sizes are needed to obtain accurate estimates of the treatment effect. We also discuss the implementation of survey weights and present a real-data example that illustrates the different methods.
引用
收藏
页码:916 / 939
页数:24
相关论文
共 63 条
[1]  
Allison P., 2009, SAGE, DOI DOI 10.4135/9781412993869
[2]  
[Anonymous], 2007, 708 CSE UCLA
[3]  
Aronow P.M., 2019, Foundations of Agnostic Statistics
[4]   The specification of the propensity score in multilevel observational studies [J].
Arpino, Bruno ;
Mealli, Fabrizia .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (04) :1770-1780
[5]   An Introduction to Propensity Score Methods for Reducing the Effects of Confounding in Observational Studies [J].
Austin, Peter C. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2011, 46 (03) :399-424
[6]   Propensity Score Analysis With Missing Data [J].
Cham, Heining ;
West, Stephen G. .
PSYCHOLOGICAL METHODS, 2016, 21 (03) :427-445
[7]   Constructing inverse probability weights for marginal structural models [J].
Cole, Stephen R. ;
Hernan, Miguel A. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2008, 168 (06) :656-664
[8]   Dealing with limited overlap in estimation of average treatment effects [J].
Crump, Richard K. ;
Hotz, V. Joseph ;
Imbens, Guido W. ;
Mitnik, Oscar A. .
BIOMETRIKA, 2009, 96 (01) :187-199
[9]   Using Propensity Score Analysis of Survey Data to Estimate Population Average Treatment Effects: A Case Study Comparing Different Methods [J].
Dong, Nianbo ;
Stuart, Elizabeth A. ;
Lenis, David ;
Trang Quynh Nguyen .
EVALUATION REVIEW, 2020, 44 (01) :84-108
[10]   Regressor and random-effects dependencies in multilevel models [J].
Ebbes, P ;
Böckenholt, U ;
Wedel, M .
STATISTICA NEERLANDICA, 2004, 58 (02) :161-178