Causal inference for multi-level treatments with machine-learned propensity scores

被引:0
作者
Lin Lin
Yeying Zhu
Liang Chen
机构
[1] University of Waterloo,Department of Statistics and Actuarial Science
[2] Wuhan University,Economics and Management School
来源
Health Services and Outcomes Research Methodology | 2019年 / 19卷
关键词
Causal inference; Generalized propensity score; Multinomial logistic regression; Generalized boosted model; Random forest; Data adaptive matching score;
D O I
暂无
中图分类号
学科分类号
摘要
Propensity score-based methods have been widely developed to adjust for confounders in observational studies to estimate causal treatment effect for binary treatments. We generalize these causal inference methods to the multi-level treatment case. We review the generalized causal inference framework and several propensity score estimation methods. We conduct a comprehensive simulation study to evaluate the performance of multinomial logistic regression, generalized boosted models, random forest and data adaptive matching score for estimating propensity scores based on inverse probability of treatment weighting. From our findings, multinomial logistic regression is susceptible to yielding extreme weights while a mis-specified model is assumed, which results in poor performance of the inverse probability weighted estimator. On the other hand, machine-learned propensity scores tend to have less biased and more stable performance, and the data adaptive matching score tends to perform the best overall. The above-mentioned propensity score based methods are applied to the Taobao dataset to evaluate the causal effect of reputation on sales.
引用
收藏
页码:106 / 126
页数:20
相关论文
共 73 条
[1]  
Breiman L(2001)Random forests Mach. Learn. 45 5-32
[2]  
Brookhart MA(2006)A semiparametric model selection criterion with applications to the marginal structural model Comput. Stat. Data Anal. 50 475-498
[3]  
van der Laan MJ(2009)Dealing with limited overlap in estimation of average treatment effects Biometrika 96 187-199
[4]  
Crump RK(2016)Reputation premium and reputation management: evidence from the largest e-commerce platform in china Int. J. Ind. Organ. 46 63-76
[5]  
Hotz VJ(2012)Generalized propensity score for estimating the average treatment effect of multiple treatments Stat. Med. 31 681-697
[6]  
Imbens GW(2004)Programme evaluation with multiple treatments J. Econ. Surv. 18 181-224
[7]  
Mitnik OA(2004)Causal inference with general treatment regimes J. Am. Stat. Assoc. 99 854-866
[8]  
Fan Y(2000)The role of the propensity score in estimating dose-response functions Biometrika 87 706-710
[9]  
Ju J(2002)Program heterogeneity and propensity score matching: an application to the evaluation of active labor market policies Rev. Econ. Stat. 84 205-220
[10]  
Xiao M(2010)Improving propensity score weighting using machine learning Stat. Med. 29 337-346