Combining machine learning and propensity score weighting to estimate causal effects in multivalued treatments

被引:19
作者
Linden, Ariel [1 ,2 ]
Yarnold, Paul R. [3 ]
机构
[1] Linden Consulting Grp LLC, 1301 North Bay Dr, Ann Arbor, MI 48103 USA
[2] Univ Michigan, Sch Med, Div Gen Med, Ann Arbor, MI USA
[3] Optimal Data Anal LLC, Evanston, IL USA
关键词
causal inference; doubly robust; inverse probability of treatment weighting; machine learning; marginal mean weighting through stratification; multivalued treatments; observational studies; propensity score; MANAGEMENT PROGRAM EFFECTIVENESS; DOUBLY ROBUST ESTIMATION; DISEASE MANAGEMENT; COVARIATE BALANCE; MISSING DATA; INFERENCE; STRATIFICATION; REGRESSION; MODEL; UNIVARIATE;
D O I
10.1111/jep.12610
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Rationale, aims and objectivesInterventions with multivalued treatments are common in medical and health research; examples include comparing the efficacy of competing interventions and contrasting various doses of a drug. In recent years, there has been growing interest in the development of methods that estimate multivalued treatment effects using observational data. This paper extends a previously described analytic framework for evaluating binary treatments to studies involving multivalued treatments utilizing a machine learning algorithm called optimal discriminant analysis (ODA). MethodWe describe the differences between regression-based treatment effect estimators and effects estimated using the ODA framework. We then present an empirical example using data from an intervention including three study groups to compare corresponding effects. ResultsThe regression-based estimators produced statistically significant mean differences between the two intervention groups, and between one of the treatment groups and controls. In contrast, ODA was unable to discriminate between distributions of any of the three study groups. ConclusionsOptimal discriminant analysis offers an appealing alternative to conventional regression-based models for estimating effects in multivalued treatment studies because of its insensitivity to skewed data and use of accuracy measures applicable to all prognostic analyses. If these analytic approaches produce consistent treatment effect P values, this bolsters confidence in the validity of the results. If the approaches produce conflicting treatment effect P values, as they do in our empirical example, the investigator should consider the ODA-derived estimates to be most robust, given that ODA uses permutation P values that require no distributional assumptions and are thus, always valid.
引用
收藏
页码:871 / 881
页数:11
相关论文
共 54 条
[1]  
[Anonymous], P AM STAT ASS SECTIO
[2]  
[Anonymous], 2015, WORKING PAPER
[3]  
[Anonymous], 2014, Statistical Software Components S457793
[4]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[5]  
Campbell D.T., 1966, Experimental and quasi-experimental designs for research
[6]   Efficient semiparametric estimation of multi-valued treatment effects under ignorability [J].
Cattaneo, Matias D. .
JOURNAL OF ECONOMETRICS, 2010, 155 (02) :138-154
[7]   A NEW METHOD OF CLASSIFYING PROGNOSTIC CO-MORBIDITY IN LONGITUDINAL-STUDIES - DEVELOPMENT AND VALIDATION [J].
CHARLSON, ME ;
POMPEI, P ;
ALES, KL ;
MACKENZIE, CR .
JOURNAL OF CHRONIC DISEASES, 1987, 40 (05) :373-383
[8]   STANDARD DISTANCE IN UNIVARIATE AND MULTIVARIATE-ANALYSIS [J].
FLURY, BK ;
RIEDWYL, H .
AMERICAN STATISTICIAN, 1986, 40 (03) :249-251
[9]   Programme evaluation with multiple treatments [J].
Frölich, M .
JOURNAL OF ECONOMIC SURVEYS, 2004, 18 (02) :181-224
[10]  
Grimm L.G., 2000, READING UNDERSTANDIN