Using classification tree analysis to generate propensity score weights

被引:14
作者
Linden, Ariel [1 ,2 ]
Yarnold, Paul R. [3 ]
机构
[1] Linden Consulting Grp LLC, 1301 North Bay Dr, Ann Arbor, MI 48103 USA
[2] Univ Michigan, Sch Med, Div Gen Med, Ann Arbor, MI USA
[3] Optimal Data Anal LLC, Chicago, IL USA
关键词
causal inference; classification tree analysis; machine learning; propensity score; MANAGEMENT PROGRAM EFFECTIVENESS; IMPROVE CAUSAL INFERENCE; DISEASE MANAGEMENT; BOOSTED REGRESSION; STRATIFICATION; MODEL; CLASSIFIERS; UNIVARIATE; SELECTION; EXAMPLE;
D O I
10.1111/jep.12744
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Rationale, aims and objectives: In evaluating non-randomized interventions, propensity scores (PS) estimate the probability of assignment to the treatment group given observed characteristics. Machine learning algorithms have been proposed as an alternative to conventional logistic regression for modelling PS in order to avoid limitations of linear methods. We introduce classification tree analysis (CTA) to generate PS which is a "decision-tree"-like classification model that provides accurate, parsimonious decision rules that are easy to display and interpret, reports P values derived via permutation tests, and evaluates cross-generalizability. Method: Using empirical data, we identify all statistically valid CTA PS models and then use them to compute strata-specific, observation-level PS weights that are subsequently applied in outcomes analyses. We compare findings obtained using this framework to logistic regression and boosted regression, by evaluating covariate balance using standardized differences, model predictive accuracy, and treatment effect estimates obtained using median regression and a weighted CTA outcomes model. Results: While all models had some imbalanced covariates, main-effects logistic regression yielded the lowest average standardized difference, whereas CTA yielded the greatest predictive accuracy. Nevertheless, treatment effect estimates were generally consistent across all models. Conclusions: Assessing standardized differences in means as a test of covariate balance is inappropriate for machine learning algorithms that segment the sample into two or more strata. Because the CTA algorithm identifies all statistically valid PS models for a sample, it is most likely to identify a correctly specified PS model, and should be considered as an alternative approach to modeling the PS.
引用
收藏
页码:703 / 712
页数:10
相关论文
共 56 条
  • [1] Allison Paul D., 2008, SAS Global Forum, V360, P1
  • [2] Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
  • [3] [Anonymous], OPTIM DATA ANAL
  • [4] [Anonymous], 2010, Optimal Data Analysis
  • [5] Barosi G, 1998, BLOOD, V91, P3630
  • [6] Statistical modeling: The two cultures
    Breiman, L
    [J]. STATISTICAL SCIENCE, 2001, 16 (03) : 199 - 215
  • [7] Variable selection for propensity score models
    Brookhart, M. Alan
    Schneeweiss, Sebastian
    Rothman, Kenneth J.
    Glynn, Robert J.
    Avorn, Jerry
    Sturmer, Til
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) : 1149 - 1156
  • [8] ASYMMETRIC STRATIFICATION - AN OUTLINE FOR AN EFFICIENT METHOD FOR CONTROLLING CONFOUNDING IN COHORT STUDIES
    COOK, EF
    GOLDMAN, L
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1988, 127 (03) : 626 - 639
  • [9] COVBAL Linden A., 2016, STAT SOFTWARE COMPON
  • [10] EFFECTS OF MISSPECIFICATION OF THE PROPENSITY SCORE ON ESTIMATORS OF TREATMENT EFFECT
    DRAKE, C
    [J]. BIOMETRICS, 1993, 49 (04) : 1231 - 1236