Using classification tree analysis to generate propensity score weights

被引：14

作者：

Linden, Ariel ^{[1
,2
]}

Yarnold, Paul R. ^{[3
]}

机构：

[1] Linden Consulting Grp LLC, 1301 North Bay Dr, Ann Arbor, MI 48103 USA

[2] Univ Michigan, Sch Med, Div Gen Med, Ann Arbor, MI USA

[3] Optimal Data Anal LLC, Chicago, IL USA

来源：

JOURNAL OF EVALUATION IN CLINICAL PRACTICE | 2017年 / 23卷 / 04期

关键词：

causal inference; classification tree analysis; machine learning; propensity score; MANAGEMENT PROGRAM EFFECTIVENESS; IMPROVE CAUSAL INFERENCE; DISEASE MANAGEMENT; BOOSTED REGRESSION; STRATIFICATION; MODEL; CLASSIFIERS; UNIVARIATE; SELECTION; EXAMPLE;

D O I：

10.1111/jep.12744

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Rationale, aims and objectives: In evaluating non-randomized interventions, propensity scores (PS) estimate the probability of assignment to the treatment group given observed characteristics. Machine learning algorithms have been proposed as an alternative to conventional logistic regression for modelling PS in order to avoid limitations of linear methods. We introduce classification tree analysis (CTA) to generate PS which is a "decision-tree"-like classification model that provides accurate, parsimonious decision rules that are easy to display and interpret, reports P values derived via permutation tests, and evaluates cross-generalizability. Method: Using empirical data, we identify all statistically valid CTA PS models and then use them to compute strata-specific, observation-level PS weights that are subsequently applied in outcomes analyses. We compare findings obtained using this framework to logistic regression and boosted regression, by evaluating covariate balance using standardized differences, model predictive accuracy, and treatment effect estimates obtained using median regression and a weighted CTA outcomes model. Results: While all models had some imbalanced covariates, main-effects logistic regression yielded the lowest average standardized difference, whereas CTA yielded the greatest predictive accuracy. Nevertheless, treatment effect estimates were generally consistent across all models. Conclusions: Assessing standardized differences in means as a test of covariate balance is inappropriate for machine learning algorithms that segment the sample into two or more strata. Because the CTA algorithm identifies all statistically valid PS models for a sample, it is most likely to identify a correctly specified PS model, and should be considered as an alternative approach to modeling the PS.

引用

页码：703 / 712

页数：10

共 56 条

[1] Allison Paul D., 2008, SAS Global Forum, V360, P1
[2] Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[3] [Anonymous], OPTIM DATA ANAL
[4] [Anonymous], 2010, Optimal Data Analysis
[5] Barosi G, 1998, BLOOD, V91, P3630
[6] Statistical modeling: The two cultures
Breiman, L
[J]. STATISTICAL SCIENCE, 2001, 16 (03) : 199 - 215
[7] Variable selection for propensity score models
Brookhart, M. Alan
Schneeweiss, Sebastian
Rothman, Kenneth J.
Glynn, Robert J.
Avorn, Jerry
Sturmer, Til
[J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) : 1149 - 1156
[8] ASYMMETRIC STRATIFICATION - AN OUTLINE FOR AN EFFICIENT METHOD FOR CONTROLLING CONFOUNDING IN COHORT STUDIES
COOK, EF
GOLDMAN, L
[J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1988, 127 (03) : 626 - 639
[9] COVBAL Linden A., 2016, STAT SOFTWARE COMPON
[10] EFFECTS OF MISSPECIFICATION OF THE PROPENSITY SCORE ON ESTIMATORS OF TREATMENT EFFECT
DRAKE, C
[J]. BIOMETRICS, 1993, 49 (04) : 1231 - 1236

← 1 2 3 4 5 6 →