Comparing the performance of propensity score methods in healthcare database studies with rare outcomes

被引:75
作者
Franklin, Jessica M. [1 ,2 ]
Eddings, Wesley [1 ,2 ]
Austin, Peter C. [3 ]
Stuart, Elizabeth A. [4 ,5 ,6 ]
Schneeweiss, Sebastian [1 ,2 ]
机构
[1] Brigham & Womens Hosp, Dept Med, Div Pharmacoepidemiol & Pharmacoecon, 1620 Tremont St,Suite 3030, Boston, MA 02120 USA
[2] Harvard Med Sch, 1620 Tremont St,Suite 3030, Boston, MA 02120 USA
[3] Inst Clin Evaluat Sci, Toronto, ON, Canada
[4] Johns Hopkins Univ, Bloomberg Sch Publ Hlth, Dept Mental Hlth, Baltimore, MD USA
[5] Johns Hopkins Univ, Bloomberg Sch Publ Hlth, Dept Biostat, Baltimore, MD USA
[6] Johns Hopkins Univ, Bloomberg Sch Publ Hlth, Dept Healthcare Policy & Management, Baltimore, MD USA
关键词
simulation; propensity score; epidemiology; healthcare databases; risk ratio; LOGISTIC-REGRESSION; CAUSAL INFERENCE; COHORT; STRATIFICATION; SIMULATION; ADJUSTMENT; EVENTS; NUMBER; BIAS;
D O I
10.1002/sim.7250
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nonrandomized studies of treatments from electronic healthcare databases are critical for producing the evidence necessary to making informed treatment decisions, but often rely on comparing rates of events observed in a small number of patients. In addition, studies constructed from electronic healthcare databases, for example, administrative claims data, often adjust for many, possibly hundreds, of potential confounders. Despite the importance of maximizing efficiency when there are many confounders and few observed outcome events, there has been relatively little research on the relative performance of different propensity score methods in this context. In this paper, we compare a wide variety of propensity-based estimators of the marginal relative risk. In contrast to prior research that has focused on specific statistical methods in isolation of other analytic choices, we instead consider a method to be defined by the complete multistep process from propensity score modeling to final treatment effect estimation. Propensity score model estimation methods considered include ordinary logistic regression, Bayesian logistic regression, lasso, and boosted regression trees. Methods for utilizing the propensity score include pair matching, full matching, decile strata, fine strata, regression adjustment using one or two nonlinear splines, inverse propensity weighting, and matching weights. We evaluate methods via a plasmode' simulation study, which creates simulated datasets on the basis of a real cohort study of two treatments constructed from administrative claims data. Our results suggest that regression adjustment and matching weights, regardless of the propensity score model estimation method, provide lower bias and mean squared error in the context of rare binary outcomes. Copyright (c) 2017 John Wiley & Sons, Ltd.
引用
收藏
页码:1946 / 1963
页数:18
相关论文
共 65 条
  • [1] Austin PC, 2008, STAT MED, V27, P2037, DOI 10.1002/sim.3150
  • [2] The performance of different propensity score methods for estimating marginal odds ratios
    Austin, Peter C.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (16) : 3078 - 3094
  • [3] Conditioning on the propensity score can result in biased estimation of common measures of treatment effect: A Monte Carlo study (p n/a)
    Austin, Peter C.
    Grootendorst, Paul
    Normand, Sharon-Lise T.
    Anderson, Geoffrey M.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (16) : 3208 - 3210
  • [4] A comparison of 12 algorithms for matching on the propensity score
    Austin, Peter C.
    [J]. STATISTICS IN MEDICINE, 2014, 33 (06) : 1057 - 1069
  • [5] The performance of different propensity score methods for estimating marginal hazard ratios
    Austin, Peter C.
    [J]. STATISTICS IN MEDICINE, 2013, 32 (16) : 2837 - 2849
  • [6] Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies
    Austin, Peter C.
    [J]. PHARMACEUTICAL STATISTICS, 2011, 10 (02) : 150 - 161
  • [7] Doubly robust estimation in missing data and causal inference models
    Bang, H
    [J]. BIOMETRICS, 2005, 61 (04) : 962 - 972
  • [8] Bembom O, 2007, STAT APPL GENET MOL, V6
  • [9] Variable selection for propensity score models
    Brookhart, M. Alan
    Schneeweiss, Sebastian
    Rothman, Kenneth J.
    Glynn, Robert J.
    Avorn, Jerry
    Sturmer, Til
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) : 1149 - 1156
  • [10] Estimation using all available covariate information versus a fixed look-back window for dichotomous covariates
    Brunelli, Steven M.
    Gagne, Joshua J.
    Huybrechts, Krista F.
    Wang, Shirley V.
    Patrick, Amanda R.
    Rothman, Kenneth J.
    Seeger, John D.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2013, 22 (05) : 542 - 550