Evaluating treatment effectiveness in patient subgroups: a comparison of propensity score methods with an automated matching approach

被引:37
作者
Radice, Rosalba [1 ]
Ramsahai, Roland [2 ]
Grieve, Richard [2 ]
Kreif, Noemi [2 ]
Sadique, Zia [2 ]
Sekhon, Jasjeet S. [3 ]
机构
[1] CLondon Sch Hyg & Trop Med, London, England
[2] LSHTM, Ctr Stat Methodol, London, England
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家卫生研究院; 英国医学研究理事会;
关键词
confounding; observational studies; matching; propensity score methods; subgroup analysis; MARGINAL STRUCTURAL MODELS; CAUSAL INFERENCE; UNTREATED SUBJECTS; REGRESSION; PERFORMANCE; ROBUSTNESS; ESTIMATORS; ABILITY; BIAS;
D O I
10.1515/1557-4679.1382
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Propensity score (Pscore) matching and inverse probability of treatment weighting (IPTW) can remove bias due to observed confounders, if the Pscore is correctly specified. Genetic Matching (GenMatch) matches on the Pscore and individual covariates using an automated search algorithm to balance covariates. This paper compares common ways of implementing Pscore matching and IPTW, with Genmatch for balancing time-constant baseline covariates}. The methods are considered when estimates of treatment effectiveness are required for patient subgroups, and the treatment allocation process differs by subgroup. We apply these methods in a prospective cohort study that estimates the effectiveness of Drotrecogin alfa activated, for subgroups of patients with severe sepsis. In a simulation study we compare the methods when the Pscore is correctly specified, and then misspecified by ignoring the subgroup-specific treatment allocation. The simulations also consider poor overlap in baseline covariates, and different sample sizes. In the case study, GenMatch reports better covariate balance than IPTW or Pscore matching. In the simulations with correctly specified Pscores, good overlap and reasonable sample sizes, all methods report minimal bias. When the Pscore is misspecified, GenMatch reports the least imbalance and bias. With small sample sizes, IPTW is the most efficient approach, but all methods report relatively high bias of treatment effects. This study shows that overall GenMatch achieves the best covariate balance for each subgroup, and is more robust to Pscore misspecification than common alternative Pscore approaches.
引用
收藏
页数:44
相关论文
共 68 条
  • [1] Large sample properties of matching estimators for average treatment effects
    Abadie, A
    Imbens, GW
    [J]. ECONOMETRICA, 2006, 74 (01) : 235 - 267
  • [2] Abadie A., 2001, STATA J, V1, P1, DOI DOI 10.1177/1536867X0400400307
  • [3] Abadie Alberto., 2009, Matching on the estimated propensity score
  • [4] Drotrecogin alfa (activated) for adults with severe sepsis and a low risk of death
    Abraham, E
    Laterre, P
    Garg, R
    Levy, H
    Talwar, D
    Trzaskoma, BL
    Francois, B
    Guy, JS
    Bruckmann, M
    Rea-Neto, A
    Rossaint, R
    Perrotin, D
    Sablotzki, A
    Arkins, N
    Utterback, BG
    Macias, WL
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2005, 353 (13) : 1332 - 1341
  • [5] [Anonymous], 1998, Polit Anal, DOI DOI 10.1093/PAN/7.1.187
  • [6] [Anonymous], HLTH EC, DOI DOI 10.1002/HEC.1748
  • [7] Austin PC, 2008, STAT MED, V27, P2037, DOI 10.1002/sim.3150
  • [8] The performance of different propensity-score methods for estimating relative risks
    Austin, Peter C.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2008, 61 (06) : 537 - 545
  • [9] The performance of different propensity score methods for estimating marginal odds ratios
    Austin, Peter C.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (16) : 3078 - 3094
  • [10] A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study
    Austin, Peter C.
    Grootendorst, Paul
    Anderson, Geoffrey M.
    [J]. STATISTICS IN MEDICINE, 2007, 26 (04) : 734 - 753