Comparison of methods for handling covariate missingness in propensity score estimation with a binary exposure

Cited by: 28
Authors
Coffman, Donna L. [1 ]
Zhou, Jiangxiu [2 ]
Cai, Xizhen [3 ]
Affiliations
[1] Temple Univ, 1301 Cecil B Moore Ave,Ritter Annex,9th Floor, Philadelphia, PA 19122 USA
[2] GlaxoSmithKline, Philadelphia, PA USA
[3] Williams Coll, Williamstown, MA 01267 USA
Funding
National Institutes of Health (USA)
Keywords
Propensity scores; Missing data; Causal inference; Generalized boosted models; Multiple imputation
DOI
10.1186/s12874-020-01053-4
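Chinese Library Classification number
R19 [Health care organization and services (health services administration)]
Discipline classification code
Abstract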
Background: Causal effect estimation with observational data is subject to bias due to confounding, which is often controlled for using propensity scores. One unresolved issue in propensity score estimation is how to handle missing values in covariates.
Method: Several approaches have been proposed for handling covariate missingness, including multiple imputation (MI), multiple imputation with missingness pattern (MIMP), and treatment mean imputation. However, other potentially useful approaches have not been evaluated, including single imputation (SI) + prediction error (PE), SI + PE + parameter uncertainty (PU), and Generalized Boosted Modeling (GBM), a nonparametric approach for estimating propensity scores in which missing values are handled automatically during estimation using a surrogate split method. A simulation study was conducted to evaluate the performance of these approaches.
Results: SI + PE, SI + PE + PU, MI, and MIMP performed almost equally well, and better than treatment mean imputation and GBM, in terms of bias; however, MI and MIMP also account for the additional uncertainty introduced by imputing the missing values.
Conclusions: Applying GBM to the incomplete data and relying on the surrogate split approach resulted in substantial bias. Imputation prior to implementing GBM is recommended.
Pages: 14
Related papers
32 entries in total
  • [1] Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model
    Bartlett, Jonathan W.
    Seaman, Shaun R.
    White, Ian R.
    Carpenter, James R.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2015, 24 (04) : 462 - 487
  • [2] Variable selection for propensity score models
    Brookhart, M. Alan
    Schneeweiss, Sebastian
    Rothman, Kenneth J.
    Glynn, Robert J.
    Avorn, Jerry
    Sturmer, Til
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2006, 163 (12) : 1149 - 1156
  • [3] Propensity Score Analysis With Missing Data
    Cham, Heining
    West, Stephen G.
    [J]. PSYCHOLOGICAL METHODS, 2016, 21 (03) : 427 - 445
  • [4] A comparison of inclusive and restrictive strategies in modern missing data procedures
    Collins, LM
    Schafer, JL
    Kam, CM
    [J]. PSYCHOLOGICAL METHODS, 2001, 6 (04) : 330 - 351
  • [5] Comparison of several imputation methods for missing baseline data in propensity scores analysis of binary outcome
    Crowe, Brenda J.
    Lipkovich, Ilya A.
    Wang, Ouhong
    [J]. PHARMACEUTICAL STATISTICS, 2010, 9 (04) : 269 - 279
  • [6] D'Agostino R.B., 2001, Health Services Outcomes Research Methodology, V2, P291, DOI 10.1023/A:1020375413191
  • [7] Estimating and using propensity scores with partially missing data
    D'Agostino, RB
    Rubin, DB
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) : 749 - 759
  • [8] Recursive partitioning for missing data imputation in the presence of interaction effects
    Doove, L. L.
    Van Buuren, S.
    Dusseldorp, E.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 72 : 92 - 104
  • [9] Planned missing data designs in psychological research
    Graham, John W.
    Taylor, Bonnie J.
    Olchowski, Allison E.
    Cumsille, Patricio E.
    [J]. PSYCHOLOGICAL METHODS, 2006, 11 (04) : 323 - 343
  • [10] How many imputations are really needed? - Some practical clarifications of multiple imputation theory
    Graham, John W.
    Olchowski, Allison E.
    Gilreath, Tamika D.
    [J]. PREVENTION SCIENCE, 2007, 8 (03) : 206 - 213