Estimating propensity scores with missing covariate data using general location mixture models

被引:32
作者
Mitra, Robin [1 ]
Reiter, Jerome P. [2 ]
机构
[1] Univ Southampton, Sch Math, Southampton SO17 1BJ, Hants, England
[2] Duke Univ, Dept Stat Sci, Durham, NC 27708 USA
基金
美国国家科学基金会;
关键词
latent class; matching; missing data; multiple imputation; observational studies; propensity score; LINEAR-MODELS; DISTRIBUTIONS; IMPUTATION; REGRESSION; INFERENCE; VALUES; BIAS;
D O I
10.1002/sim.4124
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In many observational studies, analysts estimate causal effects using propensity scores, e. g. by matching, sub-classifying, or inverse probability weighting based on the scores. Estimation of propensity scores is complicated when some values of the covariates are missing. Analysts can use multiple imputation to create completed data sets from which propensity scores can be estimated. We propose a general location mixture model for imputations that assumes that the control units are a latent mixture of (i) units whose covariates are drawn from the same distributions as the treated units' covariates and (ii) units whose covariates are drawn from different distributions. This formulation reduces the influence of control units outside the treated units' region of the covariate space on the estimation of parameters in the imputation model, which can result in more plausible imputations. In turn, this can result in more reliable estimates of propensity scores and better balance in the true covariate distributions when matching or sub-classifying. We illustrate the benefits of the latent class modeling approach with simulations and with an observational study of the effect of breast feeding on children's cognitive abilities. Copyright (C) 2010 John Wiley & Sons, Ltd.
引用
收藏
页码:627 / 641
页数:15
相关论文
共 42 条
[1]  
[Anonymous], 2000, SURV METHODOL
[2]  
Austin PC., 2009, Stat Med
[3]  
Austin PC, 2008, STAT MED, V27, P2037, DOI 10.1002/sim.3150
[5]   A latent-class mixture model for incomplete longitudinal Gaussian data [J].
Beunckens, Caroline ;
Molenberghs, Geert ;
Verbeke, Geert ;
Mallinckrodt, Craig .
BIOMETRICS, 2008, 64 (01) :96-105
[6]   AN ANALYSIS OF TRANSFORMATIONS [J].
BOX, GEP ;
COX, DR .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1964, 26 (02) :211-252
[7]   Full breastfeeding duration and associated decrease in respiratory tract infection in US children [J].
Chantry, CJ ;
Howard, CR ;
Auinger, P .
PEDIATRICS, 2006, 117 (02) :425-432
[8]   Hepatic resection compared to percutaneous ethanol injection for small hepatocellular carcinoma using propensity score matching [J].
Cho, Yong Beom ;
Lee, Kuhn Uk ;
Suh, Kyung-Suk ;
Kim, Yoon Jun ;
Yoon, Jung-Hwan ;
Lee, Hyo-Suk ;
Hahn, Seokyung ;
Park, Byung-Joo .
JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2007, 22 (10) :1643-1649
[9]   THE PLANNING OF OBSERVATIONAL STUDIES OF HUMAN-POPULATIONS [J].
COCHRAN, WG .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-GENERAL, 1965, 128 (02) :234-266
[10]   Estimating and using propensity scores with partially missing data [J].
D'Agostino, RB ;
Rubin, DB .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) :749-759