Collaborative-controlled LASSO for constructing propensity score-based estimators in high-dimensional data

被引:30
作者
Ju, Cheng [1 ]
Wyss, Richard [2 ,3 ]
Franklin, Jessica M. [2 ,3 ]
Schneeweiss, Sebastian [2 ,3 ]
Haggstrom, Jenny [4 ]
van der Laan, Mark J. [1 ]
机构
[1] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
[2] Brigham & Womens Hosp, Dept Med, Div Pharmacoepidemiol & Pharmacoecon, Boston, MA 02115 USA
[3] Harvard Med Sch, Boston, MA 02115 USA
[4] Umea Univ, USBE, Dept Stat, Umea, Sweden
基金
瑞典研究理事会;
关键词
Propensity score; average treatment effect; LASSO; model selection; electronic healthcare database; collaborative targeted minimum loss-based estimation; CONFOUNDING ADJUSTMENT; REGRESSION; PERFORMANCE; ROBUSTNESS; ALGORITHM; INFERENCE; SELECTION; MODELS; SAFETY; DRUG;
D O I
10.1177/0962280217744588
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Propensity score-based estimators are increasingly used for causal inference in observational studies. However, model selection for propensity score estimation in high-dimensional data has received little attention. In these settings, propensity score models have traditionally been selected based on the goodness-of-fit for the treatment mechanism itself, without consideration of the causal parameter of interest. Collaborative minimum loss-based estimation is a novel methodology for causal inference that takes into account information on the causal parameter of interest when selecting a propensity score model. This "collaborative learning" considers variable associations with both treatment and outcome when selecting a propensity score model in order to minimize a bias-variance tradeoff in the estimated treatment effect. In this study, we introduce a novel approach for collaborative model selection when using the LASSO estimator for propensity score estimation in high-dimensional covariate settings. To demonstrate the importance of selecting the propensity score model collaboratively, we designed quasi-experiments based on a real electronic healthcare database, where only the potential outcomes were manually generated, and the treatment and baseline covariates remained unchanged. Results showed that the collaborative minimum loss-based estimation algorithm outperformed other competing estimators for both point estimation and confidence interval coverage. In addition, the propensity score model selected by collaborative minimum loss-based estimation could be applied to other propensity score-based estimators, which also resulted in substantive improvement for both point estimation and confidence interval coverage. We illustrate the discussed concepts through an empirical example comparing the effects of non-selective nonsteroidal anti-inflammatory drugs with selective COX-2 inhibitors on gastrointestinal complications in a population of Medicare beneficiaries.
引用
收藏
页码:1044 / 1063
页数:20
相关论文
共 50 条
[21]  
Hajek J., 1971, Foundations of statistical inference, P236
[22]   Performance of the High-dimensional Propensity Score in a Nordic Healthcare Model [J].
Hallas, Jesper ;
Pottegard, Anton .
BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2017, 120 (03) :312-317
[23]   The role of the propensity score in estimating dose-response functions [J].
Imbens, GW .
BIOMETRIKA, 2000, 87 (03) :706-710
[24]  
Ju C, 2017, COLLABORATIVE TARGET
[25]  
Ju C, 2017, WORKING PAPER
[26]   Scalable collaborative targeted learning for high-dimensional data [J].
Ju, Cheng ;
Gruber, Susan ;
Lendle, Samuel D. ;
Chambaz, Antoine ;
Franklin, Jessica M. ;
Wyss, Richard ;
Schneeweiss, Sebastian ;
van der Laan, Mark J. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (02) :532-554
[27]  
Ju Cheng, 2017, ARXIV170401664
[28]   Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data [J].
Kang, Joseph D. Y. ;
Schafer, Joseph L. .
STATISTICAL SCIENCE, 2007, 22 (04) :523-539
[29]   Comparison of high-dimensional confounder summary scores in comparative studies of newly marketed medications [J].
Kumamaru, Hiraku ;
Gagnea, Joshua J. ;
Glynn, Robert J. ;
Setoguchi, Soko ;
Schneeweiss, Sebastian .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2016, 76 :200-208
[30]   Effects of aggregation of drug and diagnostic codes on the performance of the high-dimensional propensity score algorithm: an empirical example [J].
Le, Hoa V. ;
Poole, Charles ;
Brookhart, M. Alan ;
Schoenbach, Victor J. ;
Beach, Kathleen J. ;
Layton, J. Bradley ;
Stuermer, Til .
BMC MEDICAL RESEARCH METHODOLOGY, 2013, 13