Formulating causal questions and principled statistical answers

被引:61
作者
Goetghebeur, Els [1 ,2 ]
le Cessie, Saskia [3 ]
De Stavola, Bianca [4 ]
Moodie, Erica E. M. [5 ]
Waernbaum, Ingeborg [6 ]
机构
[1] Univ Ghent, Dept Appl Math Comp Sci & Stat, Ghent, Belgium
[2] Karolinska Inst, Dept Med Epidemiol & Biostat, Stockholm, Sweden
[3] Leiden Univ, Med Ctr, Dept Clin Epidemiol Biomed Data Sci, Leiden, Netherlands
[4] UCL, Great Ormond St Inst Child Hlth, London, England
[5] McGill Univ, Div Biostat, Montreal, PQ, Canada
[6] Uppsala Univ, Dept Stat, Uppsala, Sweden
基金
瑞典研究理事会; 英国医学研究理事会; 加拿大自然科学与工程研究理事会;
关键词
causation; instrumental variable; inverse probability weighting; matching; potential outcomes; propensity score; MARGINAL STRUCTURAL MODELS; INSTRUMENTAL VARIABLES ESTIMATION; MENDELIAN RANDOMIZATION; PROPENSITY SCORE; MEDIATION ANALYSIS; TREATMENT REGIMES; INFERENCE; IDENTIFICATION; INTERVENTIONS; NONCOMPLIANCE;
D O I
10.1002/sim.8741
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Although review papers on causal inference methods are now available, there is a lack of introductory overviews onwhatthey can render and on the guiding criteria for choosing one particular method. This tutorial gives an overview in situations where an exposure of interest is set at a chosen baseline ("point exposure") and the target outcome arises at a later time point. We first phrase relevant causal questions and make a case for being specific about the possible exposure levels involved and the populations for which the question is relevant. Using the potential outcomes framework, we describe principled definitions of causal effects and of estimation approaches classified according to whether they invoke the no unmeasured confounding assumption (including outcome regression and propensity score-based methods) or an instrumental variable with added assumptions. We mainly focus on continuous outcomes and causal average treatment effects. We discuss interpretation, challenges, and potential pitfalls and illustrate application using a "simulation learner," that mimics the effect of various breastfeeding interventions on a child's later development. This involves a typical simulation component with generated exposure, covariate, and outcome data inspired by a randomized intervention study. The simulation learner further generates various (linked) exposure types with a set of possible values per observation unit, from which observed as well as potential outcome data are generated. It thus provides true values of several causal effects. R code for data generation and analysis is available on , where SAS and Stata code for analysis is also provided.
引用
收藏
页码:4922 / 4948
页数:27
相关论文
共 85 条
[1]   On the Failure of the Bootstrap for Matching Estimators [J].
Abadie, Alberto ;
Imbens, Guido W. .
ECONOMETRICA, 2008, 76 (06) :1537-1557
[2]   Should a propensity score model be super? The utility of ensemble procedures for causal adjustment [J].
Alam, Shomoita ;
Moodie, Erica E. M. ;
Stephens, David A. .
STATISTICS IN MEDICINE, 2019, 38 (09) :1690-1702
[3]  
Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[4]  
Angrist JD, 2009, MOSTLY HARMLESS ECONOMETRICS: AN EMPIRICISTS COMPANION, P1
[5]   Instrumental variables and the search for identification: From supply and demand to natural experiments [J].
Angrist, JD ;
Krueger, AB .
JOURNAL OF ECONOMIC PERSPECTIVES, 2001, 15 (04) :69-85
[6]   Approximate residual balancing: debiased inference of average treatment effects in high dimensions [J].
Athey, Susan ;
Imbens, Guido W. ;
Wager, Stefan .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2018, 80 (04) :597-623
[7]   Balance diagnostics for comparing the distribution of baseline covariates between treatment groups in propensity-score matched samples [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2009, 28 (25) :3083-3107
[8]   Comparison of causal effect estimators under exposure misclassification [J].
Babanezhad, Manoochehr ;
Vansteelandt, Stijn ;
Goetghebeur, Els .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (05) :1306-1319
[9]   Instrumental variable methods for causal inference [J].
Baiocchi, Michael ;
Cheng, Jing ;
Small, Dylan S. .
STATISTICS IN MEDICINE, 2014, 33 (13) :2297-2340
[10]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972