Introduction to computational causal inference using reproducible Stata, R, and Python']Python code: A tutorial

被引:33
|
作者
Smith, Matthew J. [1 ]
Mansournia, Mohammad A. [2 ]
Maringe, Camille [1 ]
Zivich, Paul N. [3 ,4 ]
Cole, Stephen R. [3 ]
Leyrat, Clemence [1 ]
Belot, Aurelien [1 ]
Rachet, Bernard [1 ]
Luque-Fernandez, Miguel A. [1 ,5 ,6 ]
机构
[1] London Sch Hyg & Trop Med, Dept Noncommunicable Dis Epidemiol, Inequal Canc Outcomes Network, Keppel St, London WC1E 7HT, England
[2] Univ Tehran Med Sci, Dept Epidemiol & Biostat, Tehran, Iran
[3] Univ N Carolina, Dept Epidemiol, Chapel Hill, NC 27515 USA
[4] Univ N Carolina, Carolina Populat Ctr, Chapel Hill, NC 27515 USA
[5] Univ Granada, Andalusian Sch Publ Hlth, Inst Invest Biosanitaria Granada Ibs GRANADA, Noncommunicable Dis & Canc Epidemiol Grp, Granada, Spain
[6] Biomed Network Res Ctr Epidemiol & Publ Hlth CIBE, Madrid, Spain
关键词
causal inference; double-robust methods; g-formula; G-methods; inverse probability weighting; machine learning; propensity score; regression adjustment; targeted maximum likelihood estimation; MARGINAL STRUCTURAL MODELS; PROPENSITY SCORE; MORTALITY;
D O I
10.1002/sim.9234
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The main purpose of many medical studies is to estimate the effects of a treatment or exposure on an outcome. However, it is not always possible to randomize the study participants to a particular treatment, therefore observational study designs may be used. There are major challenges with observational studies; one of which is confounding. Controlling for confounding is commonly performed by direct adjustment of measured confounders; although, sometimes this approach is suboptimal due to modeling assumptions and misspecification. Recent advances in the field of causal inference have dealt with confounding by building on classical standardization methods. However, these recent advances have progressed quickly with a relative paucity of computational-oriented applied tutorials contributing to some confusion in the use of these methods among applied researchers. In this tutorial, we show the computational implementation of different causal inference estimators from a historical perspective where new estimators were developed to overcome the limitations of the previous estimators (ie, nonparametric and parametric g-formula, inverse probability weighting, double-robust, and data-adaptive estimators). We illustrate the implementation of different methods using an empirical example from the Connors study based on intensive care medicine, and most importantly, we provide reproducible and commented code in Stata, R, and Python for researchers to adapt in their own observational study. The code can be accessed at .
引用
收藏
页码:407 / 432
页数:26
相关论文
共 50 条
  • [1] Quasi-Experimental Design for Causal Inference Using Python']Python and Apache Spark: A Hands-on Tutorial
    Mojarad, Shirin
    Lewkow, Nicholas
    Essa, Alfred
    Zhang, Jie
    Feild, Jacqueline
    SEVENTH INTERNATIONAL LEARNING ANALYTICS & KNOWLEDGE CONFERENCE (LAK'17), 2017, : 502 - 503
  • [2] Machine learning using Stata/Python']Python
    Cerulli, Giovanni
    STATA JOURNAL, 2022, 22 (04): : 772 - 810
  • [3] Causal ML: Python']Python package for causal inference machine learning
    Zhao, Yang
    Liu, Qing
    SOFTWAREX, 2023, 21
  • [4] DockerizeMe: Automatic Inference of Environment Dependencies for Python']Python Code Snippets
    Horton, Eric
    Parnin, Chris
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 328 - 338
  • [5] Using AI to Inject Vulnerabilities in Python']Python Code
    Bogaerts, Frederic C. G.
    Ivaki, Naghmeh
    Fonseca, Jose
    2023 53RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS, DSN-W, 2023, : 223 - 230
  • [6] A TUTORIAL ON OBJECT RECOGNITION BY MACHINE LEARNING TECHNIQUES USING PYTHON']PYTHON
    Ruiz-Sarmiento, J. R.
    Monroy, J.
    Moreno, F. A.
    Gonzalez-Jimenez, J.
    13TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2019), 2019, : 3321 - 3330
  • [7] Meta-analysis using Python']Python: a hands-on tutorial
    Masoumi, Safoora
    Shahraz, Saeid
    BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [8] Computational Finance Using QuantLib-Python']Python
    Varma, Jayanth R.
    Virmani, Vineet
    COMPUTING IN SCIENCE & ENGINEERING, 2016, 18 (02) : 78 - 88
  • [9] Harvesting Twitter Data for Studying Motor Behavior in Disabled Populations: An Introduction and Tutorial in Python']Python
    Fears, Nicholas E.
    Chatterjee, Riya
    Tamplain, Priscila M.
    Miller, Haylie L.
    JOURNAL OF MOTOR LEARNING AND DEVELOPMENT, 2023, 11 (03) : 555 - 570
  • [10] Constrained Optimization for Decision Making in Health Care Using Python']Python: A Tutorial
    Leung, K. H. Benjamin
    Yousefi, Nasrin
    Chan, Timothy C. Y.
    Bayoumi, Ahmed M. M.
    MEDICAL DECISION MAKING, 2023, 43 (7-8) : 760 - 773