Data integration in causal inference

被引:15
|
作者
Shi, Xu [1 ]
Pan, Ziyang [1 ]
Miao, Wang [2 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Peking Univ, Dept Probabil & Stat, Beijing, Peoples R China
关键词
causal inference; data fusion; data integration; generalizability; transportability; HISTORICAL CONTROL DATA; MENDELIAN RANDOMIZATION; PROPENSITY SCORE; INSTRUMENTAL VARIABLES; CLINICAL-TRIALS; GENERALIZING EVIDENCE; MULTIPLE IMPUTATION; PRIOR DISTRIBUTIONS; VALIDATION DATA; REGRESSION;
D O I
10.1002/wics.1581
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially heterogeneous populations. We summarize recent advances on combining randomized clinical trials with external information from observational studies or historical controls, combining samples when no single sample has all relevant variables with application to two-sample Mendelian randomization, distributed data setting under privacy concerns for comparative effectiveness and safety research using real-world data, Bayesian causal inference, and causal discovery methods. This article is categorized under: Statistical Models > Semiparametric Models Applications of Computational Statistics > Clinical Trials
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Causal inference with observational data
    Nichols, Austin
    STATA JOURNAL, 2007, 7 (04) : 507 - 541
  • [2] Mendelian randomization: causal inference leveraging genetic data
    Chen, Lane G.
    Tubbs, Justin D.
    Liu, Zipeng
    Thach, Thuan-Quoc
    Sham, Pak C.
    PSYCHOLOGICAL MEDICINE, 2024, 54 (08) : 1461 - 1474
  • [3] Causal inference and data fusion in econometrics
    Huenermund, Paul
    Bareinboim, Elias
    ECONOMETRICS JOURNAL, 2023, 28 (01) : 41 - 82
  • [4] Causal Aggregation: Estimation and Inference of Causal Effects by Constraint-Based Data Fusion
    Gimenez, Jaime Roquero
    Rothenhausler, Dominik
    JOURNAL OF MACHINE LEARNING RESEARCH, 2022, 23
  • [5] Marginal integration for nonparametric causal inference
    Ernest, Jan
    Buehlmann, Peter
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (02): : 3155 - 3194
  • [6] Causal inference and the data-fusion problem
    Bareinboim, Elias
    Pearl, Judea
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) : 7345 - 7352
  • [7] A Causal Dirichlet Mixture Model for Causal Inference from Observational Data
    Lin, Adi
    Lu, Jie
    Xuan, Junyu
    Zhu, Fujin
    Zhang, Guangquan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (03)
  • [8] Instrumental variable methods for causal inference
    Baiocchi, Michael
    Cheng, Jing
    Small, Dylan S.
    STATISTICS IN MEDICINE, 2014, 33 (13) : 2297 - 2340
  • [9] Child welfare and the challenge of causal inference
    Foster, E. Michael
    McCombs-Thornton, Kimberly
    CHILDREN AND YOUTH SERVICES REVIEW, 2013, 35 (07) : 1130 - 1142
  • [10] Handling Missing Data in Instrumental Variable Methods for Causal Inference
    Kennedy, Edward H.
    Mauro, Jacqueline A.
    Daniels, Michael J.
    Burns, Natalie
    Small, Dylan S.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 6, 2019, 6 : 125 - 148