Combining Multiple Observational Data Sources to Estimate Causal Effects

被引:39
|
作者
Yang, Shu [1 ]
Ding, Peng [2 ]
机构
[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr Campus Box 8203, Raleigh, NC 27695 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
Calibration; Causal inference; Inverse probability weighting; Missing confounder; Two-phase sampling; PROPENSITY SCORE CALIBRATION; DOUBLY ROBUST ESTIMATION; LARGE-SAMPLE PROPERTIES; AUXILIARY INFORMATION; MISSING CONFOUNDERS; MATCHING ESTIMATORS; VALIDATION DATA; REGRESSION; INFERENCE; 2-PHASE;
D O I
10.1080/01621459.2019.1609973
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The era of big data has witnessed an increasing availability of multiple data sources for statistical analyses. We consider estimation of causal effects combining big main data with unmeasured confounders and smaller validation data withon these confounders. Under the unconfoundedness assumption with completely observed confounders, the smaller validation data allow for constructing consistent estimators for causal effects, but the big main data can only give error-prone estimators in general. However, by leveraging the information in the big main data in a principled way, we can improve the estimation efficiencies yet preserve the consistencies of the initial estimators based solely on the validation data. Our framework applies to asymptotically normal estimators, including the commonly used regression imputation, weighting, and matching estimators, and does not require a correct specification of the model relating the unmeasured confounders to the observed variables. We also propose appropriate bootstrap procedures, which makes our method straightforward to implement using software routines for existing estimators.for this article are available online.
引用
收藏
页码:1540 / 1554
页数:15
相关论文
共 50 条
  • [11] Bayesian doubly robust estimation of causal effects for clustered observational data
    Zhou, Qi
    He, Haonan
    Zhao, Jie
    Song, Joon Jin
    JOURNAL OF APPLIED STATISTICS, 2025,
  • [12] Conceptual framework for investigating causal effects from observational data in livestock
    Bello, Nora M.
    Ferreira, Vera C.
    Gianola, Daniel
    Rosa, Guilherme J. M.
    JOURNAL OF ANIMAL SCIENCE, 2018, 96 (10) : 4045 - 4062
  • [13] Learning Causal Effects From Observational Data in Healthcare: A Review and Summary
    Shi, Jingpu
    Norgeot, Beau
    FRONTIERS IN MEDICINE, 2022, 9
  • [14] Combining parametric and nonparametric models to estimate treatment effects in observational studies
    Daly-Grafstein, Daniel
    Gustafson, Paul
    BIOMETRICS, 2023, 79 (03) : 1986 - 1995
  • [15] Causal interaction trees: Finding subgroups with heterogeneous treatment effects in observational data
    Yang, Jiabei
    Dahabreh, Issa J.
    Steingrimsson, Jon A.
    BIOMETRICS, 2022, 78 (02) : 624 - 635
  • [16] Joint estimation of causal effects from observational and intervention gene expression data
    Rau, Andrea
    Jaffrezic, Florence
    Nuel, Gregory
    BMC SYSTEMS BIOLOGY, 2013, 7
  • [17] Matching with multiple controls to estimate treatment effects in observational studies
    Smith, HL
    SOCIOLOGICAL METHODOLOGY 1997, VOL 27, 1997, 27 : 325 - 353
  • [18] A Causal Dirichlet Mixture Model for Causal Inference from Observational Data
    Lin, Adi
    Lu, Jie
    Xuan, Junyu
    Zhu, Fujin
    Zhang, Guangquan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (03)
  • [19] Federated causal inference in heterogeneous observational data
    Xiong, Ruoxuan
    Koenecke, Allison
    Powell, Michael
    Shen, Zhu
    Vogelstein, Joshua T.
    Athey, Susan
    STATISTICS IN MEDICINE, 2023, 42 (24) : 4418 - 4439
  • [20] Causal discovery from observational and interventional data across multiple environments
    Li, Adam
    Jaber, Amin
    Bareinboim, Elias
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,