Combining Multiple Observational Data Sources to Estimate Causal Effects

被引:39
|
作者
Yang, Shu [1 ]
Ding, Peng [2 ]
机构
[1] North Carolina State Univ, Dept Stat, 2311 Stinson Dr Campus Box 8203, Raleigh, NC 27695 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
Calibration; Causal inference; Inverse probability weighting; Missing confounder; Two-phase sampling; PROPENSITY SCORE CALIBRATION; DOUBLY ROBUST ESTIMATION; LARGE-SAMPLE PROPERTIES; AUXILIARY INFORMATION; MISSING CONFOUNDERS; MATCHING ESTIMATORS; VALIDATION DATA; REGRESSION; INFERENCE; 2-PHASE;
D O I
10.1080/01621459.2019.1609973
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The era of big data has witnessed an increasing availability of multiple data sources for statistical analyses. We consider estimation of causal effects combining big main data with unmeasured confounders and smaller validation data withon these confounders. Under the unconfoundedness assumption with completely observed confounders, the smaller validation data allow for constructing consistent estimators for causal effects, but the big main data can only give error-prone estimators in general. However, by leveraging the information in the big main data in a principled way, we can improve the estimation efficiencies yet preserve the consistencies of the initial estimators based solely on the validation data. Our framework applies to asymptotically normal estimators, including the commonly used regression imputation, weighting, and matching estimators, and does not require a correct specification of the model relating the unmeasured confounders to the observed variables. We also propose appropriate bootstrap procedures, which makes our method straightforward to implement using software routines for existing estimators.for this article are available online.
引用
收藏
页码:1540 / 1554
页数:15
相关论文
共 50 条
  • [41] Multiple imputation procedures for estimating causal effects with multiple treatments with application to the comparison of healthcare providers
    Silva, Gabriella C.
    Gutman, Roee
    STATISTICS IN MEDICINE, 2022, 41 (01) : 208 - 226
  • [42] Causal inference with observational data in addiction research
    Chan, Gary C. K.
    Lim, Carmen
    Sun, Tianze
    Stjepanovic, Daniel
    Connor, Jason
    Hall, Wayne
    Leung, Janni
    ADDICTION, 2022, 117 (10) : 2736 - 2744
  • [43] Objective causal predictions from observational data
    Cox Jr, Louis Anthony
    CRITICAL REVIEWS IN TOXICOLOGY, 2024,
  • [44] Causal Inference From Observational Data: It Is Complicated
    Shpitser, Ilya
    Kudchadkar, Sapna R.
    Fackler, James
    PEDIATRIC CRITICAL CARE MEDICINE, 2021, 22 (12) : 1093 - 1096
  • [45] Disproving Causal Relationships Using Observational Data
    Bryant, Henry L.
    Bessler, David A.
    Haigh, Michael S.
    OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 2009, 71 (03) : 357 - 374
  • [46] Causal Query in Observational Data with Hidden Variables
    Cheng, Debo
    Li, Jiuyong
    Liu, Lin
    Liu, Jixue
    Yu, Kui
    Le, Thuc Duy
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 2551 - 2558
  • [47] MIMICKING COUNTERFACTUAL OUTCOMES TO ESTIMATE CAUSAL EFFECTS
    Lok, Judith J.
    ANNALS OF STATISTICS, 2017, 45 (02): : 461 - 499
  • [48] Integrative R-learner of heterogeneous treatment effects combining experimental and observational studies
    Wu, Lili
    Yang, Shu
    CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 177, 2022, 177
  • [49] Performance of propensity score matching to estimate causal effects in small samples
    Andrillon, Anais
    Pirracchio, Romain
    Chevret, Sylvie
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (03) : 644 - 658
  • [50] Matching estimators for causal effects of multiple treatments
    Scotina, Anthony D.
    Beaudoin, Francesca L.
    Gutman, Roee
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (04) : 1051 - 1066