Combining observational and experimental data for causal inference considering data privacy

Authors
Mann, Charlotte Z. [1]
Sales, Adam C. [2]
Gagnon-Bartsch, Johann A. [3]
Affiliations
[1] Calif Polytech State Univ San Luis Obispo, Stat Dept, San Luis Obispo, CA 93407 USA
[2] Worcester Polytech Inst, Math Sci, Worcester, MA USA
[3] Univ Michigan, Dept Stat, Ann Arbor, MI USA
Funding
National Science Foundation (United States)
Keywords
data integration; statistical disclosure control; differential privacy
DOI
10.1515/jci-2022-0081
Chinese Library Classification
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
Combining observational and experimental data for causal inference can improve treatment effect estimation. However, many observational datasets cannot be released due to data privacy considerations, so one researcher may not have access to both experimental and observational data. Nonetheless, a small amount of risk of disclosing sensitive information might be tolerable to organizations that house confidential data. In these cases, organizations can employ data privacy techniques, which decrease disclosure risk, potentially at the expense of data utility. In this study, we explore disclosure limiting transformations of observational data, which can be combined with experimental data to estimate the sample and population average treatment effects. We consider leveraging observational data to improve generalizability of treatment effect estimates, when a randomized controlled trial (RCT) is not representative of the population of interest, and to increase precision of treatment effect estimates. Through simulation studies, we illustrate the trade-off between privacy and utility when employing different disclosure limiting transformations. We find that leveraging transformed observational data in treatment effect estimation can still improve estimation over only using data from an RCT.
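
The precision gain described in the abstract can be illustrated with a toy simulation. This is only a minimal sketch under assumptions that are not taken from the paper: a linear outcome model, Laplace noise added to fitted coefficients as a stand-in for a disclosure-limiting transformation (with no formal privacy guarantee claimed), and a residual-based adjustment of the RCT estimate. All names and parameter values (`beta`, `tau`, `released`, the noise scale) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
beta = np.array([2.0, -1.0, 0.5])  # hypothetical outcome coefficients
tau = 1.0                          # hypothetical true treatment effect

# Confidential observational data (untreated units only, for simplicity).
x_obs = rng.normal(size=(5000, 3))
y_obs = x_obs @ beta + rng.normal(size=5000)

# The data holder fits an outcome model and perturbs the coefficients
# before release. The noise scale is illustrative, not calibrated.
coef, *_ = np.linalg.lstsq(x_obs, y_obs, rcond=None)
released = coef + rng.laplace(scale=0.05, size=coef.shape)

def estimates(n=200):
    """Simulate one RCT; return (difference-in-means, adjusted) estimates."""
    x = rng.normal(size=(n, 3))
    z = rng.permutation(np.repeat([0, 1], n // 2))  # balanced random assignment
    y = x @ beta + tau * z + rng.normal(size=n)
    resid = y - x @ released  # remove the part predicted by the released model
    dim = y[z == 1].mean() - y[z == 0].mean()
    adj = resid[z == 1].mean() - resid[z == 0].mean()
    return dim, adj

draws = np.array([estimates() for _ in range(500)])
var_dim, var_adj = draws.var(axis=0)
print(f"variance, RCT only: {var_dim:.4f}; with released model: {var_adj:.4f}")
```

Because the released coefficients are independent of the randomized assignment, the adjusted estimator in this sketch remains centered on `tau`; increasing the Laplace scale shrinks the variance reduction, which mirrors the privacy and utility trade-off the abstract refers to.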
Pages: 23