Causal data fusion methods using summary-level statistics for a continuous outcome

被引:7
作者
Li, Hongkai [1 ]
Miao, Wang [2 ]
Cai, Zheng [1 ]
Liu, Xinhui [3 ]
Zhang, Tao [3 ]
Xue, Fuzhong [3 ]
Geng, Zhi [1 ]
机构
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
[3] Shandong Univ, Sch Publ Hlth, Jinan 250100, Peoples R China
基金
中国国家自然科学基金;
关键词
causal inference; data fusion; identification; incomplete confounders;
D O I
10.1002/sim.8461
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In many empirical studies, there exist rich individual studies to separately estimate causal effect of the treatment or exposure variable on the outcome variable, but incomplete confounders are adjusted in each study. Suppose we are interested in the causal effect of a treatment or exposure on an outcome variable, and we have available rich datasets that contain different confounders. How to integrate summary-level statistics from multiple individual datasets to improve causal inference has become a main challenge in data fusion. We propose a novel method in this article to identify the causal effect of a treatment or exposure on the continuous outcome. We show that the causal effect is identifiable and can be estimated by combining summary-level statistics from multiple datasets containing subsets of confounders and an external dataset only containing complete confounding information. Simulation studies indicate the unbiasedness of causal effect estimate by our method and we apply our method to a study about the effect of body mass index on fasting blood glucose.
引用
收藏
页码:1054 / 1067
页数:14
相关论文
共 17 条
[1]  
ANGRIST JD, 1992, J AM STAT ASSOC, V87, P328
[2]  
[Anonymous], 2010, P 5 EUR WORKSH PROB
[3]  
[Anonymous], 2010, P 13 INT C ARTIFICIA
[4]   Causal inference and the data-fusion problem [J].
Bareinboim, Elias ;
Pearl, Judea .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7345-7352
[5]  
Borboudakis G, 2011, P EUR S ART NEUR NET, P8
[6]   A sufficient condition for pooling data [J].
Eberhardt, Frederick .
SYNTHESE, 2008, 163 (03) :433-442
[7]  
Klevmarken A, 1982, WORKING PAPER SERIES, V62
[8]  
Little R. J., 2019, Statistical analysis with missing data, V793
[9]  
Meek C., 1995, Uncertainty in Artificial Intelligence. Proceedings of the Eleventh Conference (1995), P403
[10]  
O'Donnell RT, 2006, TECHNICAL REPORT