Learning Causal Effects via Weighted Empirical Risk Minimization

被引:0
作者
Jung, Yonghan [1 ]
Tian, Jin [2 ]
Bareinboim, Elias [3 ]
机构
[1] Purdue Univ Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[2] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[3] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020 | 2020年 / 33卷
关键词
INFERENCE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning causal effects from data is a fundamental problem across the sciences. Determining the identifiability of a target effect from a combination of the observational distribution and the causal graph underlying a phenomenon is well-understood in theory. However, in practice, it remains a challenge to apply the identification theory to estimate the identified causal functionals from finite samples. Although a plethora of effective estimators have been developed under the setting known as the back-door (also called conditional ignorability), there exists still no systematic way of estimating arbitrary causal functionals that are both computationally and statistically attractive. This paper aims to bridge this gap, from causal identification to causal estimation. We note that estimating functionals from limited samples based on the empirical risk minimization (ERM) principle has been pervasive in the machine learning literature, and these methods have been extended to causal inference under the back-door setting. In this paper, we develop a learning framework that marries two families of methods, benefiting from the generality of the causal identification theory and the effectiveness of the estimators produced based on the principle of ERM. Specifically, we develop a sound and complete algorithm that generates causal functionals in the form of weighted distributions that are amenable to the ERM optimization. We then provide a practical procedure for learning causal effects from finite samples and a causal graph. Finally, experimental results support the effectiveness of our approach.
引用
收藏
页数:13
相关论文
共 57 条
[1]  
[Anonymous], 2006, P 21 NATL C ARTIFICI, DOI DOI 10.5555/1597348.1597382
[2]  
[Anonymous], 2006, P 22 C UNCERTAINTY A, DOI DOI 10.5555/3020419.3020446
[3]  
[Anonymous], 2006, International Journal of Biostatistics, DOI 10.2202/1557-4679.1043
[4]  
[Anonymous], 2020, PROCEEDINGS OF THE 3, DOI DOI 10.1109/MEMS46641.2020.9056111
[5]  
[Anonymous], 2002, P 18 C UNCERTAINTY A
[6]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[7]  
Bareinboim E., 2012, P 28 C UNCERTAINTY A
[8]   Causal inference and the data-fusion problem [J].
Bareinboim, Elias ;
Pearl, Judea .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7345-7352
[9]  
Ben-David S., 2007, Advances in Neural Information Processing Systems, P137
[10]   A theory of learning from different domains [J].
Ben-David, Shai ;
Blitzer, John ;
Crammer, Koby ;
Kulesza, Alex ;
Pereira, Fernando ;
Vaughan, Jennifer Wortman .
MACHINE LEARNING, 2010, 79 (1-2) :151-175