One-stage Deep Instrumental Variable Method for Causal Inference from Observational Data

被引:3
作者
Lin, Adi [1 ]
Lu, Jie [1 ]
Xuan, Junyu [1 ]
Zhu, Fujin [1 ,2 ]
Zhang, Guangquan [1 ]
机构
[1] Univ Technol Sydney, FEIT, Ctr Artificial Intelligence, Sydney, NSW, Australia
[2] Beijing Inst Technol, Sch Management & Econ, Beijing, Peoples R China
来源
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019年
基金
澳大利亚研究理事会;
关键词
observational data; causal inference; instrumental variable; neural networks; PROPENSITY SCORE; ECONOMETRICS;
D O I
10.1109/ICDM.2019.00052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Causal inference from observational data aims to estimate causal effects when controlled experimentation is not feasible, but it faces challenges when unobserved confounders exist. The instrumental variable method resolves this problem by introducing a variable that is correlated with the treatment and affects the outcome only through the treatment. However, existing instrumental variable methods require two stages to separately estimate the conditional treatment distribution and the outcome generating function, which is not sufficiently effective. This paper presents a one-stage approach to jointly estimate the treatment distribution and the outcome generating function through a cleverly designed deep neural network structure. This study is the first to merge the two stages to leverage the outcome to the treatment distribution estimation. Further, the new deep neural network architecture is designed with two strategies (i.e., shared and separate) of learning a confounder representation account for different observational data. Such network architecture can unveil complex relationships between confounders, treatments, and outcomes. Experimental results show that our proposed method outperforms the state-of-the-art methods. It has a wide range of applications, from medical treatment design to policy making, population regulation and beyond.
引用
收藏
页码:419 / 428
页数:10
相关论文
共 33 条
[1]  
Alaa Ahmed M, 2017, 31 C NEUR INF PROC S
[2]  
Angrist JD, 1996, J AM STAT ASSOC, V91, P444, DOI 10.2307/2291629
[3]  
[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[4]  
[Anonymous], 1985, Technical Report, DOI DOI citeulike-article-id:3847802
[5]   Recursive partitioning for heterogeneous causal effects [J].
Athey, Susan ;
Imbens, Guido .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7353-7360
[6]   The performance of different propensity score methods for estimating marginal hazard ratios [J].
Austin, Peter C. .
STATISTICS IN MEDICINE, 2013, 32 (16) :2837-2849
[7]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[8]  
Bembom O, 2007, STAT APPL GENET MOL, V6
[9]   MACHINE LEARNING IN ECONOMETRICS Double/Debiased/Neyman Machine Learning of Treatment Effects [J].
Chernozhukov, Victor ;
Chetverikov, Denis ;
Demirer, Mert ;
Duflo, Esther ;
Hansen, Christian ;
Newey, Whitney .
AMERICAN ECONOMIC REVIEW, 2017, 107 (05) :261-265
[10]   NONPARAMETRIC INSTRUMENTAL REGRESSION [J].
Darolles, S. ;
Fan, Y. ;
Florens, J. P. ;
Renault, E. .
ECONOMETRICA, 2011, 79 (05) :1541-1565