Long-term causal inference under persistent confounding via data combination

被引:0
作者
Imbens, Guido [1 ]
Kallus, Nathan [2 ]
Mao, Xiaojie [3 ]
Wang, Yuhao [4 ,5 ]
机构
[1] Stanford Univ, Grad Sch Business, 655 Knight Way, Stanford, CA 94305 USA
[2] Cornell Univ, Cornell Tech, New York, NY 10044 USA
[3] Tsinghua Univ, Sch Econ & Management, Beijing 100084, Peoples R China
[4] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
[5] Shanghai Qi Zhi Inst, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划; 美国国家科学基金会;
关键词
data combination; doubly robust estimation; long-term causal inference; proxy variables; unobserved confounding; MODELS; IDENTIFICATION; VARIABLES; MOMENTS;
D O I
10.1093/jrsssb/qkae095
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We study the identification and estimation of long-term treatment effects by combining short-term experimental data and long-term observational data subject to unobserved confounding. This problem arises often when concerned with long-term treatment effects since experiments are often short-term due to operational necessity while observational data can be more easily collected over longer time frames but may be subject to confounding. In this paper, we tackle the challenge of persistent confounding: unobserved confounders that can simultaneously affect the treatment, short-term outcomes, and long-term outcome. In particular, persistent confounding invalidates identification strategies in previous approaches to this problem. To address this challenge, we exploit the sequential structure of multiple short-term outcomes and develop several novel identification strategies for the average long-term treatment effect. Based on these, we develop estimation and inference methods with asymptotic guarantees. To demonstrate the importance of handling persistent confounders, we apply our methods to estimate the effect of a job training program on long-term employment using semi-synthetic data.
引用
收藏
页码:362 / 388
页数:27
相关论文
共 89 条