Local Search for Efficient Causal Effect Estimation

被引:13
作者
Cheng, Debo [1 ]
Li, Jiuyong [1 ]
Liu, Lin [1 ]
Zhang, Jiji [2 ]
Liu, Jixue [1 ]
Le, Thuc Duy [1 ]
机构
[1] Univ South Australia, UniSA STEM, Adelaide, SA 5095, Australia
[2] Hong Kong Baptist Univ, Dept Relig & Philosophy, Hong Kong, Peoples R China
基金
澳大利亚研究理事会;
关键词
Observational data; causal inference; graphical causal modelling; confounding bias; latent variables; GAUSSIAN ACYCLIC MODEL; PROPENSITY SCORE; SELECTION; MARKOV; LATENT; DISCOVERY; INFERENCE; GRAPHS; ROBUST;
D O I
10.1109/TKDE.2022.3218131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Causal effect estimation from observational data is a challenging problem, especially with high dimensional data and in the presence of unobserved variables. The available data-driven methods for tackling the problem either provide an estimation of the bounds of a causal effect (i.e., nonunique estimation) or have low efficiency. The major hurdle for achieving high efficiency while trying to obtain unique and unbiased causal effect estimation is how to find a proper adjustment set for confounding control in a fast way, given the huge covariate space and considering unobserved variables. In this paper, we approach the problem as a local search task for finding valid adjustment sets in data. We establish the theorems to support the local search for adjustment sets, and we show that unique and unbiased estimation can be achieved from observational data even when there exist unobserved variables. We then propose a data-driven algorithm that is fast and consistent under mild assumptions. We also make use of a frequent pattern mining method to further speed up the search of minimal adjustment sets for causal effect estimation. Experiments conducted on extensive synthetic and real-world datasets demonstrate that the proposed algorithm outperforms the state-of-the-art criteria/estimators in both accuracy and time-efficiency.
引用
收藏
页码:8823 / 8837
页数:15
相关论文
共 57 条
[1]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[2]  
Aliferis CF, 2010, J MACH LEARN RES, V11, P171
[3]   The costs of low birth weight [J].
Almond, D ;
Chay, KY ;
Lee, DS .
QUARTERLY JOURNAL OF ECONOMICS, 2005, 120 (03) :1031-1083
[4]   2-STAGE LEAST-SQUARES ESTIMATION OF AVERAGE CAUSAL EFFECTS IN MODELS WITH VARIABLE TREATMENT INTENSITY [J].
ANGRIST, JD ;
IMBENS, GW .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (430) :431-442
[5]   Doubly robust estimation in missing data and causal inference models [J].
Bang, H .
BIOMETRICS, 2005, 61 (04) :962-972
[6]   LIMITATIONS OF THE APPLICATION OF FOURFOLD TABLE ANALYSIS TO HOSPITAL DATA [J].
BERKSON, J .
BIOMETRICS BULLETIN, 1946, 2 (03) :47-53
[7]   Lasso adjustments of treatment effect estimates in randomized experiments [J].
Bloniarz, Adam ;
Liu, Hanzhong ;
Zhang, Cun-Hui ;
Sekhon, Jasjeet S. ;
Yu, Bin .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (27) :7383-7390
[8]  
Bowden R. J., 1990, Instrumental Variables
[9]   Variable selection in high-dimensional linear models: partially faithful distributions and the PC-simple algorithm [J].
Buehlmann, P. ;
Kalisch, M. ;
Maathuis, M. H. .
BIOMETRIKA, 2010, 97 (02) :261-278
[10]   Causal Discovery in Linear Non-Gaussian Acyclic Model With Multiple Latent Confounders [J].
Chen, Wei ;
Cai, Ruichu ;
Zhang, Kun ;
Hao, Zhifeng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (07) :2816-2827