MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models

Cited by: 0
Authors
Gao, Erdun [1]
Ng, Ignavier [2]
Gong, Mingming [1]
Shen, Li [3]
Huang, Wei [1]
Liu, Tongliang [4]
Zhang, Kun [2, 5]
Bondell, Howard [1]
Affiliations
[1] University of Melbourne, Parkville, Australia
[2] Carnegie Mellon University, Pittsburgh, PA, USA
[3] JD Explore Academy, Beijing, People's Republic of China
[4] University of Sydney, Sydney, Australia
[5] Mohamed Bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
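Funding
National Institutes of Health (USA); Australian Research Council
Keywords
BAYESIAN NETWORKS; EM ALGORITHM; IMPUTATION
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract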
State-of-the-art causal discovery methods usually assume that the observational data is complete. However, the missing data problem is pervasive in many practical scenarios such as clinical trials, economics, and biology. One straightforward way to address the missing data problem is first to impute the data using off-the-shelf imputation methods and then apply existing causal discovery methods. However, such a two-step method may suffer from suboptimality, as the imputation algorithm may introduce bias for modeling the underlying data distribution. In this paper, we develop a general method, which we call MissDAG, to perform causal discovery from data with incomplete observations. Focusing mainly on the assumptions of ignorable missingness and the identifiable additive noise models (ANMs), MissDAG maximizes the expected likelihood of the visible part of observations under the expectation-maximization (EM) framework. In the E-step, in cases where computing the posterior distributions of parameters in closed-form is not feasible, Monte Carlo EM is leveraged to approximate the likelihood. In the M-step, MissDAG leverages the density transformation to model the noise distributions with simpler and specific formulations by virtue of the ANMs and uses a likelihood-based causal discovery algorithm with directed acyclic graph constraint. We demonstrate the flexibility of MissDAG for incorporating various causal discovery algorithms and its efficacy through extensive simulations and real data experiments.
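The EM idea in the abstract can be illustrated with a minimal sketch. This is not the MissDAG implementation. It assumes a plain multivariate Gaussian, missing values coded as NaN and missing completely at random, and it replaces the DAG-constrained M-step with an unconstrained mean and covariance update. The function name `em_gaussian_missing` and the toy three-variable chain are hypothetical.

```python
import numpy as np

def em_gaussian_missing(X, n_iter=30):
    """EM estimate of Gaussian mean/covariance from data with NaN-coded gaps.

    Illustrative only: the M-step is an unconstrained update, whereas the
    abstract describes a DAG-constrained causal discovery step.
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    mask = np.isnan(X)                        # True where a value is missing
    mu = np.nanmean(X, axis=0)                # initialise from observed values
    sigma = np.diag(np.nanvar(X, axis=0) + 1e-6)
    for _ in range(n_iter):
        s1 = np.zeros(d)                      # running sum of E[x]
        s2 = np.zeros((d, d))                 # running sum of E[x x^T]
        for i in range(n):
            m, o = mask[i], ~mask[i]
            x_hat = X[i].copy()
            cond_cov = np.zeros((d, d))       # covariance of the missing block
            if m.any():
                if o.any():
                    # E-step: condition missing entries on observed ones
                    gain = sigma[np.ix_(m, o)] @ np.linalg.inv(sigma[np.ix_(o, o)])
                    x_hat[m] = mu[m] + gain @ (X[i, o] - mu[o])
                    cond_cov[np.ix_(m, m)] = (
                        sigma[np.ix_(m, m)] - gain @ sigma[np.ix_(o, m)]
                    )
                else:
                    # Row entirely missing: fall back to the current marginal
                    x_hat[m] = mu[m]
                    cond_cov[np.ix_(m, m)] = sigma[np.ix_(m, m)]
            s1 += x_hat
            s2 += np.outer(x_hat, x_hat) + cond_cov
        # M-step: closed-form update from expected sufficient statistics
        mu = s1 / n
        sigma = s2 / n - np.outer(mu, mu)
    return mu, sigma

# Toy linear Gaussian chain x1 -> x2 -> x3 (hypothetical example data)
rng = np.random.default_rng(0)
n_samples = 3000
x1 = rng.normal(size=n_samples)
x2 = 1.5 * x1 + rng.normal(size=n_samples)
x3 = -1.0 * x2 + rng.normal(size=n_samples)
X_full = np.column_stack([x1, x2, x3])

# Hide about 20% of the entries completely at random
X = X_full.copy()
X[rng.random(X.shape) < 0.2] = np.nan

mu, sigma = em_gaussian_missing(X)
true_cov = np.array([[1.0, 1.5, -1.5],
                     [1.5, 3.25, -3.25],
                     [-1.5, -3.25, 4.25]])
print(np.round(sigma, 2))
```

In a pipeline of the kind the abstract describes, the expected sufficient statistics from the E-step would feed a likelihood-based structure learner rather than the closed-form covariance update used here.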
Pages: 15
Related papers
50 in total
  • [21] Reconstruction of causal graphs for multivariate processes in the presence of missing data
    Agarwal, Piyush
    Tangirala, Arun K.
    2017 4TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2017, : 389 - 394
  • [22] Neural Additive Vector Autoregression Models for Causal Discovery in Time Series
    Bussmann, Bart
    Nys, Jannes
    Latre, Steven
    DISCOVERY SCIENCE (DS 2021), 2021, 12986 : 446 - 460
  • [23] Doubly robust estimation in missing data and causal inference models
    Bang, H
    BIOMETRICS, 2005, 61 (04) : 962 - 972
  • [24] How to detect the Granger-causal flow direction in the presence of additive noise?
    Vinck, Martin
    Huurdeman, Lisanne
    Bosman, Conrado A.
    Fries, Pascal
    Battaglia, Francesco P.
    Pennartz, Cyriel M. A.
    Tiesinga, Paul H.
    NEUROIMAGE, 2015, 108 : 301 - 318
  • [25] On the continuous time additive Gaussian noise channel in the presence of perfect feedback
    Chawla, Aman
    Morgera, Salvatore Domenic
    2020 IEEE INFORMATION THEORY WORKSHOP (ITW), 2021,
  • [26] Data-driven local bandwidth selection for additive models with missing data
    Raya-Miranda, R.
    Martinez-Miranda, M. D.
    APPLIED MATHEMATICS AND COMPUTATION, 2011, 217 (24) : 10328 - 10342
  • [27] iSCAN: Identifying Causal Mechanism Shifts among Nonlinear Additive Noise Models
    Chen, Tianyu
    Bello, Kevin
    Aragam, Bryon
    Ravikumar, Pradeep
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [28] iSCAN: Identifying Causal Mechanism Shifts among Nonlinear Additive Noise Models
    Chen, Tianyu
    Bello, Kevin
    Aragam, Bryon
    Ravikumar, Pradeep
    Advances in Neural Information Processing Systems, 2023, 36 : 44671 - 44706
  • [29] Graphical Models for Recovering Probabilistic and Causal Queries from Missing Data
    Mohan, Karthika
    Pearl, Judea
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [30] Recoverability of causal effects under presence of missing data: a longitudinal case study
    Holovchak, Anastasiia
    McIlleron, Helen
    Denti, Paolo
    Schomaker, Michael
    BIOSTATISTICS, 2024, 26 (01)