MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models

被引:0
|
作者
Gao, Erdun [1 ]
Ng, Ignavier [2 ]
Gong, Mingming [1 ]
Shen, Li [3 ]
Huang, Wei [1 ]
Liu, Tongliang [4 ]
Zhang, Kun [2 ,5 ]
Bondell, Howard [1 ]
机构
[1] Univ Melbourne, Parkville, Australia
[2] Carnegie Mellon Univ, Pittsburgh, PA USA
[3] JD Explore Acad, Beijing, Peoples R China
[4] Univ Sydney, Sydney, Australia
[5] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
基金
美国国家卫生研究院; 澳大利亚研究理事会;
关键词
BAYESIAN NETWORKS; EM ALGORITHM; IMPUTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art causal discovery methods usually assume that the observational data is complete. However, the missing data problem is pervasive in many practical scenarios such as clinical trials, economics, and biology. One straightforward way to address the missing data problem is first to impute the data using off-the-shelf imputation methods and then apply existing causal discovery methods. However, such a two-step method may suffer from suboptimality, as the imputation algorithm may introduce bias for modeling the underlying data distribution. In this paper, we develop a general method, which we call MissDAG, to perform causal discovery from data with incomplete observations. Focusing mainly on the assumptions of ignorable missingness and the identifiable additive noise models (ANMs), MissDAG maximizes the expected likelihood of the visible part of observations under the expectation-maximization (EM) framework. In the E-step, in cases where computing the posterior distributions of parameters in closed-form is not feasible, Monte Carlo EM is leveraged to approximate the likelihood. In the M-step, MissDAG leverages the density transformation to model the noise distributions with simpler and specific formulations by virtue of the ANMs and uses a likelihood-based causal discovery algorithm with directed acyclic graph constraint. We demonstrate the flexibility of MissDAG for incorporating various causal discovery algorithms and its efficacy through extensive simulations and real data experiments.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Causal Discovery with Continuous Additive Noise Models
    Peters, Jonas
    Mooij, Joris M.
    Janzing, Dominik
    Schoelkopf, Bernhard
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 2009 - 2053
  • [2] Causal discovery with continuous additive noise models
    Peters, Jonas
    Mooij, Joris M.
    Janzing, Dominik
    Schölkopf, Bernhard
    Journal of Machine Learning Research, 2014, 15 : 2009 - 2053
  • [3] Identification of Causal Structure in the Presence of Missing Data with Additive Noise Model
    Qiao, Jie
    Chen, Zhengming
    Yu, Jianhua
    Cai, Ruichu
    Hao, Zhifeng
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20516 - 20523
  • [4] Causal Discovery in the Presence of Missing Data
    Tu, Ruibo
    Zhang, Cheng
    Ackermann, Paul
    Mohan, Karthika
    Kjellstrom, Hedvig
    Zhang, Kun
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [5] On the Robustness of Causal Discovery with Additive Noise Models on Discrete Data
    Du, Kang
    Goddard, Austin
    Xiang, Yu
    2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 365 - 365
  • [6] Causal Discovery with Cascade Nonlinear Additive Noise Models
    Cai, Ruichu
    Qiao, Jie
    Zhang, Kun
    Zhang, Zhenjie
    Hao, Zhifeng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 1609 - 1615
  • [7] Causal Discovery with Score Matching on Additive Models with Arbitrary Noise
    Montagna, Francesco
    Noceti, Nicoletta
    Rosasco, Lorenzo
    Zhang, Kun
    Locatello, Francesco
    CONFERENCE ON CAUSAL LEARNING AND REASONING, VOL 213, 2023, 213 : 726 - 751
  • [8] Causal Discovery with Confounding Cascade Nonlinear Additive Noise Models
    Qiao, Jie
    Cai, Ruichu
    Zhang, Kun
    Zhang, Zhenjie
    Hao, Zhifeng
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (06)
  • [9] Score Matching Enables Causal Discovery of Nonlinear Additive Noise Models
    Rolland, Paul
    Cevher, Volkan
    Kleindessner, Matthaeus
    Russel, Chris
    Schoelkopf, Bernhard
    Janzing, Dominik
    Locatello, Francesco
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [10] SCORE MATCHING ENABLES CAUSAL DISCOVERY of NONLINEAR ADDITIVE NOISE MODELS
    Rolland, Paul
    Cevher, Volkan
    Kleindessner, Matthäus
    Russel, Chris
    Schölkopf, Bernhard
    Janzing, Dominik
    Locatello, Francesco
    arXiv, 2022,