Masked Gradient-Based Causal Structure Learning

被引:0
作者
Ng, Ignavier [1 ]
Zhu, Shengyu [2 ]
Fang, Zhuangyan [3 ]
Li, Haoyang [4 ]
Chen, Zhitang [2 ]
Wang, Jun [5 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[3] Peking Univ, Beijing, Peoples R China
[4] Ecole Polytech, Lausanne, Switzerland
[5] UCL, London, England
来源
PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM | 2022年
关键词
Causal structure learning; gradient-based optimization; binary adjacency matrix; Gumbel-Softmax; DISCOVERY; SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of learning causal structures from observational data. We reformulate the Structural Equation Model (SEM) with additive noises in a form parameterized by binary graph adjacency matrix and show that, if the original SEM is identifiable, then the binary adjacency matrix can be identified up to super-graphs of the true causal graph under mild conditions. We then utilize the reformulated SEM to develop a causal structure learning method that can be efficiently trained using gradient-based optimization, by leveraging a smooth characterization on acyclicity and the Gumbel-Softmax approach to approximate the binary adjacency matrix. It is found that the obtained entries are typically near zero or one and can be easily thresholded to identify the edges. We conduct experiments on synthetic and real datasets to validate the e.ectiveness of the proposed method, and show that it readily includes di.erent smooth model functions and achieves a much improved performance on most datasets considered.
引用
收藏
页码:424 / 432
页数:9
相关论文
共 44 条
  • [1] Bengio Y, 2013, Arxiv, DOI arXiv:1308.3432
  • [2] Bertsekas D.P., 1999, Nonlinear Programming
  • [3] CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION
    Buehlmann, Peter
    Peters, Jonas
    Ernest, Jan
    [J]. ANNALS OF STATISTICS, 2014, 42 (06) : 2526 - 2556
  • [4] Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
  • [5] Chickering D.M., 1996, Learning from Data: Artificial Intelligence and Statistics
  • [6] Chu TJ, 2008, J MACH LEARN RES, V9, P967
  • [7] Glorot X, 2010, P 13 INT C ARTIFICIA, P249
  • [8] Review of Causal Discovery Methods Based on Graphical Models
    Glymour, Clark
    Zhang, Kun
    Spirtes, Peter
    [J]. FRONTIERS IN GENETICS, 2019, 10
  • [9] Goudet Olivier, 2018, Explainable and Interpretable Models in Computer Vision and Machine Learning
  • [10] Generalized Score Functions for Causal Discovery
    Huang, Biwei
    Zhang, Kun
    Lin, Yizhu
    Schoelkopf, Bernhard
    Glymour, Clark
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1551 - 1560