Masked Gradient-Based Causal Structure Learning

被引:0
作者
Ng, Ignavier [1 ]
Zhu, Shengyu [2 ]
Fang, Zhuangyan [3 ]
Li, Haoyang [4 ]
Chen, Zhitang [2 ]
Wang, Jun [5 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Huawei Noahs Ark Lab, Montreal, PQ, Canada
[3] Peking Univ, Beijing, Peoples R China
[4] Ecole Polytech, Lausanne, Switzerland
[5] UCL, London, England
来源
PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM | 2022年
关键词
Causal structure learning; gradient-based optimization; binary adjacency matrix; Gumbel-Softmax; DISCOVERY; SEARCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of learning causal structures from observational data. We reformulate the Structural Equation Model (SEM) with additive noises in a form parameterized by binary graph adjacency matrix and show that, if the original SEM is identifiable, then the binary adjacency matrix can be identified up to super-graphs of the true causal graph under mild conditions. We then utilize the reformulated SEM to develop a causal structure learning method that can be efficiently trained using gradient-based optimization, by leveraging a smooth characterization on acyclicity and the Gumbel-Softmax approach to approximate the binary adjacency matrix. It is found that the obtained entries are typically near zero or one and can be easily thresholded to identify the edges. We conduct experiments on synthetic and real datasets to validate the e.ectiveness of the proposed method, and show that it readily includes di.erent smooth model functions and achieves a much improved performance on most datasets considered.
引用
收藏
页码:424 / 432
页数:9
相关论文
共 44 条
[1]  
[Anonymous], 1999, Nonlinear programming
[2]  
Bengio Y, 2013, Arxiv, DOI arXiv:1308.3432
[3]   CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION [J].
Buehlmann, Peter ;
Peters, Jonas ;
Ernest, Jan .
ANNALS OF STATISTICS, 2014, 42 (06) :2526-2556
[4]  
Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
[5]  
Chickering D. M., 1996, Learning from Data: Artificial Intelligence and Statistics V, VV
[6]  
Chu TJ, 2008, J MACH LEARN RES, V9, P967
[7]  
Glorot X., 2010, P 13 INT C ART INT S, P249
[8]   Review of Causal Discovery Methods Based on Graphical Models [J].
Glymour, Clark ;
Zhang, Kun ;
Spirtes, Peter .
FRONTIERS IN GENETICS, 2019, 10
[9]  
Goudet O., 2018, EXPLAINABLE INTERPRE
[10]   Generalized Score Functions for Causal Discovery [J].
Huang, Biwei ;
Zhang, Kun ;
Lin, Yizhu ;
Schoelkopf, Bernhard ;
Glymour, Clark .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :1551-1560