Bootstrap-based Causal Structure Learning

被引:10
作者
Guo, Xianjie [1 ]
Wang, Yujie [1 ]
Huang, Xiaoling [1 ]
Yang, Shuai [1 ]
Yu, Kui [1 ]
机构
[1] Hefei Univ Technol, Hefei, Anhui, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
Bootstrap sampling; Causal structure learning; Directed acyclic graph; Local skeleton learning; BAYESIAN NETWORKS; INDUCTION;
D O I
10.1145/3511808.3557249
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning a causal structure from observational data is crucial for data scientists. Recent advances in causal structure learning (CSL) have focused on local-to-global learning, since the local-to-global CSL can be scaled to high-dimensional data. The local-to-global CSL algorithms first learn the local skeletons, then construct the global skeleton, and finally orient edges. In practice, the performance of local-to-global CSL mainly depends on the accuracy of the global skeleton. However, in many real-world settings, owing to inevitable data quality issues (e.g. noise and small sample), existing local-to-global CSL methods often yield many asymmetric edges (e.g., given an asymmetric edge containing variables A and B, the learned skeleton of.. contains B, but the learned skeleton of B does not contain A), which make it difficult to construct a high quality global skeleton. To tackle this problem, this paper proposes a Bootstrap sampling based Causal Structure Learning (BCSL) algorithm. The novel contribution of BCSL is that it proposes an integrated global skeleton learning strategy that can construct more accurate global skeletons. Specifically, this strategy first utilizes the Bootstrap method to generate multiple sub-datasets, then learns the local skeleton of variables on each asymmetric edge on those sub-datasets, and finally designs a novel scoring function to estimate the learning results on all sub-datasets for correcting the asymmetric edge. Extensive experiments on both benchmark and real datasets verify the effectiveness of the proposed method.
引用
收藏
页码:656 / 665
页数:10
相关论文
共 49 条
[1]  
Aliferis CF, 2010, J MACH LEARN RES, V11, P171
[2]  
[Anonymous], 1993, An introduction to the bootstrap
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Cai RC, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1609
[5]  
Chen R, 2021, PR MACH LEARN RES, V139
[6]  
Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
[7]  
Chickering DM, 2004, J MACH LEARN RES, V5, P1287
[8]  
Colombo D, 2014, J MACH LEARN RES, V15, P3741
[9]  
Cussens J., 2011, P 27 C UNCERTAINTY A, P153
[10]   Entropy-based pruning for learning Bayesian networks using BIC [J].
de Campos, Cassio P. ;
Scanagatta, Mauro ;
Corani, Giorgio ;
Zaffalon, Marco .
ARTIFICIAL INTELLIGENCE, 2018, 260 :42-50