Progressive Skeleton Learning for Effective Local-to-Global Causal Structure Learning

Times cited: 0
Authors
Guo, Xianjie [1, 2]
Yu, Kui [1, 2]
Liu, Lin [3]
Li, Jiuyong [3]
Liang, Jiye [4]
Cao, Fuyuan [4]
Wu, Xindong [1, 2]
Affiliations
[1] Minist Educ, Key Lab Knowledge Engn Big Data, Hefei 230601, Peoples R China
[2] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230601, Peoples R China
[3] Univ South Australia, UniSA STEM, Adelaide, SA 5095, Australia
[4] Shanxi Univ, Sch Comp & Informat Technol, Taiyuan 030006, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Asymmetric edges; local-to-global causal structure learning; progressive learning; skeleton learning; SUPPORT VECTOR MACHINES; RULE EXTRACTION; ALGORITHM; COEFFICIENT; SELECTION; TUTORIAL; SVM;
DOI
10.1109/TKDE.2024.3461832
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Causal structure learning (CSL) from observational data is a crucial objective in various machine learning applications. Recent advances in CSL have focused on local-to-global learning, which offers improved efficiency and accuracy. The local-to-global CSL algorithms first learn the local skeleton of each variable in a dataset, then construct the global skeleton by combining these local skeletons, and finally orient edges to infer causality. However, data quality issues such as noise and small samples often result in the presence of problematic asymmetric edges during global skeleton construction, hindering the creation of a high-quality global skeleton. To address this challenge, we propose a novel local-to-global CSL algorithm with a progressive enhancement strategy and make the following novel contributions: 1) To construct an accurate global skeleton, we design a novel strategy to iteratively correct asymmetric edges and progressively improve the accuracy of the global skeleton. 2) Based on the learned accurate global skeleton, we design an integrated global skeleton orientation strategy to infer the correct directions of edges for obtaining an accurate and reliable causal structure. Extensive experiments demonstrate that our method achieves better performance than the existing CSL methods.
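To make the notion of an asymmetric edge concrete, here is a minimal Python sketch of the skeleton-combination step described in the abstract. It assumes the usual reading of the term: an edge X-Y is asymmetric when Y appears in the local skeleton learned for X but X does not appear in the one learned for Y. The function name `combine_local_skeletons` and the toy neighbor sets are hypothetical and used only for illustration. The sketch does not implement the paper's progressive correction strategy, which the abstract does not specify.

```python
def combine_local_skeletons(local_neighbors):
    """Split candidate edges into symmetric and asymmetric sets.

    local_neighbors maps each variable to the set of neighbors found by
    some local skeleton learner (assumed here, not taken from the paper).
    """
    symmetric, asymmetric = set(), set()
    for x, neighbors in local_neighbors.items():
        for y in neighbors:
            edge = frozenset((x, y))
            # The edge is symmetric only if both endpoints list each other.
            if x in local_neighbors.get(y, set()):
                symmetric.add(edge)
            else:
                asymmetric.add(edge)
    return symmetric, asymmetric


# Toy local skeletons: A lists C as a neighbor, but C does not list A.
local_neighbors = {
    "A": {"B", "C"},
    "B": {"A"},
    "C": set(),
}

symmetric, asymmetric = combine_local_skeletons(local_neighbors)

# Two simple ways to resolve asymmetric edges when building the global skeleton:
and_rule_skeleton = symmetric                # keep only mutually confirmed edges
or_rule_skeleton = symmetric | asymmetric    # keep every edge seen at least once
```

In this toy input, A-B is confirmed from both sides while A-C is asymmetric, so a fixed AND or OR rule would drop or keep it without further evidence. This is the kind of edge the abstract says the proposed method corrects iteratively.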
Pages: 9065-9079
Page count: 15