Sophisticated Merging Over Random Partitions: A Scalable and Robust Causal Discovery Approach

被引:6
作者
Cai, Ruichu [1 ,2 ]
Zhang, Zhenjie [3 ]
Hao, Zhifeng [1 ,4 ]
Winslett, Marianne [5 ]
机构
[1] Guangdong Univ Technol, Sch Comp, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangdong Univ Technol, State Key Lab Novel Software Technol, Guangzhou 510006, Guangdong, Peoples R China
[3] Illinois Singapore Pte Ltd, Adv Digital Sci Ctr, Singapore 138632, Singapore
[4] Foshan Univ, Sch Math & Big Data, Foshan 528000, Peoples R China
[5] Univ Illinois, Dept Comp Sci, Champaign, IL 61801 USA
关键词
Causal discovery; maximal acyclic subgraph model; random partition; significance propagation;
D O I
10.1109/TNNLS.2017.2734804
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scalable causal discovery is an essential technology to a wide spectrum of applications, including biomedical studies and social network evolution analysis. To tackle the difficulty of high dimensionality, a number of solutions are proposed in the literature, generally dividing the original variable domain into smaller subdomains by computation intensive partitioning strategies. These approaches usually suffer significant structural errors when the partitioning strategies fail to recognize true causal edges across the output subdomains. Such a structural error accumulates quickly with the growing depth of recursive partitioning, due to the lack of correction mechanism over causally connected variables when they are wrongly divided into two subdomains, finally jeopardizing the robustness of the integrated results. This paper proposes a completely different strategy to solve the problem, powered by a lightweight random partitioning scheme together with a carefully designed merging algorithm over results from the random partitions. Based on the randomness properties of the partitioning scheme, we design a suite of tricks for the merging algorithm, in order to support propagation-based significance enhancement, maximal acyclic subgraph causal ordering, and order-sensitive redundancy elimination. Theoretical studies as well as empirical evaluations verify the genericity, effectiveness, and scalability of our proposal on both simulated and real-world causal structures when the scheme is used in combination with a variety of causal solvers known effective on smaller domains.
引用
收藏
页码:3623 / 3635
页数:13
相关论文
共 37 条
  • [1] Aliferis CF, 2010, J MACH LEARN RES, V11, P171
  • [2] [Anonymous], 2009, P 26 INT C MACH LEAR
  • [3] [Anonymous], 2013, International Conference on Machine Learning
  • [4] [Anonymous], 2009, P 25 C UNCERTAINTY A, DOI DOI 10.48550/ARXIV.1205.2599
  • [5] [Anonymous], 1991, KR
  • [6] [Anonymous], 2009, CAUSALITY MODELS REA
  • [7] [Anonymous], 2000, CAUSATION PREDICTION
  • [8] Partial correlation and conditional correlation as measures of conditional independence
    Baba, K
    Shibata, R
    Sibuya, M
    [J]. AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2004, 46 (04) : 657 - 664
  • [9] Bromberg F, 2009, J MACH LEARN RES, V10, P301
  • [10] Understanding Social Causalities Behind Human Action Sequences
    Cai, Ruichu
    Zhang, Zhenjie
    Hao, Zhifeng
    Winslett, Marianne
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) : 1801 - 1813