Kernel-based Approach for Learning Causal Graphs from Mixed Data

被引:0
作者
Handhayani, Teny [1 ]
Cussens, James [2 ]
机构
[1] Univ York, Dept Comp Sci, York, N Yorkshire, England
[2] Univ Bristol, Dept Comp Sci, Bristol, Avon, England
来源
INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138 | 2020年 / 138卷
关键词
Causal learning; Kernel Alignment; Mixed data; graph structure evaluation; DIRECTED ACYCLIC GRAPHS; DISCOVERY; MODELS; LATENT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A causal graph can be generated from a dataset using a particular causal algorithm, for instance, the PC algorithm or Fast Causal Inference (FCI). This paper provides two contributions in learning causal graphs: an easy way to handle mixed data so that it can be used to learn causal graphs using the PC algorithm/FCI and a method to evaluate the learned graph structure when the true graph is unknown. This research proposes using kernel functions and Kernel Alignment to handle mixed data. The two main steps of this approach are computing a kernel matrix for each variable and calculating a pseudo-correlation matrix using Kernel Alignment. The Kernel Alignment matrix is used as a substitute for the correlation matrix that is the main component used in computing a partial correlation for the conditional independence test for Gaussian data in the PC Algorithm and FCI. The advantage of this idea is that is possible to handle more data types when there is a suitable kernel function to compute a kernel matrix for an observed variable. The proposed method is successfully applied to learn a causal graph from mixed data containing categorical, binary, ordinal, and continuous variables. We also introduce the Modal Value of Edges Existence (MVEE) method, a new method to evaluate the structure of learned graphs represented by Partial Ancestral Graph (PAG) when the true graph is unknown. MVEE produces an agreement graph as a proxy to the true graph to evaluate the structure of the learned graph. MVEE is successfully used to choose the best-learned graph when the true graph is unknown.
引用
收藏
页码:221 / 232
页数:12
相关论文
共 24 条
  • [1] Bach F. R., 2002, P 15 INT C NEUR INF, P1033
  • [2] Kernel independent component analysis
    Bach, FR
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (01) : 1 - 48
  • [3] Kernel Functions for Categorical Variables with Application to Problems in the Life Sciences
    Belanche, Lluis A.
    Villegas, Marco A.
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE OF THE CATALAN ASSOCIATION FOR ARTIFICIAL INTELLIGENCE, 2013, 256 : 171 - 180
  • [4] LEARNING HIGH-DIMENSIONAL DIRECTED ACYCLIC GRAPHS WITH LATENT AND SELECTION VARIABLES
    Colombo, Diego
    Maathuis, Marloes H.
    Kalisch, Markus
    Richardson, Thomas S.
    [J]. ANNALS OF STATISTICS, 2012, 40 (01) : 294 - 321
  • [5] Cristianini N, 2002, ADV NEUR IN, V14, P367
  • [6] Cui R., 2016, MACHINE LEARNING KNO, P377, DOI DOI 10.1007/978-3-319-46227-1_24
  • [7] Fukumizu K, 2008, ADV NEURAL INFORM PR, P489
  • [8] Gretton A, 2005, J MACH LEARN RES, V6, P2075
  • [9] Kalisch M, 2007, J MACH LEARN RES, V8, P613
  • [10] Kalisch M, 2012, J STAT SOFTW, V47, P1