Learning causal graphs via nonlinear sufficient dimension reduction

被引:0
作者
Solea, Eftychia [1 ]
Li, Bing [2 ]
Kim, Kyongwon [3 ]
机构
[1] Queen Mary Univ London, Sch Math Sci, London E1 4NS, England
[2] Penn State Univ, Dept Stat, 326 Thomas Bldg, University Pk, PA 16802 USA
[3] Yonsei Univ, Dept Appl Stat, Dept Stat & Data Sci, 50 Yonsei ro, Seoul 03722, South Korea
基金
新加坡国家研究基金会;
关键词
causality; conditional independence; directed graphs; pc algorithm; sufficient dimension reduction; SLICED INVERSE REGRESSION; MARKOV EQUIVALENCE CLASSES; SUPPORT VECTOR MACHINES; KERNEL; NETWORKS; CONSISTENCY; MODELS; SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We introduce a new nonparametric methodology for estimating a directed acyclic graph (DAG) from observational data. Our method is nonparametric in nature: it does not impose any specific form on the joint distribution of the underlying DAG. Instead, it relies on a linear operator on reproducing kernel Hilbert spaces to evaluate conditional independence. However, a fully nonparametric approach would involve conditioning on a large number of random variables, subjecting it to the curse of dimensionality. To solve this problem, we apply nonlinear sufficient dimension reduction to reduce the number of variables before evaluating the conditional independence. We develop an estimator for the DAG, based on a linear operator that characterizes conditional independence, and establish the consistency and convergence rates of this estimator, as well as the uniform consistency of the estimated Markov equivalence class. We introduce a modified PC-algorithm to implement the estimating procedure efficiently such that the complexity depends on the sparseness of the underlying true DAG. We demonstrate the effectiveness of our methodology through simulations and a real data analysis.
引用
收藏
页码:1 / 46
页数:46
相关论文
共 81 条
[1]  
Andersson SA, 1997, ANN STAT, V25, P505
[2]   CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION [J].
Buehlmann, Peter ;
Peters, Jonas ;
Ernest, Jan .
ANNALS OF STATISTICS, 2014, 42 (06) :2526-2556
[3]   Multiclass support vector classification via coding and regression [J].
Chen, Pei-Chun ;
Lee, Kuang-Yao ;
Lee, Tsung-Ju ;
Lee, Yuh-Jye ;
Huang, Su-Yun .
NEUROCOMPUTING, 2010, 73 (7-9) :1501-1512
[4]  
Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
[5]  
Conway J. B., 1990, COURSE FUNCTIONAL AN
[6]  
COOK RD, 1991, J AM STAT ASSOC, V86, P328, DOI 10.2307/2290564
[7]  
Cooper GF, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P116
[8]   Structure Learning in Graphical Modeling [J].
Drton, Mathias ;
Maathuis, Marloes H. .
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 4, 2017, 4 :365-393
[9]  
Eaton D., 2007, International Conference on Artificial Intelligence and Statistics, P107
[10]  
Eberhardt F., 2008, P 24 C UNC ART INT U, P161