Graph estimation with joint additive models

被引:51
作者
Voorman, Arend [1 ]
Shojaie, Ali [1 ]
Witten, Daniela [1 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
Conditional independence; Graphical model; Lasso; Nonlinearity; Non-Gaussianity; Sparse additive model; Sparsity; BAYESIAN INFORMATION CRITERIA; SELECTION; REGRESSION; INFERENCE; NETWORKS;
D O I
10.1093/biomet/ast053
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, there has been considerable interest in estimating conditional independence graphs in high dimensions. Most previous work assumed that the variables are multivariate Gaussian or that the conditional means of the variables are linearly related. Unfortunately, if these assumptions are violated, the resulting conditional independence estimates can be inaccurate. We propose a semiparametric method, graph estimation with joint additive models, which allows the conditional means of the features to take an arbitrary additive form. We present an efficient algorithm for computation of our estimator, and prove that it is consistent. We extend our method to estimation of directed graphs with known causal ordering. Using simulated data, we show that our method performs better than existing methods when there are nonlinear relationships among the features, and is comparable to methods that assume multivariate normality when the conditional means are linear. We illustrate our method on a cell signalling dataset.
引用
收藏
页码:85 / 101
页数:17
相关论文
共 45 条
[1]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[2]  
[Anonymous], 1980, MULTIVARIATE ANAL
[3]  
Banerjee O, 2008, J MACH LEARN RES, V9, P485
[4]   Reverse engineering of regulatory networks in human B cells [J].
Basso, K ;
Margolin, AA ;
Stolovitzky, G ;
Klein, U ;
Dalla-Favera, R ;
Califano, A .
NATURE GENETICS, 2005, 37 (04) :382-390
[5]   STATISTICAL-ANALYSIS OF NON-LATTICE DATA [J].
BESAG, J .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1975, 24 (03) :179-195
[6]  
BESAG J, 1974, J ROY STAT SOC B MET, V36, P192
[7]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[8]   A Nonparametric Approach to Detect Nonlinear Correlation in Gene Expression [J].
Chen, Y. Ann ;
Almeida, Jonas S. ;
Richards, Adam J. ;
Mueller, Peter ;
Carroll, Raymond J. ;
Rohrer, Baerbel .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2010, 19 (03) :552-568
[9]  
Cowell RG., 2007, Probabilistic Networks and Expert Systems: Exact Computational Methods for Bayesian Networks
[10]   COVARIANCE SELECTION [J].
DEMPSTER, AP .
BIOMETRICS, 1972, 28 (01) :157-&