Learning causal networks from systems biology time course data: an effective model selection procedure for the vector autoregressive process

被引:87
作者
Opgen-Rhein, Rainer
Strimmer, Korbinian
机构
[1] Univ Munich, Dept Stat, D-80539 Munich, Germany
[2] Univ Leipzig, IMISE, D-04107 Leipzig, Germany
来源
BMC BIOINFORMATICS | 2007年 / 8卷
关键词
EMPIRICAL BAYES APPROACH; SHRINKAGE APPROACH; LASSO;
D O I
10.1186/1471-2105-8-S2-S3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Causal networks based on the vector autoregressive (VAR) process are a promising statistical tool for modeling regulatory interactions in a cell. However, learning these networks is challenging due to the low sample size and high dimensionality of genomic data. Results: We present a novel and highly efficient approach to estimate a VAR network. This proceeds in two steps: (i) improved estimation of VAR regression coefficients using an analytic shrinkage approach, and (ii) subsequent model selection by testing the associated partial correlations. In simulations this approach outperformed for small sample size all other considered approaches in terms of true discovery rate (number of correctly identified edges relative to the significant edges). Moreover, the analysis of expression time series data from Arabidopsis thaliana resulted in a biologically sensible network. Conclusion: Statistical learning of large-scale VAR causal models can be done efficiently by the proposed procedure, even in the difficult data situations prevalent in genomics and proteomics. Availability: The method is implemented in R code that is available from the authors on request.
引用
收藏
页数:8
相关论文
共 20 条
[1]   Temporal aggregation bias and inference of causal regulatory networks [J].
Bay, SD ;
Chrisman, L ;
Pohorille, A ;
Shrager, J .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2004, 11 (05) :971-985
[2]   Searching for the causal structure of a vector autoregression [J].
Demiralp, S ;
Hoover, KD .
OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 2003, 65 :745-767
[3]   STEINS ESTIMATION RULE AND ITS COMPETITORS - EMPIRICAL BAYES APPROACH [J].
EFRON, B ;
MORRIS, C .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1973, 68 (341) :117-130
[4]  
EFRON B, 2005, TECH REP DEP STAT
[5]   GENERALIZED CROSS-VALIDATION AS A METHOD FOR CHOOSING A GOOD RIDGE PARAMETER [J].
GOLUB, GH ;
HEATH, M ;
WAHBA, G .
TECHNOMETRICS, 1979, 21 (02) :215-223
[6]   TESTING FOR CAUSALITY - A PERSONAL VIEWPOINT [J].
GRANGER, CWJ .
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 1980, 2 (04) :329-352
[7]  
Lutkepohl H., 1993, Introduction to Multiple Time Series Analysis, V2
[8]   High-dimensional graphs and variable selection with the Lasso [J].
Meinshausen, Nicolai ;
Buehlmann, Peter .
ANNALS OF STATISTICS, 2006, 34 (03) :1436-1462
[9]  
MONETA A, 2004, TECHNICAL REPORT LAB
[10]   Bayesian estimates for vector autoregressive models [J].
Ni, S ;
Sun, DC .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2005, 23 (01) :105-117