Stability of building gene regulatory networks with sparse autoregressive models

被引:14
作者
Rajapakse, Jagath C. [1 ,2 ,3 ]
Mundra, Piyushkumar A. [1 ]
机构
[1] Nanyang Technol Univ, BioInformat Res Ctr, Sch Comp Engn, Singapore 639798, Singapore
[2] Singapore MIT Alliance, Singapore 639798, Singapore
[3] MIT, Dept Biol Sci, Cambridge, MA 02142 USA
基金
澳大利亚研究理事会;
关键词
TIME-SERIES; CELL-CYCLE; EXPRESSION; REGULARIZATION; SIMULATION; APOPTOSIS; INHIBITOR; SELECTION; SYSTEMS;
D O I
10.1186/1471-2105-12-S13-S17
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Biological networks are constantly subjected to random perturbations, and efficient feedback and compensatory mechanisms exist to maintain their stability. There is an increased interest in building gene regulatory networks (GRNs) from temporal gene expression data because of their numerous applications in life sciences. However, because of the limited number of time points at which gene expressions can be gathered in practice, computational techniques of building GRN often lead to inaccuracies and instabilities. This paper investigates the stability of sparse auto-regressive models of building GRN from gene expression data. Results: Criteria for evaluating the stability of estimating GRN structure are proposed. Thereby, stability of multivariate vector autoregressive (MVAR) methods - ridge, lasso, and elastic-net - of building GRN were studied by simulating temporal gene expression datasets on scale-free topologies as well as on real data gathered over Hela cell-cycle. Effects of the number of time points on the stability of constructing GRN are investigated. When the number of time points are relatively low compared to the size of network, both accuracy and stability are adversely affected. At least, the number of time points equal to the number of genes in the network are needed to achieve decent accuracy and stability of the networks. Our results on synthetic data indicate that the stability of lasso and elastic-net MVAR methods are comparable, and their accuracies are much higher than the ridge MVAR. As the size of the network grows, the number of time points required to achieve acceptable accuracy and stability are much less relative to the number of genes in the network. The effects of false negatives are easier to improve by increasing the number time points than those due to false positives. Application to HeLa cell-cycle gene expression dataset shows that biologically stable GRN can be obtained by introducing perturbations to the data. Conclusions: Accuracy and stability of building GRN are crucial for investigation of gene regulations. Sparse MVAR techniques such as lasso and elastic-net provide accurate and stable methods for building even GRN of small size. The effect of false negatives is corrected much easier with the increased number of time points than those due to false positives. With real data, we demonstrate how stable networks can be derived by introducing random perturbation to data.
引用
收藏
页数:10
相关论文
共 31 条
[1]  
Amati Bruno, 1998, Frontiers in Bioscience, V3, pD250
[2]  
[Anonymous], GLMNET LASSO ELASTIC
[3]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Noxa mediates p18INK4c cell-cycle control of homeostasis in B cells and plasma cell precursors [J].
Bretz, Jamieson ;
Garcia, Josefina ;
Huang, Xiangao ;
Kang, Lin ;
Zhang, Yang ;
Toellner, Kai-Michael ;
Chen-Kiang, Selina .
BLOOD, 2011, 117 (07) :2179-2188
[6]   Building gene networks with time-delayed regulations [J].
Chaturvedi, Iti ;
Rajapakse, Jagath C. .
PATTERN RECOGNITION LETTERS, 2010, 31 (14) :2133-2137
[7]  
Csardi Gabor., igraph The network analysis package
[8]   Modeling and simulation of genetic regulatory systems: A literature review [J].
De Jong, H .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (01) :67-103
[9]   Bayesian approaches to reverse engineer cellular systems: a simulation study on nonlinear Gaussian networks [J].
Ferrazzi, Fulvia ;
Sebastiani, Paola ;
Ramoni, Marco F. ;
Bellazzi, Riccardo .
BMC BIOINFORMATICS, 2007, 8 (Suppl 5)
[10]  
Fogelberg C, 2009, FDN COMPUTATIONAL IN