An Efficient Data Assimilation Schema for Restoration and Extension of Gene Regulatory Networks Using Time-Course Observation Data

被引:3
作者
Hasegawa, Takanori [1 ]
Mori, Tomoya [1 ]
Yamaguchi, Rui [2 ]
Imoto, Seiya [2 ]
Miyano, Satoru [2 ]
Akutsu, Tatsuya [1 ]
机构
[1] Kyoto Univ, Inst Chem Res, Bioinformat Ctr, Kyoto 6110011, Japan
[2] Univ Tokyo, Inst Med Sci, Ctr Human Genome, Minato Ku, Tokyo, Japan
关键词
biological simulation; gene regulatory networks inference; time-series analysis; COMBINATORIAL TRANSCRIPTIONAL DYNAMICS; PARAMETER-ESTIMATION; STATE; INFERENCE; SYSTEMS; MODELS; KALMAN;
D O I
10.1089/cmb.2014.0171
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene regulatory networks (GRNs) play a central role in sustaining complex biological systems in cells. Although we can construct GRNs by integrating biological interactions that have been recorded in literature, they can include suspicious data and a lack of information. Therefore, there has been an urgent need for an approach by which the validity of constructed networks can be evaluated; simulation-based methods have been applied in which biological observational data are assimilated. However, these methods apply nonlinear models that require high computational power to evaluate even one network consisting of only several genes. Therefore, to explore candidate networks whose simulation models can better predict the data by modifying and extending literature-based GRNs, an efficient and versatile method is urgently required. We applied a combinatorial transcription model, which can represent combinatorial regulatory effects of genes, as a biological simulation model, to reproduce the dynamic behavior of gene expressions within a state space model. Under the model, we applied the unscented Kalman filter to obtain the approximate posterior probability distribution of the hidden state to efficiently estimate parameter values maximizing prediction ability for observational data by the EM-algorithm. Utilizing the method, we propose a novel algorithm to modify GRNs reported in the literature so that their simulation models become consistent with observed data. The effectiveness of our approach was validated through comparison analysis to the previous methods using synthetic networks. Finally, as an application example, a Kyoto Encyclopedia of Genes and Genomes (KEGG)-based yeast cell cycle network was extended with additional candidate genes to better predict the real mRNA expressions data using the proposed method.
引用
收藏
页码:785 / 798
页数:14
相关论文
共 39 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Akutsu T, 2009, LECT NOTES ARTIF INT, V5809, P126, DOI 10.1007/978-3-642-04414-4_14
[3]   Large-scale learning of combinatorial transcriptional dynamics from gene expression [J].
Asif, H. M. Shahzad ;
Sanguinetti, Guido .
BIOINFORMATICS, 2011, 27 (09) :1277-1283
[4]   An improved constraint filtering technique for inferring hidden states and parameters of a biological model [J].
Baker, Syed Murtuza ;
Poskar, C. Hart ;
Schreiber, Falk ;
Junker, Bjoern H. .
BIOINFORMATICS, 2013, 29 (08) :1052-1059
[5]   A Bayesian approach to reconstructing genetic regulatory networks with hidden factors [J].
Beal, MJ ;
Falciani, F ;
Ghahramani, Z ;
Rangel, C ;
Wild, DL .
BIOINFORMATICS, 2005, 21 (03) :349-356
[6]   Saccharomyces Genome Database: the genomics resource of budding yeast [J].
Cherry, J. Michael ;
Hong, Eurie L. ;
Amundsen, Craig ;
Balakrishnan, Rama ;
Binkley, Gail ;
Chan, Esther T. ;
Christie, Karen R. ;
Costanzo, Maria C. ;
Dwight, Selina S. ;
Engel, Stacia R. ;
Fisk, Dianna G. ;
Hirschman, Jodi E. ;
Hitz, Benjamin C. ;
Karra, Kalpana ;
Krieger, Cynthia J. ;
Miyasato, Stuart R. ;
Nash, Rob S. ;
Park, Julie ;
Skrzypek, Marek S. ;
Simison, Matt ;
Weng, Shuai ;
Wong, Edith D. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D700-D705
[7]   An unscented Kalman filter approach to the estimation of nonlinear dynamical systems models [J].
Chow, Sy-Miin ;
Ferrer, Emilio ;
Nesselroade, John R. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2007, 42 (02) :283-321
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Sparse inverse covariance estimation with the graphical lasso [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Robert .
BIOSTATISTICS, 2008, 9 (03) :432-441
[10]   An efficient method of exploring simulation models by assimilating literature and biological observational data [J].
Hasegawa, Takanori ;
Nagasaki, Masao ;
Yamaguchi, Rui ;
Imoto, Seiya ;
Miyano, Satoru .
BIOSYSTEMS, 2014, 121 :54-66