Identification of Boolean Network Models From Time Series Data Incorporating Prior Knowledge

被引:17
作者
Leifeld, Thomas [1 ]
Zhang, Zhihua [1 ]
Zhang, Ping [1 ]
机构
[1] Tech Univ Kaiserslautern, Inst Automat Control, Kaiserslautern, Germany
来源
FRONTIERS IN PHYSIOLOGY | 2018年 / 9卷
关键词
Boolean networks; identification; prior knowledge; time series data; network inference; GENE REGULATORY NETWORKS; DYNAMICS; OPTIMIZATION; RECONSTRUCTION; ALGORITHM; SYSTEMS;
D O I
10.3389/fphys.2018.00695
中图分类号
Q4 [生理学];
学科分类号
071003 ;
摘要
Motivation: Mathematical models take an important place in science and engineering. A model can help scientists to explain dynamic behavior of a system and to understand the functionality of system components. Since length of a time series and number of replicates is limited by the cost of experiments, Boolean networks as a structurally simple and parameter-free logical model for gene regulatory networks have attracted interests of many scientists. In order to fit into the biological contexts and to lower the data requirements, biological prior knowledge is taken into consideration during the inference procedure. In the literature, the existing identification approaches can only deal with a subset of possible types of prior knowledge. Results: We propose a new approach to identify Boolean networks from time series data incorporating prior knowledge, such as partial network structure, canalizing property, positive and negative unateness. Using vector form of Boolean variables and applying a generalized matrix multiplication called the semi-tensor product (STP), each Boolean function can be equivalently converted into a matrix expression. Based on this, the identification problem is reformulated as an integer linear programming problem to reveal the system matrix of Boolean model in a computationally efficient way, whose dynamics are consistent with the important dynamics captured in the data. By using prior knowledge the number of candidate functions can be reduced during the inference. Hence, identification incorporating prior knowledge is especially suitable for the case of small size time series data and data without sufficient stimuli. The proposed approach is illustrated with the help of a biological model of the network of oxidative stress response. Conclusions: The combination of efficient reformulation of the identification problem with the possibility to incorporate various types of prior knowledge enables the application of computational model inference to systems with limited amount of time series data. The general applicability of this methodological approach makes it suitable for a variety of biological systems and of general interest for biological and medical research.
引用
收藏
页数:12
相关论文
共 49 条
[1]  
Akutsu T, 1999, Pac Symp Biocomput, P17
[2]   Dynamics of complex systems:: Scaling laws for the period of Boolean networks [J].
Albert, R ;
Barabási, AL .
PHYSICAL REVIEW LETTERS, 2000, 84 (24) :5660-5663
[3]   Missing values in gel-based proteomics [J].
Albrecht, Daniela ;
Kniemeyer, Olaf ;
Brakhage, Axel A. ;
Guthke, Reinhard .
PROTEOMICS, 2010, 10 (06) :1202-1211
[4]   MULTILINEAR POLYNOMIALS AND FRANKL-RAY-CHAUDHURI-WILSON TYPE INTERSECTION-THEOREMS [J].
ALON, N ;
BABAI, L ;
SUZUKI, H .
JOURNAL OF COMBINATORIAL THEORY SERIES A, 1991, 58 (02) :165-180
[5]  
[Anonymous], 2001, Sci. China, Ser. F: Info. Sci.
[6]  
Arnone MI, 1997, DEVELOPMENT, V124, P1851
[7]   An Evaluation of Methods for Inferring Boolean Networks from Time-Series Data [J].
Berestovsky, Natalie ;
Nakhleh, Luay .
PLOS ONE, 2013, 8 (06)
[8]  
Bollobas B., 2012, GRAPH THEORY INTRO C, V63
[9]   Pseudo-Boolean optimization [J].
Boros, E ;
Hammer, PL .
DISCRETE APPLIED MATHEMATICS, 2002, 123 (1-3) :155-225
[10]  
Breindl C, 2013, IEEE DECIS CONTR P, P733, DOI 10.1109/CDC.2013.6759969