Scalable Inference of Gene Regulatory Networks with the Spark Distributed Computing Platform

Cited by: 6
Authors
Barba-Gonzalez, Cristobal [1]
Garcia-Nieto, Jose [1]
Benitez-Hidalgo, Antonio [1]
Nebro, Antonio J. [1]
Aldana-Montes, Jose F. [1]
Affiliations
[1] Univ Malaga, Inst Invest Biomed Malaga IBIMA, Dept Lenguajes & Ciencias Computac, Campus Teatinos, E-29071 Malaga, Spain
Source
INTELLIGENT DISTRIBUTED COMPUTING XII | 2018 / Volume 798
Keywords
Gene Regulatory Networks; Multi-objective; Metaheuristics; Distributed Computing; jMetal; Spark;
DOI
10.1007/978-3-319-99626-4_6
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Inference of Gene Regulatory Networks (GRNs) remains an important open challenge in computational biology. The goal of bio-model inference is to obtain, from time series of gene expression data, the sparse topological structure and the parameters that quantitatively describe and reproduce the dynamics of a biological system. Nevertheless, the inference of a GRN is a complex optimization problem that involves processing S-System models, which include large amounts of gene expression data from hundreds (even thousands) of genes in multiple time series (assays). This complexity, along with the amount of data managed, makes the inference of GRNs a computationally expensive task. Therefore, parallel algorithmic proposals that operate efficiently on distributed processing platforms are a must in the current reconstruction of GRNs. In this paper, a parallel multi-objective approach is proposed for the optimal inference of GRNs, which minimizes both the Mean Squared Error under the S-System model and the Topology Regularization value. A flexible and robust multi-objective cellular evolutionary algorithm is adapted to deploy parallel tasks in the form of Spark jobs. The proposed approach has been developed using the jMetal framework; to perform parallel computation, we use Spark on a cluster of distributed nodes to evaluate candidate solutions that model the interactions of genes in biological networks.
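The two objectives named in the abstract can be illustrated with a minimal sketch. It assumes the standard S-System form dX_i/dt = alpha_i * prod_j X_j^g_ij - beta_i * prod_j X_j^h_ij. The forward Euler integrator, the plain (unnormalized) mean squared error, and the use of the summed absolute kinetic orders as the Topology Regularization value are assumptions made for illustration only; the record does not give the paper's exact definitions. All function names are hypothetical.

```python
from typing import List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]


def s_system_derivatives(x: Sequence[float], alpha: Sequence[float],
                         beta: Sequence[float], g: Matrix, h: Matrix) -> List[float]:
    """Standard S-System right-hand side for every gene i."""
    n = len(x)
    dx = []
    for i in range(n):
        production = alpha[i]
        degradation = beta[i]
        for j in range(n):
            production *= x[j] ** g[i][j]
            degradation *= x[j] ** h[i][j]
        dx.append(production - degradation)
    return dx


def simulate(x0: Sequence[float], alpha: Sequence[float], beta: Sequence[float],
             g: Matrix, h: Matrix, dt: float, steps: int) -> List[List[float]]:
    """Forward Euler integration (assumed here; any ODE solver could be used)."""
    x = list(x0)
    trajectory = [list(x)]
    for _ in range(steps):
        dx = s_system_derivatives(x, alpha, beta, g, h)
        # Clamp to a small positive value so negative kinetic orders stay defined.
        x = [max(xi + dt * di, 1e-12) for xi, di in zip(x, dx)]
        trajectory.append(list(x))
    return trajectory


def mean_squared_error(simulated: Matrix, observed: Matrix) -> float:
    """Plain MSE over all genes and time points (assumed form)."""
    total, count = 0.0, 0
    for sim_row, obs_row in zip(simulated, observed):
        for s, o in zip(sim_row, obs_row):
            total += (s - o) ** 2
            count += 1
    return total / count


def topology_regularization(g: Matrix, h: Matrix) -> float:
    """Sum of absolute kinetic orders (assumed sparsity measure)."""
    return (sum(abs(v) for row in g for v in row)
            + sum(abs(v) for row in h for v in row))


def evaluate(candidate: Tuple[Sequence[float], Sequence[float], Matrix, Matrix],
             observed: Matrix, dt: float) -> Tuple[float, float]:
    """Return both objective values (to be minimized) for one candidate."""
    alpha, beta, g, h = candidate
    simulated = simulate(observed[0], alpha, beta, g, h, dt, len(observed) - 1)
    return mean_squared_error(simulated, observed), topology_regularization(g, h)
```

Because each candidate is evaluated independently of the others, a function like `evaluate` is the kind of unit of work that could be distributed across cluster nodes, for example by mapping it over a distributed collection of candidates.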
Pages: 61-70
Number of pages: 10
Related papers
14 in total
[1]   Identification of genetic networks by strategic gene disruptions and gene overexpressions under a boolean model [J].
Akutsu, T ;
Kuhara, S ;
Maruyama, O ;
Miyano, S .
THEORETICAL COMPUTER SCIENCE, 2003, 298 (01) :235-251
[2]  
[Anonymous], 2010, BIOCH SYSTEMS ANAL S
[3]  
[Anonymous], 2010, P USENIX WORKSH HOT
[4]  
[Anonymous], 2007, DESIGN ISSUES MULTIO, DOI 10.1007/978-3-540-70928-2_13
[5]  
Barba-Gonzalez Cristobal, 2017, Evolutionary Multi-Criterion Optimization. 9th International Conference, EMO 2017. Proceedings: LNCS 10173, P16, DOI 10.1007/978-3-319-54157-0_2
[6]  
Deb K., 2001, MULTIOBJECTIVE OPTIM, V16
[7]   jMetal: A Java framework for multi-objective optimization [J].
Durillo, Juan J. ;
Nebro, Antonio J. .
ADVANCES IN ENGINEERING SOFTWARE, 2011, 42 (10) :760-771
[8]  
Friedman N, 2004, J COMPUT BIOL, V7, P3
[9]   Inferring gene regulatory networks using differential evolution with local search heuristics [J].
Noman, Nasimul ;
Iba, Hitoshi .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (04) :634-647
[10]   Reverse Engineering of Gene Regulatory Networks Using Dissipative Particle Swarm Optimization [J].
Palafox, Leon ;
Noman, Nasimul ;
Iba, Hitoshi .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2013, 17 (04) :577-587