A parallel metaheuristic for large mixed-integer dynamic optimization problems, with applications in computational biology

被引:9
作者
Penas, David R. [1 ]
Henriques, David [1 ]
Gonzalez, Patricia [2 ]
Doallo, Ramon [2 ]
Saez-Rodriguez, Julio [3 ,4 ]
Banga, Julio R. [1 ]
机构
[1] CSIC, Spanish Natl Res Council, IIM, BioProc Engn Grp, C Eduardo Cabello 6, E-36208 Vigo, Spain
[2] Univ A Coruna, Dept Elect & Syst, Comp Architecture Grp, Campus Elvina S-N, La Coruna 15071, Spain
[3] Rhein Westfal TH Aachen, Fac Med, Joint Res Ctr Computat Biomed, D-52074 Aachen, Germany
[4] European Bioinformat Inst, European Mol Biol Lab, Hinxton CD10 1SD, England
来源
PLOS ONE | 2017年 / 12卷 / 08期
关键词
ANT COLONY OPTIMIZATION; SYSTEMS BIOLOGY; BIOCHEMICAL PATHWAYS; PARAMETER-ESTIMATION; INTEGRATED PROCESS; IDENTIFICATION; ALGORITHM; NETWORKS; MODELS; DESIGN;
D O I
10.1371/journal.pone.0182186
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background We consider a general class of global optimization problems dealing with nonlinear dynamic models. Although this class is relevant to many areas of science and engineering, here we are interested in applying this framework to the reverse engineering problem in computational systems biology, which yields very large mixed-integer dynamic optimization (MIDO) problems. In particular, we consider the framework of logic-based ordinary differential equations (ODEs). Methods We present saCeSS2, a parallel method for the solution of this class of problems. This method is based on an parallel cooperative scatter search metaheuristic, with new mechanisms of self-adaptation and specific extensions to handle large mixed-integer problems. We have paid special attention to the avoidance of convergence stagnation using adaptive cooperation strategies tailored to this class of problems. Results We illustrate its performance with a set of three very challenging case studies from the domain of dynamic modelling of cell signaling. The simpler case study considers a synthetic signaling pathway and has 84 continuous and 34 binary decision variables. A second case study considers the dynamic modeling of signaling in liver cancer using high-throughput data, and has 135 continuous and 109 binaries decision variables. The third case study is an extremely difficult problem related with breast cancer, involving 690 continuous and 138 binary decision variables. We report computational results obtained in different infrastructures, including a local cluster, a large supercomputer and a public cloud platform. Interestingly, the results show how the cooperation of individual parallel searches modifies the systemic properties of the sequential algorithm, achieving superlinear speedups compared to an individual search (e.g. speedups of 15 with 10 cores), and significantly improving (above a 60%) the performance with respect to a non-cooperative parallel scheme. The scalability of the method is also good (tests were performed using up to 300 cores). Conclusions These results demonstrate that saCeSS2 can be used to successfully reverse engineer large dynamic models of complex biological pathways. Further, these results open up new possibilities for other MIDO-based large-scale applications in the life sciences such as metabolic engineering, synthetic biology, drug scheduling.
引用
收藏
页数:32
相关论文
共 100 条
  • [11] Dynamic optimization of bioprocesses: Efficient and robust numerical strategies
    Banga, JR
    Balsa-Canto, E
    Moles, CG
    Alonso, AA
    [J]. JOURNAL OF BIOTECHNOLOGY, 2005, 117 (04) : 407 - 419
  • [12] Optimization in computational systems biology
    Banga, Julio R.
    [J]. BMC SYSTEMS BIOLOGY, 2008, 2
  • [14] Bernardo-Faura M., 2014, PLOS COMPUTATIONAL B, V10, P1
  • [15] Bertsekas D. P., 1995, Dynamic programming and optimal control
  • [16] Advances in simultaneous strategies for dynamic process optimization
    Biegler, LT
    Cervantes, AM
    Wächter, A
    [J]. CHEMICAL ENGINEERING SCIENCE, 2002, 57 (04) : 575 - 593
  • [17] The Inferelator:: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo
    Bonneau, Richard
    Reiss, David J.
    Shannon, Paul
    Facciotti, Marc
    Hood, Leroy
    Baliga, Nitin S.
    Thorsson, Vesteinn
    [J]. GENOME BIOLOGY, 2006, 7 (05)
  • [18] Global optimization advances in Mixed-Integer Nonlinear Programming, MINLP, and Constrained Derivative-Free Optimization, CDFO
    Boukouvala, Fani
    Misener, Ruth
    Floudas, Christodoulos A.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2016, 252 (03) : 701 - 727
  • [19] Burke EK, 2010, INT SER OPER RES MAN, V146, P449, DOI 10.1007/978-1-4419-1665-5_15
  • [20] BURNHAM K.P., 2002, MODEL SELECTION MULT, P352