Exploring the high selectivity of 3-D protein structures using distributed memetic algorithms

Cited by: 3
Authors
Inostroza-Ponta, Mario [1]
Dorn, Marcio [2]
Escobar, Ivan [1]
Lima, Leonardo de Correa [2]
Rosas, Erika [4]
Hidalgo, Nicolas [3]
Marin, Mauricio [1]
Affiliations
[1] Univ Santiago Chile, Dept Ingn Informat, Santiago, Chile
[2] Univ Fed Rio Grande do Sul, Inst Informat, Av Bento Goncalves 9500, Porto Alegre, RS, Brazil
[3] Univ Diego Portales, Fac Ingn & Cs, Escuela Informat & Telecomunicac, Santiago, Chile
[4] Univ Tecn Federico Santa Maria, Dept Informat, Valparaiso, Chile
Keywords
Protein structure prediction; Memetic algorithm; Distributed memetic algorithms; Structural bioinformatics; STRUCTURE PREDICTION; ENERGY LANDSCAPE; SERVER;
DOI
10.1016/j.jocs.2020.101087
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This paper addresses the problem of predicting the tertiary structure of a protein from its amino acid sequence, a problem that has been reported to be NP-complete. We design an ad-hoc distributed memetic algorithm (DMA) and evaluate several algorithm configurations that differ in their distributed population structures, ad-hoc local search strategies, and the combination of two energy functions. The algorithm uses an asynchronous hierarchical population of agents that exchange solutions throughout the execution. Extensive computational experiments were carried out to evaluate: (1) the impact of communication on different population structures, (2) the combination of the energy functions used for fitness calculations, (3) the scalability of the algorithm for structures with a larger number of agents, (4) the performance of the different approaches proposed for local search and diversity calculations, (5) the biological significance of the predicted structures, and (6) how the best-performing configuration of the DMA compares with other algorithms from the literature. The algorithm was tested on 20 sequences of different sizes, and the analysis considered both the computational quality and the biological significance of the predicted structures. Results show that the combination of energy functions and the proposed DMA allows the prediction of structures that are similar to the experimental ones. Performance analysis shows that increasing parallelism improves execution times without worsening the quality of the solutions. © 2020 Elsevier B.V. All rights reserved.
Pages: 17