A Memetic Algorithm for 3D Protein Structure Prediction Problem

被引:19
作者
Correa, Leonardo [1 ]
Borguesan, Bruno [1 ]
Farfan, Camilo [2 ]
Inostroza-Ponta, Mario [2 ]
Dorn, Marcio [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, BR-90040060 Porto Alegre, RS, Brazil
[2] Univ Santiago, Dept Ingn Informat, Ctr Biotechnol & Bioengn, Santiago, Chile
关键词
Optimization; metaheuristics; evolutionary algorithms; knowledge based algorithm; structural bioinformatics; SECONDARY STRUCTURE; SEQUENCES; SERVER; OPTIMIZATION; FRAGMENTS; PATHWAYS; DESIGN; CASP10; MODEL; FOLD;
D O I
10.1109/TCBB.2016.2635143
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Memetic Algorithms are population-based metaheuristics intrinsically concerned with exploiting all available knowledge about the problem under study. The incorporation of problem domain knowledge is not an optional mechanism, but a fundamental feature of the Memetic Algorithms. In this paper, we present a Memetic Algorithm to tackle the three-dimensional protein structure prediction problem. The method uses a structured population and incorporates a Simulated Annealing algorithm as a local search strategy, as well as ad-hoc crossover and mutation operators to deal with the problem. It takes advantage of structural knowledge stored in the Protein Data Bank, by using an Angle Probability List that helps to reduce the search space and to guide the search strategy. The proposed algorithm was tested on 19 protein sequences of amino acid residues, and the results show the ability of the algorithm to find native-like protein structures. Experimental results have revealed that the proposed algorithm can find good solutions regarding root-mean-square deviation and global distance total score test in comparison with the experimental protein structures. We also show that our results are comparable in terms of folding organization with state-of-the-art prediction methods, corroborating the effectiveness of our proposal.
引用
收藏
页码:690 / 704
页数:15
相关论文
共 78 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]  
[Anonymous], 2011, Multiobjective Genetic Algorithms for Clustering: Applications in Data Mining and Bioinformatics
[3]  
Back T., 1996, Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms
[4]  
Baxevanis Andreas D., 2004, Bioinformatics: a practical guide to the analysis of genes and proteins, V43
[5]  
Berretta R., 1999, NEW IDEAS OPTIMIZATI, P261
[6]  
Borguesan B., 2016, J COMPUT BIOL
[7]   APL: An angle probability list to improve knowledge-based metaheuristics for the three-dimensional protein structure prediction [J].
Borguesan, Bruno ;
Barbachan e Silva, Mariel ;
Grisci, Bruno ;
Inostroza-Ponta, Mario ;
Dorn, Marti .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 59 :142-157
[8]   A survey on optimization metaheuristics [J].
Boussaid, Ilhern ;
Lepagnot, Julien ;
Siarry, Patrick .
INFORMATION SCIENCES, 2013, 237 :82-117
[9]   FUNNELS, PATHWAYS, AND THE ENERGY LANDSCAPE OF PROTEIN-FOLDING - A SYNTHESIS [J].
BRYNGELSON, JD ;
ONUCHIC, JN ;
SOCCI, ND ;
WOLYNES, PG .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1995, 21 (03) :167-195
[10]   How root-mean-square distance (r.m.s.d.) values depend on the resolution of protein structures that are compared [J].
Carugo, O .
JOURNAL OF APPLIED CRYSTALLOGRAPHY, 2003, 36 :125-128