A GMDH polynomial neural network-based method to predict approximate three-dimensional structures of polypeptides

被引:30
作者
Dorn, Marcio [1 ]
Braga, Andre L. S. [2 ]
Llanos, Carlos H. [2 ]
Coelho, Leandro S. [3 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, BR-91501970 Porto Alegre, RS, Brazil
[2] Univ Brasilia, Dept Engn Mecan, BR-70910000 Brasilia, DF, Brazil
[3] Pontificia Univ Catolica Parana, Grp Prod, BR-80215901 Curitiba, Parana, Brazil
关键词
Artificial Neural Networks; Protein Structure Prediction; Group Method of Data Handling; Multilayer Perceptron; PROTEIN-STRUCTURE PREDICTION; MOLECULAR-DYNAMICS; GLOBULAR-PROTEINS; DATA-BANK; CHANNEL; FOLD; RECOGNITION; REFINEMENT; SIMULATION; PATTERNS;
D O I
10.1016/j.eswa.2012.04.046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tertiary Protein Structure Prediction is one of the most important problems in Structural Bioinformatics. Along the last 20 years many algorithms have been proposed as to solve this problem. However, it still remains a challenging issue because of the complexity and of the dimensionality of the protein conformational search space. In this article a first principle method which uses database information for the prediction of the 3-D structure of polypeptides is presented. The technique is based on the Group Method of Data Handling (GMDH) algorithm, implemented by a software tool introduced on this work. GMDH Polynomial Neural Networks have been used with success in many fields such as data mining, knowledge discovery, pattern recognition and prediction. The proposed method was tested with seven protein sequences whose sizes vary from 14 to 54 amino acid residues. Results show that the predicted tertiary structures adopt a fold similar to the experimental structures. RMSD and secondary structure analysis reveal that the proposed method present accurate results in their predictions. The predicted structures can be used as input structures in refinement methods based on molecular mechanics (MM), e.g. molecular dynamics (MD) simulations. The search space is expected to be greatly reduced and the ab initio methods can demand a much reduced computational time to achieve a more accurate polypeptide structure. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:12268 / 12279
页数:12
相关论文
共 64 条
  • [1] Modeling and forecasting the mean hourly wind speed time series using GMDH-based abductive networks
    Abdel-Aal, R. E.
    Elhadidy, M. A.
    Shaahid, S. M.
    [J]. RENEWABLE ENERGY, 2009, 34 (07) : 1686 - 1699
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] KINETICS OF FORMATION OF NATIVE RIBONUCLEASE DURING OXIDATION OF REDUCED POLYPEPTIDE CHAIN
    ANFINSEN, CB
    HABER, E
    SELA, M
    WHITE, FH
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1961, 47 (09) : 1309 - +
  • [4] [Anonymous], MOL DYNAMICS SIMULAT
  • [5] [Anonymous], APPL INDUCTIVE LOGIC
  • [6] [Anonymous], COMPUTATIONAL COMPLE
  • [7] [Anonymous], ADV PROTEIN CHEM
  • [8] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [9] Blanc E, 1996, PROTEINS, V24, P359, DOI 10.1002/(SICI)1097-0134(199603)24:3<359::AID-PROT9>3.0.CO
  • [10] 2-B