CONFOLD: Residue-residue contact-guided ab initio protein folding

被引:101
作者
Adhikari, Badri [1 ]
Bhattacharya, Debswapna [1 ]
Cao, Renzhi [1 ]
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
关键词
protein residue-residue contacts; protein structure modeling; ab initio protein folding; contact assisted protein structure prediction; optimization; SECONDARY STRUCTURE PREDICTION; NEURAL-NETWORKS; BETA-SHEETS; RECONSTRUCTION; CRYSTALLOGRAPHY; DEFINITION; SOFTWARE;
D O I
10.1002/prot.24829
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Predicted protein residue-residue contacts can be used to build three-dimensional models and consequently to predict protein folds from scratch. A considerable amount of effort is currently being spent to improve contact prediction accuracy, whereas few methods are available to construct protein tertiary structures from predicted contacts. Here, we present an ab initio protein folding method to build three-dimensional models using predicted contacts and secondary structures. Our method first translates contacts and secondary structures into distance, dihedral angle, and hydrogen bond restraints according to a set of new conversion rules, and then provides these restraints as input for a distance geometry algorithm to build tertiary structure models. The initially reconstructed models are used to regenerate a set of physically realistic contact restraints and detect secondary structure patterns, which are then used to reconstruct final structural models. This unique two-stage modeling approach of integrating contacts and secondary structures improves the quality and accuracy of structural models and in particular generates better -sheets than other algorithms. We validate our method on two standard benchmark datasets using true contacts and secondary structures. Our method improves TM-score of reconstructed protein models by 45% and 42% over the existing method on the two datasets, respectively. On the dataset for benchmarking reconstructions methods with predicted contacts and secondary structures, the average TM-score of best models reconstructed by our method is 0.59, 5.5% higher than the existing method. The CONFOLD web server is available at . Proteins 2015; 83:1436-1449. (c) 2015 Wiley Periodicals, Inc.
引用
收藏
页码:1436 / 1449
页数:14
相关论文
共 49 条
  • [1] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [2] PROTEIN STRUCTURES FROM DISTANCE INEQUALITIES
    BOHR, J
    BOHR, H
    BRUNAK, S
    COTTERILL, RMJ
    FREDHOLM, H
    LAUTRUP, B
    PETERSEN, SB
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1993, 231 (03) : 861 - 869
  • [3] Crystallography & NMR system:: A new software suite for macromolecular structure determination
    Brunger, AT
    Adams, PD
    Clore, GM
    DeLano, WL
    Gros, P
    Grosse-Kunstleve, RW
    Jiang, JS
    Kuszewski, J
    Nilges, M
    Pannu, NS
    Read, RJ
    Rice, LM
    Simonson, T
    Warren, GL
    [J]. ACTA CRYSTALLOGRAPHICA SECTION D-BIOLOGICAL CRYSTALLOGRAPHY, 1998, 54 : 905 - 921
  • [4] Version 1.2 of the Crystallography and NMR system
    Brunger, Axel T.
    [J]. NATURE PROTOCOLS, 2007, 2 (11) : 2728 - 2733
  • [5] Chen Ke, 2013, Methods Mol Biol, V932, P63, DOI 10.1007/978-1-62703-065-6_5
  • [6] SCRATCH: a protein structure and structural feature prediction server
    Cheng, J
    Randall, AZ
    Sweredoski, MJ
    Baldi, P
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : W72 - W76
  • [7] Improved residue contact prediction using support vector machines and a large feature set
    Cheng, Jianlin
    Baldi, Pierre
    [J]. BMC BIOINFORMATICS, 2007, 8 (1)
  • [8] Three-stage prediction of protein β-sheets by neural networks, alignments and graph algorithms
    Cheng, JL
    Baldi, P
    [J]. BIOINFORMATICS, 2005, 21 : I75 - I84
  • [9] The Jpred 3 secondary structure prediction server
    Cole, Christian
    Barber, Jonathan D.
    Barton, Geoffrey J.
    [J]. NUCLEIC ACIDS RESEARCH, 2008, 36 : W197 - W201
  • [10] On the Reconstruction of Three-dimensional Protein Structures from Contact Maps
    Di Lena, Pietro
    Vassura, Marco
    Margara, Luciano
    Fariselli, Piero
    Casadio, Rita
    [J]. ALGORITHMS, 2009, 2 (01) : 76 - 92