Fast and accurate Ab Initio Protein structure prediction using deep learning potentials

被引:12
|
作者
Pearce, Robin [1 ]
Li, Yang [1 ]
Omenn, Gilbert S. [1 ,2 ,3 ,4 ]
Zhang, Yang [1 ,5 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Internal Med, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Human Genet, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Sch Publ Hlth, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
关键词
RESIDUE CONTACTS; FRAGMENTS;
D O I
10.1371/journal.pcbi.1010539
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Despite the immense progress recently witnessed in protein structure prediction, the modeling accuracy for proteins that lack sequence and/or structure homologs remains to be improved. We developed an open-source program, DeepFold, which integrates spatial restraints predicted by multi-task deep residual neural-networks along with a knowledgebased energy function to guide its gradient-descent folding simulations. The results on large-scale benchmark tests showed that DeepFold creates full-length models with accuracy significantly beyond classical folding approaches and other leading deep learning methods. Of particular interest is the modeling performance on the most difficult targets with very few homologous sequences, where DeepFold achieved an average TM-score that was 40.3% higher than trRosetta and 44.9% higher than DMPfold. Furthermore, the folding simulations for DeepFold were 262 times faster than traditional fragment assembly simulations. These results demonstrate the power of accurately predicted deep learning potentials to improve both the accuracy and speed of ab initio protein structure prediction. Author summary Template-free protein structure prediction remains an important unsolved problem. We proposed a new pipeline to construct full-length protein structures by coupling multiplelevel deep learning potentials with fast gradient-based folding simulations. The large-scale benchmark tests demonstrated significant advantages in both accuracy and speed over other fragment-assembly and deep learning-based approaches. The results revealed that the key factor for the success of the deep learning approach is its ability to provide an abundant set of accurate spatial restraints (similar to 93*L where L is the protein length), which help smooth the energy landscape and make gradient-based simulation searching a feasible optimization tool. Nevertheless, extensive folding simulations are still needed for the cases where only sparse restraints are available as provided by threading alignments and low-resolution structural biology experiments.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] Ab initio protein structure prediction using a combined hierarchical approach
    Samudrala, R
    Xia, Y
    Huang, E
    Levitt, M
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1999, : 194 - 198
  • [22] Contact order and ab initio protein structure prediction
    Bonneau, R
    Ruczinski, I
    Tsai, J
    Baker, D
    PROTEIN SCIENCE, 2002, 11 (08) : 1937 - 1944
  • [23] Ab Initio prediction of mycobacteriophages protein structure and function
    Chiraag D Kapadia
    Claire A Rinehart
    BMC Bioinformatics, 14
  • [24] Ab initio protein structure prediction: Progress and prospects
    Bonneau, R
    Baker, D
    ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2001, 30 : 173 - 189
  • [25] Ab Initio prediction of mycobacteriophages protein structure and function
    Kapadia, Chiraag D.
    Rinehart, Claire A.
    BMC BIOINFORMATICS, 2013, 14
  • [26] Ab initio protein structure prediction of CASP III targets using ROSETTA
    Simons, KT
    Bonneau, R
    Ruczinski, I
    Baker, D
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1999, : 171 - 176
  • [27] TSSFinder-fast and accurate ab initio prediction of the core promoter in eukaryotic genomes
    Oliveira, Mauro de Medeiros
    Bonadio, Igor
    de Melo, Alicia Lie
    Souza, Glaucia Mendes
    Durham, Alan Mitchell
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [28] Ab initio protein structure prediction and a comparative genetic algorithm
    Gregurick, S
    Pedersen, J
    Moult, J
    BIOPHYSICAL JOURNAL, 1999, 76 (01) : A382 - A382
  • [29] An Enhanced Genetic Algorithm for Ab Initio Protein Structure Prediction
    Rashid, Mahmood A.
    Khatib, Firas
    Hoque, Md Tamjidul
    Sattar, Abdul
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2016, 20 (04) : 627 - 644
  • [30] Principles, Challenges and Advances in ab initio Protein Structure Prediction
    Jothi, Arunachalam
    PROTEIN AND PEPTIDE LETTERS, 2012, 19 (11): : 1194 - 1204