Fast and accurate Ab Initio Protein structure prediction using deep learning potentials

被引:12
|
作者
Pearce, Robin [1 ]
Li, Yang [1 ]
Omenn, Gilbert S. [1 ,2 ,3 ,4 ]
Zhang, Yang [1 ,5 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Internal Med, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Dept Human Genet, Ann Arbor, MI 48109 USA
[4] Univ Michigan, Sch Publ Hlth, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
关键词
RESIDUE CONTACTS; FRAGMENTS;
D O I
10.1371/journal.pcbi.1010539
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Despite the immense progress recently witnessed in protein structure prediction, the modeling accuracy for proteins that lack sequence and/or structure homologs remains to be improved. We developed an open-source program, DeepFold, which integrates spatial restraints predicted by multi-task deep residual neural-networks along with a knowledgebased energy function to guide its gradient-descent folding simulations. The results on large-scale benchmark tests showed that DeepFold creates full-length models with accuracy significantly beyond classical folding approaches and other leading deep learning methods. Of particular interest is the modeling performance on the most difficult targets with very few homologous sequences, where DeepFold achieved an average TM-score that was 40.3% higher than trRosetta and 44.9% higher than DMPfold. Furthermore, the folding simulations for DeepFold were 262 times faster than traditional fragment assembly simulations. These results demonstrate the power of accurately predicted deep learning potentials to improve both the accuracy and speed of ab initio protein structure prediction. Author summary Template-free protein structure prediction remains an important unsolved problem. We proposed a new pipeline to construct full-length protein structures by coupling multiplelevel deep learning potentials with fast gradient-based folding simulations. The large-scale benchmark tests demonstrated significant advantages in both accuracy and speed over other fragment-assembly and deep learning-based approaches. The results revealed that the key factor for the success of the deep learning approach is its ability to provide an abundant set of accurate spatial restraints (similar to 93*L where L is the protein length), which help smooth the energy landscape and make gradient-based simulation searching a feasible optimization tool. Nevertheless, extensive folding simulations are still needed for the cases where only sparse restraints are available as provided by threading alignments and low-resolution structural biology experiments.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction
    Spencer, Matt
    Eickholt, Jesse
    Cheng, Jianlin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (01) : 103 - 112
  • [2] DEVELOPMENT OF ACCURATE AND EFFICIENT AB INITIO POTENTIALS FOR EFFECTIVE CRYSTAL STRUCTURE PREDICTION
    Bowskill, David
    Sugden, Isaac
    Pantelides, Constantinos
    Adjiman, Claire
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2019, 75 : E549 - E549
  • [3] New potentials for accurate and efficient ab initio crystal structure prediction methods
    Bowskill, David
    Sugden, Isaac
    Gatsiou, Christina-Anna
    Adjiman, Claire S.
    Pantelides, Constantinos C.
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2018, 74 : E362 - E362
  • [4] Integrating end-to-end learning with deep geometrical potentials for ab initio RNA structure prediction
    Li, Yang
    Zhang, Chengxin
    Feng, Chenjie
    Pearce, Robin
    Lydia Freddolino, P.
    Zhang, Yang
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [5] Integrating end-to-end learning with deep geometrical potentials for ab initio RNA structure prediction
    Yang Li
    Chengxin Zhang
    Chenjie Feng
    Robin Pearce
    P. Lydia Freddolino
    Yang Zhang
    Nature Communications, 14
  • [6] Ab initio prediction of protein structure using LINUS
    Srinivasan, R
    Rose, GD
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 47 (04) : 489 - 495
  • [7] Deep learning geometrical potential for high-accuracy ab initio protein structure prediction
    Li, Yang
    Zhang, Chengxin
    Yu, Dong-Jun
    Zhang, Yang
    ISCIENCE, 2022, 25 (06)
  • [8] Improved protein structure prediction using potentials from deep learning
    Andrew W. Senior
    Richard Evans
    John Jumper
    James Kirkpatrick
    Laurent Sifre
    Tim Green
    Chongli Qin
    Augustin Žídek
    Alexander W. R. Nelson
    Alex Bridgland
    Hugo Penedones
    Stig Petersen
    Karen Simonyan
    Steve Crossan
    Pushmeet Kohli
    David T. Jones
    David Silver
    Koray Kavukcuoglu
    Demis Hassabis
    Nature, 2020, 577 : 706 - 710
  • [9] Improved protein structure prediction using potentials from deep learning
    Senior, Andrew W.
    Evans, Richard
    Jumper, John
    Kirkpatrick, James
    Sifre, Laurent
    Green, Tim
    Qin, Chongli
    Zidek, Augustin
    Nelson, Alexander W. R.
    Bridgland, Alex
    Penedones, Hugo
    Petersen, Stig
    Simonyan, Karen
    Crossan, Steve
    Kohli, Pushmeet
    Jones, David T.
    Silver, David
    Kavukcuoglu, Koray
    Hassabis, Demis
    NATURE, 2020, 577 (7792) : 706 - +
  • [10] Ab initio protein structure prediction
    Hardin, C
    Pogorelov, TV
    Luthey-Schulten, Z
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (02) : 176 - 181