Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14

被引:42
|
作者
Zheng, Wei [1 ]
Li, Yang [1 ,2 ]
Zhang, Chengxin [1 ]
Zhou, Xiaogen [1 ]
Pearce, Robin [1 ]
Bell, Eric W. [1 ]
Huang, Xiaoqiang [1 ]
Zhang, Yang [1 ,3 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[3] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
ab initio folding; CASP14; deep learning; domain partition; multiple sequence alignment; protein structure prediction; residue-residue distance prediction; FOLD-RECOGNITION; I-TASSER; SIMILARITY; SERVER;
D O I
10.1002/prot.26193
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this article, we report 3D structure prediction results by two of our best server groups ("Zhang-Server" and "QUARK") in CASP14. These two servers were built based on the D-I-TASSER and D-QUARK algorithms, which integrated four newly developed components into the classical protein folding pipelines, I-TASSER and QUARK, respectively. The new components include: (a) a new multiple sequence alignment (MSA) collection tool, DeepMSA2, which is extended from the DeepMSA program; (b) a contact-based domain boundary prediction algorithm, FUpred, to detect protein domain boundaries; (c) a residual convolutional neural network-based method, DeepPotential, to predict multiple spatial restraints by co-evolutionary features derived from the MSA; and (d) optimized spatial restraint energy potentials to guide the structure assembly simulations. For 37 FM targets, the average TM-scores of the first models produced by D-I-TASSER and D-QUARK were 96% and 112% higher than those constructed by I-TASSER and QUARK, respectively. The data analysis indicates noticeable improvements produced by each of the four new components, especially for the newly added spatial restraints from DeepPotential and the well-tuned force field that combines spatial restraints, threading templates, and generic knowledge-based potentials. However, challenges still exist in the current pipelines. These include difficulties in modeling multi-domain proteins due to low accuracy in inter-domain distance prediction and modeling protein domains from oligomer complexes, as the co-evolutionary analysis cannot distinguish inter-chain and intra-chain distances. Specifically tuning the deep learning-based predictors for multi-domain targets and protein complexes may be helpful to address these issues.
引用
收藏
页码:1734 / 1751
页数:18
相关论文
共 50 条
  • [41] Protein Secondary Structure Prediction Based on Deep Learning
    Zheng, Lin
    Li, Hong-ling
    Wu, Nan
    Ao, Li
    3RD INTERNATIONAL SYMPOSIUM ON MECHATRONICS AND INDUSTRIAL INFORMATICS, (ISMII 2017), 2017, : 171 - 177
  • [42] Ensemble deep learning model for protein secondary structure prediction using NLP metrics and explainable AI
    Vignesh, U.
    Parvathi, R.
    Ram, K. Gokul
    RESULTS IN ENGINEERING, 2024, 24
  • [43] Evaluating GPCR modeling and docking strategies in the era of deep learning-based protein structure prediction
    Lee, Sumin
    Kim, Seeun
    Lee, Gyu Rie
    Kwon, Sohee
    Woo, Hyeonuk
    Seok, Chaok
    Park, Hahnbeom
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2023, 21 : 158 - 167
  • [44] A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction
    Spencer, Matt
    Eickholt, Jesse
    Cheng, Jianlin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (01) : 103 - 112
  • [45] Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields
    Wang, Sheng
    Peng, Jian
    Ma, Jianzhu
    Xu, Jinbo
    SCIENTIFIC REPORTS, 2016, 6
  • [46] Assessment of protein model structure accuracy estimation in CASP13: Challenges in the era of deep learning
    Won, Jonghun
    Baek, Minkyung
    Monastyrskyy, Bohdan
    Kryshtafovych, Andriy
    Seok, Chaok
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) : 1351 - 1360
  • [47] Protein Secondary Structure Prediction With a Reductive Deep Learning Method
    Lyu, Zhiliang
    Wang, Zhijin
    Luo, Fangfang
    Shuai, Jianwei
    Huang, Yandong
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2021, 9
  • [48] Antibody structure prediction using interpretable deep learning
    Ruffolo, Jeffrey A.
    Sulam, Jeremias
    Gray, Jeffrey J.
    PATTERNS, 2022, 3 (02):
  • [49] Protein structure prediction (RMSD ≤ 5 Å) using machine learning models
    Pathak, Yadunath
    Rana, Prashant Singh
    Singh, P. K.
    Saraswat, Mukesh
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 14 (01) : 71 - 85
  • [50] Protein Structure Prediction Using Quantile Dragonfly and Structural Class-Based Deep Learning
    Nallasamy, Varanavasi
    Seshiah, Malarvizhi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (04)