Analysis of several key factors influencing deep learning-based inter-residue contact prediction

被引:22
|
作者
Wu, Tianqi [1 ]
Hou, Jie [1 ]
Adhikari, Badri [2 ]
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[2] Univ Missouri, Dept Math & Comp Sci, St Louis, MO 63121 USA
关键词
PROTEIN; SEQUENCE;
D O I
10.1093/bioinformatics/btz679
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Deep learning has become the dominant technology for protein contact prediction. However, the factors that affect the performance of deep learning in contact prediction have not been systematically investigated. Results: We analyzed the results of our three deep learning-based contact prediction methods (MULTICOM-CLUSTER, MULTICOM-CONSTRUCT and MULTICOM-NOVEL) in the CASP13 experiment and identified several key factors [i.e. deep learning technique, multiple sequence alignment (MSA), distance distribution prediction and domain-based contact integration] that influenced the contact prediction accuracy. We compared our convolutional neural network (CNN)-based contact prediction methods with three coevolution-based methods on 75 CASP13 targets consisting of 108 domains. We demonstrated that the CNN-based multi-distance approach was able to leverage global coevolutionary coupling patterns comprised of multiple correlated contacts for more accurate contact prediction than the local coevolution-based methods, leading to a substantial increase of precision by 19.2 percentage points. We also tested different alignment methods and domain-based contact prediction with the deep learning contact predictors. The comparison of the three methods showed deeper sequence alignments and the integration of domain-based contact prediction with the full-length contact prediction improved the performance of contact prediction. Moreover, we demonstrated that the domain-based contact prediction based on a novel ab initio approach of parsing domains from MSAs alone without using known protein structures was a simple, fast approach to improve contact prediction. Finally, we showed that predicting the distribution of inter-residue distances in multiple distance intervals could capture more structural information and improve binary contact prediction.
引用
收藏
页码:1091 / 1098
页数:8
相关论文
共 50 条
  • [11] Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
    Chen, Xiao
    Liu, Jian
    Guo, Zhiye
    Wu, Tianqi
    Hou, Jie
    Cheng, Jianlin
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [12] Protein model accuracy estimation empowered by deep learning and inter-residue distance prediction in CASP14
    Xiao Chen
    Jian Liu
    Zhiye Guo
    Tianqi Wu
    Jie Hou
    Jianlin Cheng
    Scientific Reports, 11
  • [13] Using deep-learning predictions of inter-residue distances for model validation
    Rodriguez, Filomeno Sanchez
    Chojnowski, Grzegorz
    Keegan, Ronan M.
    Rigden, Daniel J.
    ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2022, 78 : 1412 - 1427
  • [14] Enhancing protein contact map prediction accuracy via ensembles of inter-residue distance predictors
    Newton, M. A. Hakim
    Rahman, Julia
    Zaman, Rianon
    Sattar, Abdul
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 99
  • [15] Improved inter-residue contact prediction via a hybrid generative model and dynamic loss function
    Madani, Mohammad
    Behzadi, Mohammad Mahdi
    Song, Dongjin
    Ilies, Horea T.
    Tarakanova, Anna
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 6138 - 6148
  • [16] Protein inter-residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14
    Li, Yang
    Zhang, Chengxin
    Zheng, Wei
    Zhou, Xiaogen
    Bell, Eric W.
    Yu, Dong-Jun
    Zhang, Yang
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) : 1911 - 1921
  • [17] DeepBSRPred: deep learning-based binding site residue prediction for proteins
    Rahul Nikam
    Kumar Yugandhar
    M. Michael Gromiha
    Amino Acids, 2023, 55 : 1305 - 1316
  • [18] DeepBSRPred: deep learning-based binding site residue prediction for proteins
    Nikam, Rahul
    Yugandhar, Kumar
    Gromiha, M. Michael
    AMINO ACIDS, 2023, 55 (10) : 1305 - 1316
  • [19] Predicting protein inter-residue contacts using composite likelihood maximization and deep learning
    Zhang, Haicang
    Zhang, Qi
    Ju, Fusong
    Zhu, Jianwei
    Gao, Yujuan
    Xie, Ziwei
    Deng, Minghua
    Sun, Shiwei
    Zheng, Wei-Mou
    Bu, Dongbo
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [20] DeepDist: real-value inter-residue distance prediction with deep residual convolutional network
    Tianqi Wu
    Zhiye Guo
    Jie Hou
    Jianlin Cheng
    BMC Bioinformatics, 22