Analysis of several key factors influencing deep learning-based inter-residue contact prediction

被引:22
|
作者
Wu, Tianqi [1 ]
Hou, Jie [1 ]
Adhikari, Badri [2 ]
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, Columbia, MO 65211 USA
[2] Univ Missouri, Dept Math & Comp Sci, St Louis, MO 63121 USA
关键词
PROTEIN; SEQUENCE;
D O I
10.1093/bioinformatics/btz679
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Deep learning has become the dominant technology for protein contact prediction. However, the factors that affect the performance of deep learning in contact prediction have not been systematically investigated. Results: We analyzed the results of our three deep learning-based contact prediction methods (MULTICOM-CLUSTER, MULTICOM-CONSTRUCT and MULTICOM-NOVEL) in the CASP13 experiment and identified several key factors [i.e. deep learning technique, multiple sequence alignment (MSA), distance distribution prediction and domain-based contact integration] that influenced the contact prediction accuracy. We compared our convolutional neural network (CNN)-based contact prediction methods with three coevolution-based methods on 75 CASP13 targets consisting of 108 domains. We demonstrated that the CNN-based multi-distance approach was able to leverage global coevolutionary coupling patterns comprised of multiple correlated contacts for more accurate contact prediction than the local coevolution-based methods, leading to a substantial increase of precision by 19.2 percentage points. We also tested different alignment methods and domain-based contact prediction with the deep learning contact predictors. The comparison of the three methods showed deeper sequence alignments and the integration of domain-based contact prediction with the full-length contact prediction improved the performance of contact prediction. Moreover, we demonstrated that the domain-based contact prediction based on a novel ab initio approach of parsing domains from MSAs alone without using known protein structures was a simple, fast approach to improve contact prediction. Finally, we showed that predicting the distribution of inter-residue distances in multiple distance intervals could capture more structural information and improve binary contact prediction.
引用
收藏
页码:1091 / 1098
页数:8
相关论文
共 50 条
  • [21] Combining a binary input encoding scheme with RBFNN for globulin protein inter-residue contact map prediction
    Zhang, GZ
    Huang, DS
    Quan, ZH
    PATTERN RECOGNITION LETTERS, 2005, 26 (10) : 1543 - 1553
  • [22] Correction to: Predicting protein inter-residue contacts using composite likelihood maximization and deep learning
    Haicang Zhang
    Qi Zhang
    Fusong Ju
    Jianwei Zhu
    Yujuan Gao
    Ziwei Xie
    Minghua Deng
    Shiwei Sun
    Wei-Mou Zheng
    Dongbo Bu
    BMC Bioinformatics, 20
  • [23] Influencing Factors and Machine Learning-Based Prediction of Side Effects in Psychotherapy
    Yao, Lijun
    Zhao, Xudong
    Xu, Zhiwei
    Chen, Yang
    Liu, Liang
    Feng, Qiang
    Chen, Fazhan
    FRONTIERS IN PSYCHIATRY, 2020, 11
  • [24] Deep Learning-Based Prediction of Key Performance Indicators for Electrical Machines
    Parekh, Vivek
    Flore, Dominik
    Schoeps, Sebastian
    IEEE ACCESS, 2021, 9 : 21786 - 21797
  • [25] Influencing Factors Evaluation of Machine Learning-Based Energy Consumption Prediction
    Khan, Prince Waqas
    Kim, Yongjun
    Byun, Yung-Cheol
    Lee, Sang-Joon
    ENERGIES, 2021, 14 (21)
  • [26] DeepTFactor: A deep learning-based tool for the prediction of transcription factors
    Kim, Gi Bae
    Gao, Ye
    Palsson, Bernhard O.
    Lee, Sang Yup
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (02)
  • [27] DomBpred: Protein Domain Boundary Prediction Based on Domain-Residue Clustering Using Inter-Residue Distance
    Yu, Zhong-Ze
    Peng, Chun-Xiang
    Liu, Jun
    Zhang, Biao
    Zhou, Xiao-Gen
    Zhang, Gui-Jun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 912 - 922
  • [28] CONSRANK: a server for the analysis, comparison and ranking of docking models based on inter-residue contacts
    Chermak, Edrisse
    Petta, Andrea
    Serra, Luigi
    Vangone, Anna
    Scarano, Vittorio
    Cavallo, Luigi
    Oliva, Romina
    BIOINFORMATICS, 2015, 31 (09) : 1481 - 1483
  • [29] Deep Learning-Based Prediction of Contact Maps and Crystal Structures of Inorganic Materials
    Hu, Jianjun
    Zhao, Yong
    Li, Qin
    Song, Yuqi
    Dong, Rongzhi
    Yang, Wenhui
    Siriwardane, Edirisuriya M. D.
    ACS OMEGA, 2023, 8 (29): : 26170 - 26179
  • [30] Deep learning-based fishing ground prediction with multiple environmental factors
    Xie, Mingyang
    Liu, Bin
    Chen, Xinjun
    MARINE LIFE SCIENCE & TECHNOLOGY, 2024, 6 (04) : 736 - 749