Detecting distant-homology protein structures by aligning deep neural-network based contact maps

被引:34
|
作者
Zheng, Wei [1 ,2 ,3 ]
Wuyun, Qiqige [2 ,3 ,4 ]
Li, Yang [1 ]
Mortuza, S. M. [1 ]
Zhang, Chengxin [1 ]
Pearce, Robin [1 ]
Ruan, Jishou [2 ,3 ,5 ]
Zhang, Yang [1 ,6 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Nankai Univ, Coll Math Sci, Tianjin, Peoples R China
[3] Nankai Univ, LPMC, Tianjin, Peoples R China
[4] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48824 USA
[5] Nankai Univ, State Key Lab Med Chem Biol, Tianjin, Peoples R China
[6] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
STRUCTURE PREDICTION; SEQUENCE; ALIGNMENT; FRAGMENTS; SEARCH;
D O I
10.1371/journal.pcbi.1007411
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins. Author summary Despite decades of effort in computational method development, template-based modeling (TBM) still remains the most reliable approach to high-resolution protein structure prediction. Previous studies have shown that the PDB library is complete for single-domain proteins and TBM is in principle sufficient to solve the structure prediction problem if the most similar structure in the PDB could be reliably identified and used as template for model reconstruction. But in reality, the success of TBM depends on the availability of closely-homologous templates, where its accuracy and reliability decrease sharply when the evolutionary relationship between query and template becomes more distant. We developed a new threading approach, CEthreader, which allows for dynamic programing alignments of predicted contact-maps through eigen-decomposition. The large-scale benchmark tests show that the coupling of contact map with profile and secondary structure alignments through the proposed protocol can significantly improve the accuracy of template recognition for distantly-homologous protein targets.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] Multimodal neural network for enhanced protein stability prediction by integration of contact scores and spatial maps
    Sigamani, G. Gladstone
    Vincent, P. M. Durai Raj
    RESULTS IN ENGINEERING, 2024, 24
  • [22] A deep neural-network classifier for photograph-based estimation of hearing protection attenuation and fita)
    Smalt, Christoper J.
    Ciccarelli, Gregory A.
    Rodriguez, Aaron R.
    Murphy, William J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (02): : 1067 - 1075
  • [23] Detecting ChatGPT Generated Texts Based on Deep Pyramid Convolutional Neural Network
    Fan, Zhiwu
    Yao, Jinliang
    Data Analysis and Knowledge Discovery, 2024, 8 (07) : 14 - 22
  • [24] Transfer Learning Based Deep Neural Network for Detecting Artefacts in Endoscopic Images
    Natarajan, Kirthika
    Balusamy, Sargunam
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2022, 13 (08) : 633 - 641
  • [25] From Interatomic Distances to Protein Tertiary Structures with a Deep Convolutional Neural Network
    Du, Yuanqi
    Kabir, Anowarul
    Zhao, Liang
    Shehu, Amarda
    ACM-BCB 2020 - 11TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2020,
  • [26] Combination of deep neural network with attention mechanism enhances the explainability of protein contact prediction
    Chen, Chen
    Wu, Tianqi
    Guo, Zhiye
    Cheng, Jianlin
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (06) : 697 - 707
  • [27] Improved Prediction Method of Protein Contact Based on RBF Neural Network
    Sun Pengfei
    Zhang Jianpei
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 816 - 819
  • [28] A protein-protein interaction extraction approach based on deep neural network
    Zhao, Zhehuan
    Yang, Zhihao
    Lin, Hongfei
    Wang, Jian
    Gao, Song
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 15 (02) : 145 - 164
  • [29] A Non-Contact PPG Biometric System Based on Deep Neural Network
    Patil, Omkar R.
    Wang, Wei
    Gao, Yang
    Xu, Wenyao
    Jin, Zhanpeng
    2018 IEEE 9TH INTERNATIONAL CONFERENCE ON BIOMETRICS THEORY, APPLICATIONS AND SYSTEMS (BTAS), 2018,
  • [30] A DEEP NEURAL NETWORK-BASED NUMERICAL METHOD FOR SOLVING CONTACT PROBLEMS
    Shen, X. I. N. G.
    Cheng, X. I. A. O. L. I. A. N. G.
    Liang, K. E. W. E., I
    Wang, X. I. L. U.
    Wu, Z. H. E. N. G. H. U. A.
    JOURNAL OF NONLINEAR AND VARIATIONAL ANALYSIS, 2022, 6 (05): : 483 - 498