Detecting distant-homology protein structures by aligning deep neural-network based contact maps

被引:34
|
作者
Zheng, Wei [1 ,2 ,3 ]
Wuyun, Qiqige [2 ,3 ,4 ]
Li, Yang [1 ]
Mortuza, S. M. [1 ]
Zhang, Chengxin [1 ]
Pearce, Robin [1 ]
Ruan, Jishou [2 ,3 ,5 ]
Zhang, Yang [1 ,6 ]
机构
[1] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[2] Nankai Univ, Coll Math Sci, Tianjin, Peoples R China
[3] Nankai Univ, LPMC, Tianjin, Peoples R China
[4] Michigan State Univ, Comp Sci & Engn Dept, E Lansing, MI 48824 USA
[5] Nankai Univ, State Key Lab Med Chem Biol, Tianjin, Peoples R China
[6] Univ Michigan, Dept Biol Chem, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
STRUCTURE PREDICTION; SEQUENCE; ALIGNMENT; FRAGMENTS; SEARCH;
D O I
10.1371/journal.pcbi.1007411
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurate prediction of atomic-level protein structure is important for annotating the biological functions of protein molecules and for designing new compounds to regulate the functions. Template-based modeling (TBM), which aims to construct structural models by copying and refining the structural frameworks of other known proteins, remains the most accurate method for protein structure prediction. Due to the difficulty in recognizing distant-homology templates, however, the accuracy of TBM decreases rapidly when the evolutionary relationship between the query and template vanishes. In this study, we propose a new method, CEthreader, which first predicts residue-residue contacts by coupling evolutionary precision matrices with deep residual convolutional neural-networks. The predicted contact maps are then integrated with sequence profile alignments to recognize structural templates from the PDB. The method was tested on two independent benchmark sets consisting collectively of 1,153 non-homologous protein targets, where CEthreader detected 176% or 36% more correct templates with a TM-score >0.5 than the best state-of-the-art profile- or contact-based threading methods, respectively, for the Hard targets that lacked homologous templates. Moreover, CEthreader was able to identify 114% or 20% more correct templates with the same Fold as the query, after excluding structures from the same SCOPe Superfamily, than the best profile- or contact-based threading methods. Detailed analyses show that the major advantage of CEthreader lies in the efficient coupling of contact maps with profile alignments, which helps recognize global fold of protein structures when the homologous relationship between the query and template is weak. These results demonstrate an efficient new strategy to combine ab initio contact map prediction with profile alignments to significantly improve the accuracy of template-based structure prediction, especially for distant-homology proteins. Author summary Despite decades of effort in computational method development, template-based modeling (TBM) still remains the most reliable approach to high-resolution protein structure prediction. Previous studies have shown that the PDB library is complete for single-domain proteins and TBM is in principle sufficient to solve the structure prediction problem if the most similar structure in the PDB could be reliably identified and used as template for model reconstruction. But in reality, the success of TBM depends on the availability of closely-homologous templates, where its accuracy and reliability decrease sharply when the evolutionary relationship between query and template becomes more distant. We developed a new threading approach, CEthreader, which allows for dynamic programing alignments of predicted contact-maps through eigen-decomposition. The large-scale benchmark tests show that the coupling of contact map with profile and secondary structure alignments through the proposed protocol can significantly improve the accuracy of template recognition for distantly-homologous protein targets.
引用
收藏
页数:27
相关论文
共 50 条
  • [41] Prediction of protein supersecondary structures based on the artificial neural network method
    Sun, ZR
    Rao, XQ
    Peng, LW
    Xu, D
    PROTEIN ENGINEERING, 1997, 10 (07): : 763 - 769
  • [42] DeepInteract: Deep Neural Network Based Protein-Protein Interaction Prediction Tool
    Patel, Sunil
    Tripathi, Rashmi
    Kumari, Vandana
    Varadwaj, Pritish
    CURRENT BIOINFORMATICS, 2017, 12 (06) : 551 - 557
  • [43] FUpred: detecting protein domains through deep-learning-based contact map prediction
    Zheng, Wei
    Zhou, Xiaogen
    Wuyun, Qiqige
    Pearce, Robin
    Li, Yang
    Zhang, Yang
    BIOINFORMATICS, 2020, 36 (12) : 3749 - 3757
  • [44] Deep-Unfolding Neural-Network Aided Hybrid Beamforming Based on Symbol-Error Probability Minimization
    Shi, Shuhan
    Cai, Yunlong
    Hu, Qiyu
    Champagne, Benoit
    Hanzo, Lajos
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 529 - 545
  • [45] ITT-RNA: Imperfection Tolerable Training for RRAM-Crossbar-Based Deep Neural-Network Accelerator
    Song, Zhuoran
    Sun, Yanan
    Chen, Lerong
    Li, Tianjian
    Jing, Naifeng
    Liang, Xiaoyao
    Jiang, Li
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2021, 40 (01) : 129 - 142
  • [46] A deep convolutional neural network based on U-Net to predict annual luminance maps
    Qorbani, Mohammad Ali
    Dalirani, Farhad
    Rahmati, Mohammad
    Hafezi, Mohammad Reza
    JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2022, 15 (01) : 62 - 80
  • [47] Detecting COVID-19 patients based on fuzzy inference engine and Deep Neural Network
    Shaban, Warda M.
    Rabie, Asmaa H.
    Saleh, Ahmed I.
    Abo-Elsoud, M. A.
    APPLIED SOFT COMPUTING, 2021, 99
  • [48] Deep neural network for detecting arbitrary precision peptide features through attention based segmentation
    Zohora, Fatema Tuz
    Rahman, M. Ziaur
    Tran, Ngoc Hieu
    Xin, Lei
    Shan, Baozhen
    Li, Ming
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [49] Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep Neural Network
    Zhang, Ziqi
    Robinson, David
    Tepper, Jonathan
    SEMANTIC WEB (ESWC 2018), 2018, 10843 : 745 - 760
  • [50] Deep neural network for detecting arbitrary precision peptide features through attention based segmentation
    Fatema Tuz Zohora
    M. Ziaur Rahman
    Ngoc Hieu Tran
    Lei Xin
    Baozhen Shan
    Ming Li
    Scientific Reports, 11