Analyzing effect of quadruple multiple sequence alignments on deep learning based protein inter-residue distance prediction

被引:0
|
作者
Aashish Jain
Genki Terashi
Yuki Kagaya
Sai Raghavendra Maddhuri Venkata Subramaniya
Charles Christoffer
Daisuke Kihara
机构
[1] Purdue University,Department of Computer Science
[2] Purdue University,Department of Biological Sciences
[3] Tohoku University,Graduate School of Information Sciences
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Protein 3D structure prediction has advanced significantly in recent years due to improving contact prediction accuracy. This improvement has been largely due to deep learning approaches that predict inter-residue contacts and, more recently, distances using multiple sequence alignments (MSAs). In this work we present AttentiveDist, a novel approach that uses different MSAs generated with different E-values in a single model to increase the co-evolutionary information provided to the model. To determine the importance of each MSA’s feature at the inter-residue level, we added an attention layer to the deep neural network. We show that combining four MSAs of different E-value cutoffs improved the model prediction performance as compared to single E-value MSA features. A further improvement was observed when an attention layer was used and even more when additional prediction tasks of bond angle predictions were added. The improvement of distance predictions were successfully transferred to achieve better protein tertiary structure modeling.
引用
收藏
相关论文
共 50 条
  • [31] Protein Residue Contact Prediction Based on Deep Learning and Massive Statistical Features from Multi-Sequence Alignment
    Zhang, Huiling
    Hao, Min
    Wu, Hao
    Ting, Hing-Fung
    Tang, Yihong
    Xi, Wenhui
    Wei, Yanjie
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (05) : 843 - 854
  • [32] Deep Graph Learning to Estimate Protein Model Quality Using Structural Constraints From Multiple Sequence Alignments
    Rahbar, Mahdi
    Chauhan, Rahul Kumar
    Shah, Pankil Nimeshbhai
    Cao, Renzhi
    Si, Dong
    Hou, Jie
    13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022, 2022,
  • [33] Protein-Protein Interaction Interface Residue Pair Prediction Based on Deep Learning Architecture
    Zhao, Zhenni
    Gong, Xinqi
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (05) : 1753 - 1759
  • [34] Sequence representation approaches for sequence-based protein prediction tasks that use deep learning
    Cui, Feifei
    Zhang, Zilong
    Zou, Quan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2021, 20 (01) : 61 - 73
  • [35] DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction
    Elbasir, Abdurrahman
    Moovarkumudalvan, Balasubramanian
    Kunji, Khalid
    Kolatkar, Prasanna R.
    Mall, Raghvendra
    Bensmail, Halima
    BIOINFORMATICS, 2019, 35 (13) : 2216 - 2225
  • [36] DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction
    Elbasir, Abdurrahman
    Moovarkumudalvan, Balasubramanian
    Kunji, Khalid
    Kolatkar, Prasanna R.
    Bensmail, Halima
    Mall, Raghvendra
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2747 - 2749
  • [37] DeepSol: a deep learning framework for sequence-based protein solubility prediction
    Khurana, Sameer
    Rawi, Reda
    Kunji, Khalid
    Chuang, Gwo-Yu
    Bensmail, Halima
    Mall, Raghvendra
    BIOINFORMATICS, 2018, 34 (15) : 2605 - 2613
  • [38] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Sun, Tanlin
    Zhou, Bo
    Lai, Luhua
    Pei, Jianfeng
    BMC BIOINFORMATICS, 2017, 18
  • [39] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 18
  • [40] Improving deep learning-based protein distance prediction in CASP14
    Guo, Zhiye
    Wu, Tianqi
    Liu, Jian
    Hou, Jie
    Cheng, Jianlin
    BIOINFORMATICS, 2021, 37 (19) : 3190 - 3196