DeepHomo2.0: improved protein-protein contact prediction of homodimers by transformer -enhanced deep learning

被引:13
作者
Lin, Peicong [1 ]
Yan, Yumeng [1 ]
Huang, Sheng-You [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Phys, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
protein-protein interaction; homo-oligomers; residue-residue contact prediction; deep learning; transformer features; CRYO-EM; RESIDUE CONTACTS; COEVOLUTION; IDENTIFICATION; PRINCIPLES; SYMMETRY; DOCKING;
D O I
10.1093/bib/bbac499
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interactions play an important role in many biological processes. However, although structure prediction for monomer proteins has achieved great progress with the advent of advanced deep learning algorithms like AlphaFold, the structure prediction for protein-protein complexes remains an open question. Taking advantage of the Transformer model of ESM-MSA, we have developed a deep learning -based model, named DeepHomo2.0, to predict protein-protein interactions of homodimeric complexes by leveraging the direct-coupling analysis (DCA) and Transformer features of sequences and the structure features of monomers. DeepHomo2.0 was extensively evaluated on diverse test sets and compared with eight state-of-the-art methods including protein language model based, DCA-based and machine learning -based methods. It was shown that DeepHomo2.0 achieved a high precision of >70% with experimental monomer structures and >60% with predicted monomer structures for the top 10 predicted contacts on the test sets and outperformed the other eight methods. Moreover, even the version without using structure information, named DeepHomoSeq, still achieved a good precision of >55% for the top 10 predicted contacts. Integrating the predicted contacts into protein docking significantly improved the structure prediction of realistic Critical Assessment of Protein Structure Prediction homodimeric complexes. DeepHomo2.0 and DeepHomoSeq are available at http://huanglab.phys.hust.edu.cn/DeepHomo2/.
引用
收藏
页数:14
相关论文
共 82 条
  • [1] DNCON2: improved protein contact prediction using two-level deep convolutional neural networks
    Adhikari, Badri
    Hou, Jie
    Cheng, Jianlin
    [J]. BIOINFORMATICS, 2018, 34 (09) : 1466 - 1472
  • [2] AlphaFold at CASP13
    AlQuraishi, Mohammed
    [J]. BIOINFORMATICS, 2019, 35 (22) : 4862 - 4865
  • [3] Emergence of symmetry in homooligomeric biological assemblies
    Andre, Ingemar
    Strauss, Charlie E. M.
    Kaplan, David B.
    Bradley, Philip
    Baker, David
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (42) : 16148 - 16152
  • [4] Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
  • [5] How cryo-EM is revolutionizing structural biology
    Bai, Xiao-Chen
    McMullan, Greg
    Scheres, Sjors H. W.
    [J]. TRENDS IN BIOCHEMICAL SCIENCES, 2015, 40 (01) : 49 - 57
  • [6] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [7] ProteinBERT: a universal deep-learning model of protein sequence and function
    Brandes, Nadav
    Ofer, Dan
    Peleg, Yam
    Rappoport, Nadav
    Linial, Michal
    [J]. BIOINFORMATICS, 2022, 38 (08) : 2102 - 2110
  • [8] Improved prediction of protein-protein interactions using AlphaFold2
    Bryant, P.
    Pozzati, G.
    Elofsson, A.
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)
  • [9] Estimation of model accuracy in CASP13
    Chene, Jianlin
    Choe, Myong-Ho
    Elofsson, Arne
    Han, Kun-Sop
    Hoe, Jie
    Maghrabi, Ali H. A.
    McGuffin, Liam J.
    Menendez-Hurtado, David
    Olechnovic, Klinnent
    Schwede, Torsten
    Studer, Gabriel
    Uziela, Karolis
    Venclovas, Ceslovas
    Wallner, Bjorn
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) : 1361 - 1377
  • [10] Single-particle cryo-EM-How did it get here and where will it go
    Cheng, Yifan
    [J]. SCIENCE, 2018, 361 (6405) : 876 - +