DeepHomo2.0: improved protein-protein contact prediction of homodimers by transformer -enhanced deep learning

被引:13
作者
Lin, Peicong [1 ]
Yan, Yumeng [1 ]
Huang, Sheng-You [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Phys, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
protein-protein interaction; homo-oligomers; residue-residue contact prediction; deep learning; transformer features; CRYO-EM; RESIDUE CONTACTS; COEVOLUTION; IDENTIFICATION; PRINCIPLES; SYMMETRY; DOCKING;
D O I
10.1093/bib/bbac499
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interactions play an important role in many biological processes. However, although structure prediction for monomer proteins has achieved great progress with the advent of advanced deep learning algorithms like AlphaFold, the structure prediction for protein-protein complexes remains an open question. Taking advantage of the Transformer model of ESM-MSA, we have developed a deep learning -based model, named DeepHomo2.0, to predict protein-protein interactions of homodimeric complexes by leveraging the direct-coupling analysis (DCA) and Transformer features of sequences and the structure features of monomers. DeepHomo2.0 was extensively evaluated on diverse test sets and compared with eight state-of-the-art methods including protein language model based, DCA-based and machine learning -based methods. It was shown that DeepHomo2.0 achieved a high precision of >70% with experimental monomer structures and >60% with predicted monomer structures for the top 10 predicted contacts on the test sets and outperformed the other eight methods. Moreover, even the version without using structure information, named DeepHomoSeq, still achieved a good precision of >55% for the top 10 predicted contacts. Integrating the predicted contacts into protein docking significantly improved the structure prediction of realistic Critical Assessment of Protein Structure Prediction homodimeric complexes. DeepHomo2.0 and DeepHomoSeq are available at http://huanglab.phys.hust.edu.cn/DeepHomo2/.
引用
收藏
页数:14
相关论文
共 82 条
[31]   Blind prediction of homo- and hetero-protein complexes: The CASP13-CAPRI experiment [J].
Lensink, Marc F. ;
Brysbaert, Guillaume ;
Nadzirin, Nurul ;
Velankar, Sameer ;
Chaleil, Raphael A. G. ;
Gerguri, Tereza ;
Bates, Paul A. ;
Laine, Elodie ;
Carbone, Alessandra ;
Grudinin, Sergei ;
Kong, Ren ;
Liu, Ran-Ran ;
Xu, Xi-Ming ;
Shi, Hang ;
Chang, Shan ;
Eisenstein, Miriam ;
Karczynska, Agnieszka ;
Czaplewski, Cezary ;
Lubecka, Emilia ;
Lipska, Agnieszka ;
Krupa, Pawel ;
Mozolewska, Magdalena ;
Golon, Lukasz ;
Samsonov, Sergey ;
Liwo, Adam ;
Crivelli, Silvia ;
Pages, Guillaume ;
Karasikov, Mikhail ;
Kadukova, Maria ;
Yan, Yumeng ;
Huang, Sheng-You ;
Rosell, Mireia ;
Rodriguez-Lumbreras, Luis A. ;
Romero-Durana, Miguel ;
Diaz-Bueno, Lucia ;
Fernandez-Recio, Juan ;
Christoffer, Charles ;
Terashi, Genki ;
Shin, Woong-Hee ;
Aderinwale, Tunde ;
Maddhuri Venkata Subraman, Sai Raghavendra ;
Kihara, Daisuke ;
Kozakov, Dima ;
Vajda, Sandor ;
Porter, Kathryn ;
Padhorny, Dzmitry ;
Desta, Israel ;
Beglov, Dmitri ;
Ignatov, Mikhail ;
Kotelnikov, Sergey .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) :1200-1221
[32]   The challenge of modeling protein assemblies: the CASP12-CAPRI experiment [J].
Lensink, Marc F. ;
Velankar, Sameer ;
Baek, Minkyung ;
Heo, Lim ;
Seok, Chaok ;
Wodak, Shoshana J. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 :257-273
[33]   Prediction of homoprotein and heteroprotein complexes by protein docking and template-based modeling: A CASP-CAPRI experiment [J].
Lensink, Marc F. ;
Velankar, Sameer ;
Kryshtafovych, Andriy ;
Huang, Shen-You ;
Schneidman-Duhovny, Dina ;
Sali, Andrej ;
Segura, Joan ;
Fernandez-Fuentes, Narcis ;
Viswanath, Shruthi ;
Elber, Ron ;
Grudinin, Sergei ;
Popov, Petr ;
Neveu, Emilie ;
Lee, Hasup ;
Baek, Minkyung ;
Park, Sangwoo ;
Heo, Lim ;
Lee, Gyu Rie ;
Seok, Chaok ;
Qin, Sanbo ;
Zhou, Huan-Xiang ;
Ritchie, David W. ;
Maigret, Bernard ;
Devignes, Marie-Dominique ;
Ghoorah, Anisah ;
Torchala, Mieczyslaw ;
Chaleil, Raphael A. G. ;
Bates, Paul A. ;
Ben-Zeev, Efrat ;
Eisenstein, Miriam ;
Negi, Surendra S. ;
Weng, Zhiping ;
Vreven, Thom ;
Pierce, Brian G. ;
Borrman, Tyler M. ;
Yu, Jinchao ;
Ochsenbein, Francoise ;
Guerois, Raphael ;
Vangone, Anna ;
Rodrigues, Joao P. G. L. M. ;
van Zundert, Gydo ;
Nellen, Mehdi ;
Xue, Li ;
Karaca, Ezgi ;
Melquiond, Adrien S. J. ;
Visscher, Koen ;
Kastritis, Panagiotis L. ;
Bonvin, Alexandre M. J. J. ;
Xu, Xianjin ;
Qiu, Liming .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2016, 84 :323-348
[34]  
Lesieur C, 2014, OLIGOMERIZATION OF CHEMICAL AND BIOLOGICAL COMPOUNDS, P327, DOI 10.5772/58576
[35]   Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks [J].
Li, Yang ;
Zhang, Chengxin ;
Bell, Eric W. ;
Zheng, Wei ;
Zhou, Xiaogen ;
Yu, Dong-Jun ;
Zhang, Yang ;
Kolodny, Rachel .
PLOS COMPUTATIONAL BIOLOGY, 2021, 17 (03)
[36]   ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks [J].
Li, Yang ;
Hu, Jun ;
Zhang, Chengxin ;
Yu, Dong-Jun ;
Zhang, Yang .
BIOINFORMATICS, 2019, 35 (22) :4647-4655
[37]   Attention mechanism enhanced LSTM with residual architecture and its application for protein-protein interaction residue pairs prediction [J].
Liu, Jiale ;
Gong, Xinqi .
BMC BIOINFORMATICS, 2019, 20 (01)
[38]   SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler [J].
Luo, Ruibang ;
Liu, Binghang ;
Xie, Yinlong ;
Li, Zhenyu ;
Huang, Weihua ;
Yuan, Jianying ;
He, Guangzhu ;
Chen, Yanxiang ;
Pan, Qi ;
Liu, Yunjie ;
Tang, Jingbo ;
Wu, Gengxiong ;
Zhang, Hao ;
Shi, Yujian ;
Liu, Yong ;
Yu, Chang ;
Wang, Bo ;
Lu, Yao ;
Han, Changlei ;
Cheung, David W. ;
Yiu, Siu-Ming ;
Peng, Shaoliang ;
Zhu Xiaoqian ;
Liu, Guangming ;
Liao, Xiangke ;
Li, Yingrui ;
Yang, Huanming ;
Wang, Jian ;
Lam, Tak-Wah ;
Wang, Jun .
GIGASCIENCE, 2012, 1
[39]   Membrane-dependent oligomeric structure and pore formation of β-hairpin antimicrobial peptide in lipid bilayers from solid-state NMR [J].
Mani, Rajeswari ;
Cady, Sarah D. ;
Tang, Ming ;
Waring, Alan J. ;
Lehrert, Robert I. ;
Hong, Mei .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (44) :16242-16247
[40]   Uniclust databases of clustered and deeply annotated protein sequences and alignments [J].
Mirdita, Milot ;
von den Driesch, Lars ;
Galiez, Clovis ;
Martin, Maria J. ;
Soeding, Johannes ;
Steinegger, Martin .
NUCLEIC ACIDS RESEARCH, 2017, 45 (D1) :D170-D176