Learning spatial structures of proteins improves protein-protein interaction prediction

被引:76
作者
Song, Bosheng [1 ]
Luo, Xiaoyan [1 ]
Luo, Xiaoli [1 ]
Liu, Yuansheng [1 ]
Niu, Zhangming [2 ]
Zeng, Xiangxiang [1 ]
机构
[1] Hunan Univ, Changsha, Hunan, Peoples R China
[2] MindRank AI Ltd, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
protein-protein interaction; protein representation learning; graph neural network; multi-dimension feature confusion; DATABASE; MODEL;
D O I
10.1093/bib/bbab558
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Spatial structures of proteins are closely related to protein functions. Integrating protein structures improves the performance of protein-protein interaction (PPI) prediction. However, the limited quantity of known protein structures restricts the application of structure-based prediction methods. Utilizing the predicted protein structure information is a promising method to improve the performance of sequence-based prediction methods. We propose a novel end-to-end framework, TAGPPI, to predict PPIs using protein sequence alone. TAGPPI extracts multi-dimensional features by employing 1D convolution operation on protein sequences and graph learning method on contact maps constructed from AlphaFold. A contact map contains abundant spatial structure information, which is difficult to obtain from 1D sequence data directly. We further demonstrate that the spatial information learned from contact maps improves the ability of TAGPPI in PPI prediction tasks. We compare the performance of TAGPPI with those of nine state-of-the-art sequence-based methods, and TAGPPI outperforms such methods in all metrics. To the best of our knowledge, this is the first method to use the predicted protein topology structure graph for sequence-based PPI prediction. More importantly, our proposed architecture could be extended to other prediction tasks related to proteins.
引用
收藏
页数:11
相关论文
共 45 条
[1]   Systematic evaluation of machine learning methods for identifying human-pathogen protein-protein interactions [J].
Chen, Huaming ;
Li, Fuyi ;
Wang, Lei ;
Jin, Yaochu ;
Chi, Chi-Hung ;
Kurgan, Lukasz ;
Song, Jiangning ;
Shen, Jun .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
[2]   Protein-protein interaction prediction using a hybrid feature representation and a stacked generalization scheme [J].
Chen, Kuan-Hsi ;
Wang, Tsai-Feng ;
Hu, Yuh-Jyh .
BMC BIOINFORMATICS, 2019, 20 (1)
[3]   Multifaceted protein-protein interaction prediction based on Siamese residual RCNN [J].
Chen, Muhao ;
Ju, Chelsea J. -T. ;
Zhou, Guangyu ;
Chen, Xuelu ;
Zhang, Tianran ;
Chang, Kai-Wei ;
Zaniolo, Carlo ;
Wang, Wei .
BIOINFORMATICS, 2019, 35 (14) :I305-I314
[4]   Molecular design in drug discovery: a comprehensive review of deep generative models [J].
Cheng, Yu ;
Gong, Yongshun ;
Liu, Yuansheng ;
Song, Bosheng ;
Zou, Quan .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]   DeepPPI: Boosting Prediction of Protein-Protein Interactions with Deep Neural Networks [J].
Du, Xiuquan ;
Sun, Shiwei ;
Hu, Changlin ;
Yao, Yu ;
Yan, Yuanting ;
Zhang, Yanping .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (06) :1499-1510
[7]   Functional organization of the yeast proteome by systematic analysis of protein complexes [J].
Gavin, AC ;
Bösche, M ;
Krause, R ;
Grandi, P ;
Marzioch, M ;
Bauer, A ;
Schultz, J ;
Rick, JM ;
Michon, AM ;
Cruciat, CM ;
Remor, M ;
Höfert, C ;
Schelder, M ;
Brajenovic, M ;
Ruffner, H ;
Merino, A ;
Klein, K ;
Hudak, M ;
Dickson, D ;
Rudi, T ;
Gnau, V ;
Bauch, A ;
Bastuck, S ;
Huhse, B ;
Leutwein, C ;
Heurtier, MA ;
Copley, RR ;
Edelmann, A ;
Querfurth, E ;
Rybin, V ;
Drewes, G ;
Raida, M ;
Bouwmeester, T ;
Bork, P ;
Seraphin, B ;
Kuster, B ;
Neubauer, G ;
Superti-Furga, G .
NATURE, 2002, 415 (6868) :141-147
[8]  
Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
[9]   Using support vector machine combined with auto covariance to predict proteinprotein interactions from protein sequences [J].
Guo, Yanzhi ;
Yu, Lezheng ;
Wen, Zhining ;
Li, Menglong .
NUCLEIC ACIDS RESEARCH, 2008, 36 (09) :3025-3030
[10]   Predicting protein-protein interactions through sequence-based deep learning [J].
Hashemifar, Somaye ;
Neyshabur, Behnam ;
Khan, Aly A. ;
Xu, Jinbo .
BIOINFORMATICS, 2018, 34 (17) :802-810