Predicting protein-protein interactions through sequence-based deep learning

被引:244
作者
Hashemifar, Somaye [1 ]
Neyshabur, Behnam [1 ]
Khan, Aly A. [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL 60637 USA
关键词
D O I
10.1093/bioinformatics/bty573
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: High-throughput experimental techniques have produced a large amount of protein-protein interaction (PPI) data, but their coverage is still low and the PPI data is also very noisy. Computational prediction of PPIs can be used to discover new PPIs and identify errors in the experimental PPI data. Results: We present a novel deep learning framework, DPPI, to model and predict PPIs from sequence information alone. Our model efficiently applies a deep, Siamese-like convolutional neural network combined with random projection and data augmentation to predict PPIs, leveraging existing high-quality experimental PPI data and evolutionary information of a protein pair under prediction. Our experimental results show that DPPI outperforms the state-of-the-art methods on several benchmarks in terms of area under precision-recall curve (auPR), and computationally is more efficient. We also show that DPPI is able to predict homodimeric interactions where other methods fail to work accurately, and the effectiveness of DPPI in specific applications such as predicting cytokine-receptor binding affinities.
引用
收藏
页码:802 / 810
页数:9
相关论文
共 35 条
[21]   Short Co-occurring Polypeptide Regions Can Predict Global Protein Interaction Maps [J].
Pitre, Sylvain ;
Hooshyar, Mohsen ;
Schoenrock, Andrew ;
Samanfar, Bahram ;
Jessulat, Matthew ;
Green, James R. ;
Dehne, Frank ;
Golshani, Ashkan .
SCIENTIFIC REPORTS, 2012, 2
[22]   The Database of Interacting Proteins: 2004 update [J].
Salwinski, L ;
Miller, CS ;
Smith, AJ ;
Pettit, FK ;
Bowie, JU ;
Eisenberg, D .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D449-D451
[23]   HIPPIE: Integrating Protein Interaction Networks with Experiment Based Quality Scores [J].
Schaefer, Martin H. ;
Fontaine, Jean-Fred ;
Vinayagam, Arunachalam ;
Porras, Pablo ;
Wanker, Erich E. ;
Andrade-Navarro, Miguel A. .
PLOS ONE, 2012, 7 (02)
[24]   Electrostatic aspects of protein-protein interactions [J].
Sheinerman, FB ;
Norel, R ;
Honig, B .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2000, 10 (02) :153-159
[25]   Predictina protein-protein interactions based only on sequences information [J].
Shen, Juwen ;
Zhang, Jian ;
Luo, Xiaomin ;
Zhu, Weiliang ;
Yu, Kunqian ;
Chen, Kaixian ;
Li, Yixue ;
Jiang, Hualiang .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (11) :4337-4341
[26]   CREATING ARTIFICIAL NEURAL NETWORKS THAT GENERALIZE [J].
SIETSMA, J ;
DOW, RJF .
NEURAL NETWORKS, 1991, 4 (01) :67-79
[27]   Sequence-based prediction of protein protein interaction using a deep-learning algorithm [J].
Sun, Tanlin ;
Zhou, Bo ;
Lai, Luhua ;
Pei, Jianfeng .
BMC BIOINFORMATICS, 2017, 18
[28]  
Sutskever I, 2013, INT C MACH LEARN, V28, P1139
[29]   Protein interaction mapping in C-elegans using proteins involved in vulval development [J].
Walhout, AJM ;
Sordella, R ;
Lu, XW ;
Hartley, JL ;
Temple, GF ;
Brasch, MA ;
Thierry-Mieg, N ;
Vidal, M .
SCIENCE, 2000, 287 (5450) :116-122
[30]   Detection of Protein-Protein Interactions from Amino Acid Sequences Using a Rotation Forest Model with a Novel PR-LPQ Descriptor [J].
Wong, Leon ;
You, Zhu-Hong ;
Li, Shuai ;
Huang, Yu-An ;
Liu, Gang .
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 :713-720