Predicting protein-protein interactions through sequence-based deep learning

被引:226
|
作者
Hashemifar, Somaye [1 ]
Neyshabur, Behnam [1 ]
Khan, Aly A. [1 ]
Xu, Jinbo [1 ]
机构
[1] Toyota Technol Inst, Chicago, IL 60637 USA
关键词
D O I
10.1093/bioinformatics/bty573
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: High-throughput experimental techniques have produced a large amount of protein-protein interaction (PPI) data, but their coverage is still low and the PPI data is also very noisy. Computational prediction of PPIs can be used to discover new PPIs and identify errors in the experimental PPI data. Results: We present a novel deep learning framework, DPPI, to model and predict PPIs from sequence information alone. Our model efficiently applies a deep, Siamese-like convolutional neural network combined with random projection and data augmentation to predict PPIs, leveraging existing high-quality experimental PPI data and evolutionary information of a protein pair under prediction. Our experimental results show that DPPI outperforms the state-of-the-art methods on several benchmarks in terms of area under precision-recall curve (auPR), and computationally is more efficient. We also show that DPPI is able to predict homodimeric interactions where other methods fail to work accurately, and the effectiveness of DPPI in specific applications such as predicting cytokine-receptor binding affinities.
引用
收藏
页码:802 / 810
页数:9
相关论文
共 50 条
  • [31] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    CURRENT GENOMICS, 2020, 21 (06) : 454 - 463
  • [32] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [33] Prediction of Protein-Protein Interactions Based on Integrating Deep Learning and Feature Fusion
    Tran, Hoai-Nhan
    Nguyen, Phuc-Xuan-Quynh
    Guo, Fei
    Wang, Jianxin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (11)
  • [34] Predicting Protein-Protein Interactions based on ensemble classifiers
    Zhou, Zheng-Rong
    Song, Xiao-Feng
    Wang, Ming-Hao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1464 - 1467
  • [35] CollaPPI: A Collaborative Learning Framework for Predicting Protein-Protein Interactions
    Ma, Wenjian
    Bi, Xiangpeng
    Jiang, Huasen
    Zhang, Shugang
    Wei, Zhiqiang
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) : 3167 - 3177
  • [36] Recent developments of sequence-based prediction of protein–protein interactions
    Yoichi Murakami
    Kenji Mizuguchi
    Biophysical Reviews, 2022, 14 : 1393 - 1411
  • [37] DeepCrystal: a deep learning framework for sequence-based protein crystallization prediction
    Elbasir, Abdurrahman
    Moovarkumudalvan, Balasubramanian
    Kunji, Khalid
    Kolatkar, Prasanna R.
    Mall, Raghvendra
    Bensmail, Halima
    BIOINFORMATICS, 2019, 35 (13) : 2216 - 2225
  • [38] DeepCrystal: A Deep Learning Framework for Sequence-based Protein Crystallization Prediction
    Elbasir, Abdurrahman
    Moovarkumudalvan, Balasubramanian
    Kunji, Khalid
    Kolatkar, Prasanna R.
    Bensmail, Halima
    Mall, Raghvendra
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 2747 - 2749
  • [39] DeepSol: a deep learning framework for sequence-based protein solubility prediction
    Khurana, Sameer
    Rawi, Reda
    Kunji, Khalid
    Chuang, Gwo-Yu
    Bensmail, Halima
    Mall, Raghvendra
    BIOINFORMATICS, 2018, 34 (15) : 2605 - 2613
  • [40] Unified rational protein engineering with sequence-based deep representation learning
    Alley, Ethan C.
    Khimulya, Grigory
    Biswas, Surojit
    AlQuraishi, Mohammed
    Church, George M.
    NATURE METHODS, 2019, 16 (12) : 1315 - +