Predicting protein-protein interactions using high-quality non-interacting pairs

被引:24
|
作者
Zhang, Long [1 ]
Yu, Guoxian [1 ]
Guo, Maozu [2 ,3 ]
Wang, Jun [1 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
[3] Beijing Key Lab Intelligent Proc Bldg Big Data, Beijing, Peoples R China
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Protein-protein interactions; Non-interacting proteins; Deep neural networks; Sequence similarity; Random walk; HYDROPHOBICITY; NETWORKS; GENOME;
D O I
10.1186/s12859-018-2525-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundIdentifying protein-protein interactions (PPIs) is of paramount importance for understanding cellular processes. Machine learning-based approaches have been developed to predict PPIs, but the effectiveness of these approaches is unsatisfactory. One major reason is that they randomly choose non-interacting protein pairs (negative samples) or heuristically select non-interacting pairs with low quality.ResultsTo boost the effectiveness of predicting PPIs, we propose two novel approaches (NIP-SS and NIP-RW) to generate high quality non-interacting pairs based on sequence similarity and random walk, respectively. Specifically, the known PPIs collected from public databases are used to generate the positive samples. NIP-SS then selects the top-m dissimilar protein pairs as negative examples and controls the degree distribution of selected proteins to construct the negative dataset. NIP-RW performs random walk on the PPI network to update the adjacency matrix of the network, and then selects protein pairs not connected in the updated network as negative samples. Next, we use auto covariance (AC) descriptor to encode the feature information of amino acid sequences. After that, we employ deep neural networks (DNNs) to predict PPIs based on extracted features, positive and negative examples. Extensive experiments show that NIP-SS and NIP-RW can generate negative samples with higher quality than existing strategies and thus enable more accurate prediction.ConclusionsThe experimental results prove that negative datasets constructed by NIP-SS and NIP-RW can reduce the bias and have good generalization ability. NIP-SS and NIP-RW can be used as a plugin to boost the effectiveness of PPIs prediction. Codes and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NIP.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Predicting protein-protein interactions from protein sequences using meta predictor
    Xia, Jun-Feng
    Zhao, Xing-Ming
    Huang, De-Shuang
    AMINO ACIDS, 2010, 39 (05) : 1595 - 1599
  • [22] A High Efficient Biological Language Model for Predicting Protein-Protein Interactions
    Wang, Yanbin
    You, Zhu-Hong
    Yang, Shan
    Li, Xiao
    Jiang, Tong-Hai
    Zhou, Xi
    CELLS, 2019, 8 (02)
  • [24] Fusion of classifiers for predicting protein-protein interactions
    Nanni, L
    NEUROCOMPUTING, 2005, 68 : 289 - 296
  • [25] Computational Resources for Predicting Protein-Protein Interactions
    Tanwar, Himani
    Doss, C. George Priya
    PROTEIN-PROTEIN INTERACTIONS IN HUMAN DISEASE, PT A, 2018, 110 : 251 - 275
  • [26] A database server for predicting protein-protein interactions
    Han, K
    Park, B
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 271 - 278
  • [27] Computational Methods For Predicting Protein-Protein Interactions
    Pitre, Sylvain
    Alamgir, Md
    Green, James R.
    Dumontier, Michel
    Dehne, Frank
    Golshani, Ashkan
    PROTEIN - PROTEIN INTERACTION, 2008, 110 : 247 - 267
  • [28] Role of ChatGPT in predicting protein-protein interactions
    Ray, Partha Ratim
    Majumder, Poulami
    CURRENT SCIENCE, 2023, 125 (02): : 114 - 114
  • [29] A Simple Approach for Predicting Protein-Protein Interactions
    Rashid, Mamoon
    Ramasamy, Sumathy
    Raghava, Gajendra P. S.
    CURRENT PROTEIN & PEPTIDE SCIENCE, 2010, 11 (07) : 589 - 600
  • [30] Kernel methods for predicting protein-protein interactions
    Ben-Hur, A
    Noble, WS
    BIOINFORMATICS, 2005, 21 : I38 - I46