Predicting protein-protein interactions using high-quality non-interacting pairs

Cited by: 24
Authors
Zhang, Long [1]
Yu, Guoxian [1]
Guo, Maozu [2, 3]
Wang, Jun [1]
Affiliations
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
[3] Beijing Key Lab Intelligent Proc Bldg Big Data, Beijing, Peoples R China
Source
BMC BIOINFORMATICS | 2018 / Volume 19
Keywords
Protein-protein interactions; Non-interacting proteins; Deep neural networks; Sequence similarity; Random walk; HYDROPHOBICITY; NETWORKS; GENOME;
DOI
10.1186/s12859-018-2525-3
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
Background: Identifying protein-protein interactions (PPIs) is of paramount importance for understanding cellular processes. Machine learning-based approaches have been developed to predict PPIs, but their effectiveness is unsatisfactory. One major reason is that they either choose non-interacting protein pairs (negative samples) at random or heuristically select low-quality non-interacting pairs.
Results: To boost the effectiveness of PPI prediction, we propose two novel approaches (NIP-SS and NIP-RW) that generate high-quality non-interacting pairs based on sequence similarity and random walk, respectively. Specifically, the known PPIs collected from public databases are used to generate the positive samples. NIP-SS then selects the top-m dissimilar protein pairs as negative examples and controls the degree distribution of the selected proteins to construct the negative dataset. NIP-RW performs a random walk on the PPI network to update the adjacency matrix of the network, and then selects protein pairs that are not connected in the updated network as negative samples. Next, we use the auto covariance (AC) descriptor to encode the feature information of amino acid sequences. After that, we employ deep neural networks (DNNs) to predict PPIs based on the extracted features and the positive and negative examples. Extensive experiments show that NIP-SS and NIP-RW generate negative samples of higher quality than existing strategies and thus enable more accurate prediction.
Conclusions: The experimental results show that negative datasets constructed by NIP-SS and NIP-RW reduce the bias and generalize well. NIP-SS and NIP-RW can be used as a plugin to boost the effectiveness of PPI prediction. Code and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NIP.
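The random-walk idea behind NIP-RW can be illustrated with a small sketch. The abstract does not give the update rule, the number of walk steps, or any selection threshold, so everything below is an assumption made for illustration: a plain k-step walk over a row-normalised adjacency matrix, with a pair treated as a negative candidate when no walk of length 1 to k links its two proteins. The names `matmul`, `random_walk_negatives`, and `steps` are hypothetical and are not taken from the authors' released code.

```python
from itertools import combinations


def matmul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def random_walk_negatives(proteins, interactions, steps=2):
    """Return protein pairs not linked by any walk of length 1..steps.

    Assumption: a plain k-step random walk; the paper's actual update
    rule for the adjacency matrix may differ.
    """
    index = {p: i for i, p in enumerate(proteins)}
    n = len(proteins)

    # Symmetric 0/1 adjacency matrix built from the known PPIs.
    adj = [[0.0] * n for _ in range(n)]
    for a, b in interactions:
        i, j = index[a], index[b]
        adj[i][j] = adj[j][i] = 1.0

    # Row-normalised transition matrix of the walk.
    trans = []
    for row in adj:
        degree = sum(row)
        trans.append([v / degree if degree else 0.0 for v in row])

    # "Updated" adjacency: accumulate walk probabilities of every length.
    updated = [row[:] for row in adj]
    walk = [row[:] for row in trans]  # probabilities after one step
    for _ in range(steps - 1):
        walk = matmul(walk, trans)
        for i in range(n):
            for j in range(n):
                updated[i][j] += walk[i][j]

    # Pairs that are still unconnected become negative candidates.
    return [(proteins[i], proteins[j])
            for i, j in combinations(range(n), 2)
            if updated[i][j] == 0.0 and updated[j][i] == 0.0]


if __name__ == "__main__":
    # Toy chain network A-B-C-D-E (made-up data, not from the paper).
    names = ["A", "B", "C", "D", "E"]
    chain = [("A", "B"), ("B", "C"), ("C", "D"), ("D", "E")]
    print(random_walk_negatives(names, chain, steps=2))
```

In this sketch, raising `steps` makes the selection stricter: only proteins that are far apart in the network remain as negative candidates, which is the intuition behind preferring such pairs over randomly drawn ones.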
Number of pages: 20