Predicting protein-protein interactions using high-quality non-interacting pairs

被引:24
|
作者
Zhang, Long [1 ]
Yu, Guoxian [1 ]
Guo, Maozu [2 ,3 ]
Wang, Jun [1 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
[3] Beijing Key Lab Intelligent Proc Bldg Big Data, Beijing, Peoples R China
来源
BMC BIOINFORMATICS | 2018年 / 19卷
关键词
Protein-protein interactions; Non-interacting proteins; Deep neural networks; Sequence similarity; Random walk; HYDROPHOBICITY; NETWORKS; GENOME;
D O I
10.1186/s12859-018-2525-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundIdentifying protein-protein interactions (PPIs) is of paramount importance for understanding cellular processes. Machine learning-based approaches have been developed to predict PPIs, but the effectiveness of these approaches is unsatisfactory. One major reason is that they randomly choose non-interacting protein pairs (negative samples) or heuristically select non-interacting pairs with low quality.ResultsTo boost the effectiveness of predicting PPIs, we propose two novel approaches (NIP-SS and NIP-RW) to generate high quality non-interacting pairs based on sequence similarity and random walk, respectively. Specifically, the known PPIs collected from public databases are used to generate the positive samples. NIP-SS then selects the top-m dissimilar protein pairs as negative examples and controls the degree distribution of selected proteins to construct the negative dataset. NIP-RW performs random walk on the PPI network to update the adjacency matrix of the network, and then selects protein pairs not connected in the updated network as negative samples. Next, we use auto covariance (AC) descriptor to encode the feature information of amino acid sequences. After that, we employ deep neural networks (DNNs) to predict PPIs based on extracted features, positive and negative examples. Extensive experiments show that NIP-SS and NIP-RW can generate negative samples with higher quality than existing strategies and thus enable more accurate prediction.ConclusionsThe experimental results prove that negative datasets constructed by NIP-SS and NIP-RW can reduce the bias and have good generalization ability. NIP-SS and NIP-RW can be used as a plugin to boost the effectiveness of PPIs prediction. Codes and datasets are available at http://mlda.swu.edu.cn/codes.php?name=NIP.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Predicting protein-protein interactions from one feature using SVM
    Chung, Y
    Kim, GM
    Hwang, YS
    Park, H
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2004, 3029 : 50 - 55
  • [42] Modeling Protein-Protein Interface Interactions as a Means for Predicting Protein-Protein Interaction Partners
    Reyes, Vicente M.
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2009, 26 (06): : 873 - 873
  • [43] Predicting protein-protein interactions from protein domains using a set cover approach
    Huang, Chengbang
    Morcos, Faruck
    Kanaan, Simon P.
    Wuchty, Stefan
    Chen, Danny Z.
    Izaguirre, Jesus A.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (01) : 78 - 87
  • [44] Computational approaches for predicting protein-protein interactions: A survey
    Yu J.
    Fotouhi F.
    Journal of Medical Systems, 2006, 30 (1) : 39 - 44
  • [45] Predicting protein-protein interactions by a supervised learning classifier
    Huang, Y
    Frishman, D
    Muchnik, I
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2004, 28 (04) : 291 - 301
  • [46] Machine learning solutions for predicting protein-protein interactions
    Casadio, Rita
    Martelli, Pier Luigi
    Savojardo, Castrense
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2022, 12 (06)
  • [47] Predicting Protein-Protein Interactions based on ensemble classifiers
    Zhou, Zheng-Rong
    Song, Xiao-Feng
    Wang, Ming-Hao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1464 - 1467
  • [48] Recent advances in predicting and modeling protein-protein interactions
    Durham, Jesse
    Zhang, Jing
    Humphreys, Ian R.
    Pei, Jimin
    Cong, Qian
    TRENDS IN BIOCHEMICAL SCIENCES, 2023, 48 (06) : 527 - 538
  • [49] A survey on computational models for predicting protein-protein interactions
    Hu, Lun
    Wang, Xiaojuan
    Huang, Yu-An
    Hu, Pengwei
    You, Zhu-Hong
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [50] Sequence Representations and Their Utility for Predicting Protein-Protein Interactions
    Kimothi, Dhananjay
    Biyani, Pravesh
    Hogan, James M.
    Davis, Melissa J.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 646 - 657