A novel link prediction algorithm for protein-protein interaction networks by attributed graph embedding

被引:90
作者
Nasiri, Elahe [1 ]
Berahmand, Kamal [2 ]
Rostami, Mehrdad [3 ]
Dabiri, Mohammad [4 ]
机构
[1] Azarbaijan Shahid Madani Univ, Dept Informat Technol & Commun, Tabriz, Iran
[2] Queensland Univ Technol, Sch Comp Sci, Dept Sci & Engn, Brisbane, Qld, Australia
[3] Univ Kurdistan, Dept Comp Engn, Sanandaj, Iran
[4] Univ Kurdistan, Dept Plant Biotechnol, Sanandaj, Iran
关键词
Protein-protein interaction network; Link prediction; Graph embedding; Random walk; Feature selection; SEQUENCE;
D O I
10.1016/j.compbiomed.2021.104772
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The prediction of interactions in protein networks is very critical in various biological processes. In recent years, scientists have focused on computational approaches to predict the interactions of proteins. In protein-protein interaction (PPI) networks, each protein is accompanied by various features, including amino acid sequence, subcellular location, and protein domains. Embedding-based methods have been widely applied for many network analysis tasks, such as link prediction. The Deepwalk algorithm is one of the most popular graph embedding methods that capture the network structure using pure random walking. Here in this paper, we treat the protein-protein interaction prediction problem as a link prediction in attributed networks, and we use an attributed embedding approach to predict the interactions between proteins in the PPI network. In particular, the present paper seeks to present a modified version of Deepwalk based on feature selection for solving link prediction in the protein-protein interaction, which will benefit both network structure and protein features. More specifically the feature selection step consists of two distinct parts. First, a set of relevant features are selected from the original feature set, such that the dimensionality of features is reduced. Second, in the selected set of features, each feature is assigned with a weight based on its significance and therefore the contribution of each feature is distinguished from others. In this method, the new random walk model for link prediction will be introduced by integrating network structure and protein features, based on the assumption that two nodes on the network will be linked since they are nearby in the network. In order to justify the proposal, the authors carry out many experiments on protein-protein interaction networks for comparison with the state-of-the-art network embedding methods. The experimental results from the graphs indicate that our proposed approach is more capable compared to other link prediction approaches and increases the accuracy of prediction.
引用
收藏
页数:11
相关论文
共 76 条
  • [1] Friendship Prediction and Homophily in Social Media
    Aiello, Luca Maria
    Barrat, Alain
    Schifanella, Rossano
    Cattuto, Ciro
    Markines, Benjamin
    Menczer, Filippo
    [J]. ACM TRANSACTIONS ON THE WEB, 2012, 6 (02)
  • [2] A systematic survey of centrality measures for protein-protein interaction networks
    Ashtiani, Minoo
    Salehzadeh-Yazdi, Ali
    Razaghi-Moghadam, Zahra
    Hennig, Holger
    Wolkenhauer, Olaf
    Mirzaie, Mehdi
    Jafari, Mohieddin
    [J]. BMC SYSTEMS BIOLOGY, 2018, 12
  • [3] Bandyopadhyay S, 2018, ARXIV PREPRINT ARXIV
  • [4] Berahmand K, J KING SAUD UNIV-COM, P2021
  • [5] A new attributed graph clustering by using label propagation in complex networks
    Berahmand, Kamal
    Haghani, Sogol
    Rostami, Mehrdad
    Li, Yuefeng
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (05) : 1869 - 1883
  • [6] Effect of rich-club on diffusion in complex networks
    Berahmand, Kamal
    Samadi, Negin
    Sheikholeslami, Seyed Mahmood
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS B, 2018, 32 (12):
  • [7] Clustering and Summarizing Protein-Protein Interaction Networks: A Survey
    Bhowmick, Sourav S.
    Seah, Boon Siew
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (03) : 638 - 658
  • [8] Community-based link prediction
    Biswas, Anupam
    Biswas, Bhaskar
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (18) : 18619 - 18639
  • [9] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [10] Geometric Deep Learning Going beyond Euclidean data
    Bronstein, Michael M.
    Bruna, Joan
    LeCun, Yann
    Szlam, Arthur
    Vandergheynst, Pierre
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (04) : 18 - 42