Enhancing link prediction through node embedding and ensemble learning

被引:0
作者
Chen, Zhongyuan [1 ]
Wang, Yongji [2 ]
机构
[1] Guangxi Univ Nationalities, Xiangsihu Coll, Acad Affairs Off, Nanning 530225, Guangxi, Peoples R China
[2] Guangxi Univ Nationalities, Xiangsihu Coll, Sch Art & Design, Nanning 530225, Guangxi, Peoples R China
关键词
Complex networks; Social networks; Link prediction; Node2vec embedding; XGBoost classifier;
D O I
10.1007/s10115-024-02203-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social networks, characterized by their dynamic and continually evolving nature, present challenges for effective link prediction (LP) due to the constant addition of nodes and connections. In response to this, we propose a novel approach to LP in social networks through Node Embedding and Ensemble Learning (LP-NEEL). Our method constructs a transition matrix from the network's adjacency matrix and computes similarity measures between node pairs. Utilizing node2vec embedding, we extract features from nodes and generate edge embeddings by computing the inner product of node embeddings for each edge. This process yields a well-labeled dataset suitable for LP tasks. To mitigate overfitting, we balance the dataset by ensuring an equal number of negative and positive samples edge samples during both the testing and training phases. Leveraging this balanced dataset, we employ the XGBoost machine learning algorithm for final link prediction. Extensive experimentation across six social network datasets validates the efficacy of our approach, demonstrating improved predictive performance compared to existing methods.
引用
收藏
页码:7697 / 7715
页数:19
相关论文
共 43 条
  • [1] A differential machine learning approach for trust prediction in signed social networks
    Abadeh, Maryam Nooraei
    Mirzaie, Mansooreh
    [J]. JOURNAL OF SUPERCOMPUTING, 2023, 79 (09) : 9443 - 9466
  • [2] Al Hasan M, 2011, SOCIAL NETWORK DATA ANALYTICS, P243
  • [3] A comprehensive survey of link prediction methods
    Arrar, Djihad
    Kamel, Nadjet
    Lakhfif, Abdelaziz
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (03) : 3902 - 3942
  • [4] An efficient recommendation generation using relevant Jaccard similarity
    Bag, Sujoy
    Kumar, Sri Krishna
    Tiwari, Manoj Kumar
    [J]. INFORMATION SCIENCES, 2019, 483 : 53 - 64
  • [5] Autoencoders and their applications in machine learning: a survey
    Berahmand, Kamal
    Daneshfar, Fatemeh
    Salehi, Elaheh Sadat
    Li, Yuefeng
    Xu, Yue
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (02)
  • [6] WSNMF: Weighted Symmetric Nonnegative Matrix Factorization for attributed graph clustering
    Berahmand, Kamal
    Mohammadi, Mehrnoush
    Sheikhpour, Razieh
    Li, Yuefeng
    Xu, Yue
    [J]. NEUROCOMPUTING, 2024, 566
  • [7] A Deep Semi-Supervised Community Detection Based on Point-Wise Mutual Information
    Berahmand, Kamal
    Li, Yuefeng
    Xu, Yue
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03) : 3444 - 3456
  • [8] A modified DeepWalk method for link prediction in attributed social network
    Berahmand, Kamal
    Nasiri, Elahe
    Rostami, Mehrdad
    Forouzandeh, Saman
    [J]. COMPUTING, 2021, 103 (10) : 2227 - 2249
  • [9] Identifying influential nodes based on new layer metrics and layer weighting in multiplex networks
    Bouyer, Asgarali
    Mohammadi, Moslem
    Arasteh, Bahman
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (02) : 1011 - 1035
  • [10] webTWAS: a resource for disease candidate susceptibility genes identified by transcriptome-wide association study
    Cao, Chen
    Wang, Jianhua
    Kwok, Devin
    Cui, Feifei
    Zhang, Zilong
    Zhao, Da
    Li, Mulin Jun
    Zou, Quan
    [J]. NUCLEIC ACIDS RESEARCH, 2022, 50 (D1) : D1123 - D1130