Positive-Unlabeled Learning for Network Link Prediction

Cited by: 5
Authors
Gan, Shengfeng [1]
Alshahrani, Mohammed [2]
Liu, Shichao [3]
Affiliations
[1] Hubei Univ Educ, Coll Comp, Wuhan 430205, Peoples R China
[2] Albaha Univ, Coll Comp Sci & IT, Albaha 65515, Saudi Arabia
[3] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
network link prediction; positive-unlabeled learning; network representation learning; supervised classification; CLASSIFICATION; SVM;
DOI
10.3390/math10183345
Chinese Library Classification
O1 [Mathematics]
Subject classification codes
0701; 070101
Abstract
Link prediction is an important problem in network data mining that aims to predict potential relationships between nodes in a network. Link prediction based on supervised classification is normally trained on a dataset consisting of a set of positive samples and a set of negative samples. However, well-labeled training datasets with both positive and negative annotations are often inadequate in real-world scenarios, and the datasets contain a large number of unlabeled samples that may hinder the performance of the model. To address this problem, we propose a positive-unlabeled learning framework with network representation for network link prediction that uses only positive and unlabeled samples. We first learn representation vectors of nodes using a network representation method. Next, we concatenate the representation vectors of node pairs and feed them into different classifiers to predict whether a link exists. To alleviate data imbalance and improve prediction precision, we adopt three types of positive-unlabeled (PU) learning strategies: traditional classifier estimation, a bagging strategy, and reliable negative sampling. We conduct experiments on three datasets to compare the different PU learning methods and discuss their influence on the prediction results. The experimental results demonstrate that PU learning has a positive impact on predictive performance and that the size of the improvement varies with network structure.
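
The pipeline described in the abstract (concatenate node-pair embeddings, then apply a PU strategy) can be illustrated with a minimal sketch of the bagging idea. Everything below is an assumption made for illustration and is not taken from the paper: the toy 2-D embeddings, the function names, the subsampling without replacement, and the nearest-centroid score that stands in for whatever base classifier the authors actually use.

```python
import random


def pair_features(emb, u, v):
    """Concatenate the embedding vectors of nodes u and v (list concatenation)."""
    return emb[u] + emb[v]


def centroid(rows):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def pu_bagging_scores(positives, unlabeled, n_rounds=50, seed=0):
    """Average out-of-bag score per unlabeled sample (higher = more link-like).

    Each round treats a random subsample of the unlabeled set as negatives,
    fits a stand-in base learner (nearest centroid), and scores only the
    unlabeled samples that were left out of that round's subsample.
    """
    rng = random.Random(seed)
    k = min(len(positives), len(unlabeled) - 1)  # leave at least one sample out
    totals = [0.0] * len(unlabeled)
    counts = [0] * len(unlabeled)
    pos_c = centroid(positives)
    for _ in range(n_rounds):
        bag = set(rng.sample(range(len(unlabeled)), k))
        neg_c = centroid([unlabeled[i] for i in bag])
        for i, x in enumerate(unlabeled):
            if i not in bag:
                # Positive when x is closer to the positive centroid.
                totals[i] += sq_dist(x, neg_c) - sq_dist(x, pos_c)
                counts[i] += 1
    return [t / c if c else 0.0 for t, c in zip(totals, counts)]


# Hypothetical toy embeddings: two well-separated groups of nodes.
emb = {
    "a": [1.0, 1.0], "b": [1.1, 0.9], "c": [0.9, 1.1], "d": [1.0, 1.2],
    "x": [-1.0, -1.0], "y": [-1.1, -0.9], "z": [-0.9, -1.1],
}
known_links = [("a", "b"), ("a", "c"), ("b", "c")]
candidate_pairs = [("c", "d"), ("a", "x"), ("b", "y"), ("c", "z"), ("d", "x")]

positives = [pair_features(emb, u, v) for u, v in known_links]
unlabeled = [pair_features(emb, u, v) for u, v in candidate_pairs]
scores = pu_bagging_scores(positives, unlabeled)

# Print candidate pairs from most to least link-like.
for pair, score in sorted(zip(candidate_pairs, scores), key=lambda t: -t[1]):
    print(pair, round(score, 3))
```

In this toy setup the within-group pair ("c", "d") should rank above the cross-group pairs, because its concatenated vector lies close to the known links. A real experiment would replace the centroid score with a trained classifier and the hand-written embeddings with vectors learned by a network representation method.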
Pages: 13
Related papers
50 in total
  • [21] A Novel Weakly Supervised Problem: Learning from Positive-Unlabeled Proportions
    Hernandez-Gonzalez, Jeronimo
    Inza, Inaki
    Lozano, Jose A.
    ADVANCES IN ARTIFICIAL INTELLIGENCE (CAEPIA 2015), 2015, 9422 : 3 - 13
  • [22] Intrusion Detection based on Non-negative Positive-unlabeled Learning
    Lv, Sicai
    Liu, Yang
    Liu, Zhiyao
    Chao, Wang
    Wu, Chenrui
    Wang, Bailing
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 1015 - 1020
  • [23] An Integrated Framework of Positive-Unlabeled and Imbalanced Learning for Landslide Susceptibility Mapping
    Fu, Zijin
    Ma, Hao
    Wang, Fawu
    Dou, Jie
    Zhang, Bo
    Fang, Zhice
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 15596 - 15611
  • [24] Computational Identification of Lysine Glutarylation Sites Using Positive-Unlabeled Learning
    Ju, Zhe
    Wang, Shi-Yun
    CURRENT GENOMICS, 2020, 21 (03) : 204 - 211
  • [25] Predicting drug-target interaction using positive-unlabeled learning
    Lan, Wei
    Wang, Jianxin
    Li, Min
    Liu, Jin
    Li, Yaohang
    Wu, Fang-Xiang
    Pan, Yi
    NEUROCOMPUTING, 2016, 206 : 50 - 57
  • [26] Risk Bounds for Positive-Unlabeled Learning Under the Selected At Random Assumption
    Coudray, Olivier
    Keribin, Christine
    Massart, Pascal
    Pamphile, Patrick
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [27] Estimating classification accuracy in positive-unlabeled learning: characterization and correction strategies
    Ramola, Rashika
    Jain, Shantanu
    Radivojac, Predrag
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2019, 2019, : 124 - 135
  • [28] Positive-Unlabeled Learning for Cell Detection in Histopathology Images with Incomplete Annotations
    Zhao, Zipei
    Pang, Fengqian
    Liu, Zhiwen
    Ye, Chuyang
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT VIII, 2021, 12908 : 509 - 518
  • [29] An optimized positive-unlabeled learning method for detecting a large scale of malware variants
    Zhang, Jixin
    Khan, Mohammad Faham
    Lin, Xiaodong
    Qin, Zheng
    2019 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (DSC), 2019, : 182 - 189
  • [30] Predicting HIV-1 Protease Cleavage Sites With Positive-Unlabeled Learning
    Li, Zhenfeng
    Hu, Lun
    Tang, Zehai
    Zhao, Cheng
    FRONTIERS IN GENETICS, 2021, 12