Positive-Unlabeled Learning for Network Link Prediction

被引：5

作者：

Gan, Shengfeng ^{[1
]}

Alshahrani, Mohammed ^{[2
]}

Liu, Shichao ^{[3
]}

机构：

[1] Hubei Univ Educ, Coll Comp, Wuhan 430205, Peoples R China

[2] Albaha Univ, Coll Comp Sci & IT, Albaha 65515, Saudi Arabia

[3] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Peoples R China

来源：

MATHEMATICS | 2022年 / 10卷 / 18期

基金：

中国国家自然科学基金;

关键词：

network link prediction; positive-unlabeled learning; network representation learning; supervised classification; CLASSIFICATION; SVM;

D O I：

10.3390/math10183345

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Link prediction is an important problem in network data mining, which is dedicated to predicting the potential relationship between nodes in the network. Normally, network link prediction based on supervised classification will be trained on a dataset consisting of a set of positive samples and a set of negative samples. However, well-labeled training datasets with positive and negative annotations are always inadequate in real-world scenarios, and the datasets contain a large number of unlabeled samples that may hinder the performance of the model. To address this problem, we propose a positive-unlabeled learning framework with network representation for network link prediction only using positive samples and unlabeled samples. We first learn representation vectors of nodes using a network representation method. Next, we concatenate representation vectors of node pairs and then feed them into different classifiers to predict whether the link exists or not. To alleviate data imbalance and enhance the prediction precision, we adopt three types of positive-unlabeled (PU) learning strategies to improve the prediction performance using traditional classifier estimation, bagging strategy and reliable negative sampling. We conduct experiments on three datasets to compare different PU learning methods and discuss their influence on the prediction results. The experimental results demonstrate that PU learning has a positive impact on predictive performances and the promotion effects vary with different network structures.

引用

页数：13

共 50 条

[41] Leveraging Positive-Unlabeled Learning for Enhanced Black Spot Accident Identification on Greek Road Networks [J].

Sevetlidis, Vasileios ;

Pavlidis, George ;

Mouroutsos, Spyridon G. ;

Gasteratos, Antonios .

COMPUTERS, 2024, 13 (02)

[42] Biological Network Derivation by Positive Unlabeled Learning Algorithms [J].

Pancaroglu, Doruk ;

Tan, Mehmet .

CURRENT BIOINFORMATICS, 2016, 11 (05) :531-536

[43] GA-Auto-PU: A Genetic Algorithm-based Automated Machine Learning System for Positive-Unlabeled Learning [J].

Saunders, Jack D. ;

Freitas, Alex A. .

PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, :288-291

[44] GKF-PUAL: A group kernel-free approach to positive-unlabeled learning with variable selection [J].

Wang, Xiaoke ;

Zhu, Rui ;

Xue, Jing-Hao .

INFORMATION SCIENCES, 2025, 690

[45] Crystal synthesizability prediction using contrastive positive unlabeled learning [J].

Sun, Tao ;

Yuan, Jianmei .

COMPUTER PHYSICS COMMUNICATIONS, 2025, 308

[46] Improving Positive Unlabeled Learning Algorithms for Protein Interaction Prediction [J].

Pancaroglu, Doruk ;

Tan, Mehmet .

8TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS (PACBB 2014), 2014, 294 :81-88

[47] High-fidelity positive-unlabeled deep learning for semi-supervised fault detection of chemical processes [J].

Zheng, Shaodong ;

Zhao, Jinsong .

PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2022, 165 :191-204

[48] False positive rate control for positive unlabeled learning [J].

Kong, Shuchen ;

Shen, Weiwei ;

Zheng, Yingbin ;

Zhang, Ao ;

Pu, Jian ;

Wang, Jun .

NEUROCOMPUTING, 2019, 367 :13-19

[49] Predicting disease-associated circular RNAs using deep forests combined with positive-unlabeled learning methods [J].

Zeng, Xiangxiang ;

Zhong, Yue ;

Lin, Wei ;

Zou, Quan .

BRIEFINGS IN BIOINFORMATICS, 2020, 21 (04) :1425-1436

[50] Enhancing landslide susceptibility mapping using a positive-unlabeled machine learning approach: a case study in Chamoli, India [J].

Zhang, Danrong ;

Jindal, Dipali ;

Roy, Nimisha ;

Vangla, Prashanth ;

Frost, J. David .

GEOENVIRONMENTAL DISASTERS, 2024, 11 (01)

← 1 2 3 4 5 →