Understanding Negative Sampling in Graph Representation Learning

被引：106

作者：

Yang, Zhen ^{[1
]}

Ding, Ming ^{[1
]}

Zhou, Chang ^{[2
]}

Yang, Hongxia ^{[2
]}

Zhou, Jingren ^{[2
]}

Tang, Jie ^{[1
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China

[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China

来源：

KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2020年

关键词：

Negative Sampling; Graph Representation Learning; Network Embedding;

D O I：

10.1145/3394486.3403218

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Graph representation learning has been extensively studied in recent years, in which sampling is a critical point. Prior arts usually focus on sampling positive node pairs, while the strategy for negative sampling is left insufficiently explored. To bridge the gap, we systematically analyze the role of negative sampling from the perspectives of both objective and risk, theoretically demonstrating that negative sampling is as important as positive sampling in determining the optimization objective and the resulted variance. To the best of our knowledge, we are the first to derive the theory and quantify that a nice negative sampling distribution is p(n) (u vertical bar v) proportional to p(d) (u vertical bar v)(alpha), 0 < alpha < 1. With the guidance of the theory, we propose MCNS, approximating the positive distribution with self-contrast approximation and accelerating negative sampling by Metropolis-Hastings. We evaluate our method on 5 datasets that cover extensive downstream graph learning tasks, including link prediction, node classification and recommendation, on a total of 19 experimental settings. These relatively comprehensive experimental results demonstrate its robustness and superiorities.

引用

页码：1666 / 1676

页数：11

共 50 条

[1] AdaNS: Adaptive negative sampling for unsupervised graph representation learning
Wang, Yu
Hu, Liang
Gao, Wanfu
Cao, Xiaofeng
Chang, Yi
PATTERN RECOGNITION, 2023, 136
[2] Negative sampling strategy based on multi-hop neighbors for graph representation learning
Zhang, Kaiyu
Sang, Guoming
Cheng, Junkai
Liu, Zhi
Zhang, Yijia
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
[3] ConCur: Self-supervised graph representation based on contrastive learning with curriculum negative sampling
Yan, Rong
Bao, Peng
NEUROCOMPUTING, 2023, 551
[4] Probing Negative Sampling for Contrastive Learning to Learn Graph Representations
Chen, Shiyi
Wang, Ziao
Zhang, Xinni
Zhang, Xiaofeng
Peng, Dan
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 434 - 449
[5] A novel negative sampling based on TFIDF for learning word representation
Qin, Pengda
Xu, Weiran
Guo, Jun
NEUROCOMPUTING, 2016, 177 : 257 - 265
[6] Context-aware Sampling of Large Networks via Graph Representation Learning
Zhou, Zhiguang
Shi, Chen
Shen, Xilong
Cai, Lihong
Wang, Haoxuan
Liu, Yuhua
Zhao, Ying
Chen, Wei
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1709 - 1719
[7] HeteroSample: Meta-Path Guided Sampling for Heterogeneous Graph Representation Learning
Liu, Ao
Chen, Jing
Du, Ruiying
Wu, Cong
Feng, Yebo
Li, Teng
Ma, Jianfeng
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4390 - 4402
[8] Effects of Negative Sampling on Knowledge Graph Completion
Bayrak, Betul
2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 264 - 267
[9] Reinforced Negative Sampling for Knowledge Graph Embedding
Xie, Yushun
Wang, Haiyan
Wang, Le
Luo, Lei
Lie, Jianxin
Gu, Zhaoquan
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT IV, 2024, 14853 : 358 - 374
[10] Graph representation learning with encoding edges
Li, Qi
Cao, Zehong
Zhong, Jiang
Li, Qing
NEUROCOMPUTING, 2019, 361 : 29 - 39

← 1 2 3 4 5 →