Understanding Negative Sampling in Graph Representation Learning

被引:106
|
作者
Yang, Zhen [1 ]
Ding, Ming [1 ]
Zhou, Chang [2 ]
Yang, Hongxia [2 ]
Zhou, Jingren [2 ]
Tang, Jie [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
来源
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2020年
关键词
Negative Sampling; Graph Representation Learning; Network Embedding;
D O I
10.1145/3394486.3403218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph representation learning has been extensively studied in recent years, in which sampling is a critical point. Prior arts usually focus on sampling positive node pairs, while the strategy for negative sampling is left insufficiently explored. To bridge the gap, we systematically analyze the role of negative sampling from the perspectives of both objective and risk, theoretically demonstrating that negative sampling is as important as positive sampling in determining the optimization objective and the resulted variance. To the best of our knowledge, we are the first to derive the theory and quantify that a nice negative sampling distribution is p(n) (u vertical bar v) proportional to p(d) (u vertical bar v)(alpha), 0 < alpha < 1. With the guidance of the theory, we propose MCNS, approximating the positive distribution with self-contrast approximation and accelerating negative sampling by Metropolis-Hastings. We evaluate our method on 5 datasets that cover extensive downstream graph learning tasks, including link prediction, node classification and recommendation, on a total of 19 experimental settings. These relatively comprehensive experimental results demonstrate its robustness and superiorities.
引用
收藏
页码:1666 / 1676
页数:11
相关论文
共 50 条
  • [1] AdaNS: Adaptive negative sampling for unsupervised graph representation learning
    Wang, Yu
    Hu, Liang
    Gao, Wanfu
    Cao, Xiaofeng
    Chang, Yi
    PATTERN RECOGNITION, 2023, 136
  • [2] Negative sampling strategy based on multi-hop neighbors for graph representation learning
    Zhang, Kaiyu
    Sang, Guoming
    Cheng, Junkai
    Liu, Zhi
    Zhang, Yijia
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [3] ConCur: Self-supervised graph representation based on contrastive learning with curriculum negative sampling
    Yan, Rong
    Bao, Peng
    NEUROCOMPUTING, 2023, 551
  • [4] Probing Negative Sampling for Contrastive Learning to Learn Graph Representations
    Chen, Shiyi
    Wang, Ziao
    Zhang, Xinni
    Zhang, Xiaofeng
    Peng, Dan
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT II, 2021, 12976 : 434 - 449
  • [5] A novel negative sampling based on TFIDF for learning word representation
    Qin, Pengda
    Xu, Weiran
    Guo, Jun
    NEUROCOMPUTING, 2016, 177 : 257 - 265
  • [6] Context-aware Sampling of Large Networks via Graph Representation Learning
    Zhou, Zhiguang
    Shi, Chen
    Shen, Xilong
    Cai, Lihong
    Wang, Haoxuan
    Liu, Yuhua
    Zhao, Ying
    Chen, Wei
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) : 1709 - 1719
  • [7] HeteroSample: Meta-Path Guided Sampling for Heterogeneous Graph Representation Learning
    Liu, Ao
    Chen, Jing
    Du, Ruiying
    Wu, Cong
    Feng, Yebo
    Li, Teng
    Ma, Jianfeng
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4390 - 4402
  • [8] Effects of Negative Sampling on Knowledge Graph Completion
    Bayrak, Betul
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 264 - 267
  • [9] Reinforced Negative Sampling for Knowledge Graph Embedding
    Xie, Yushun
    Wang, Haiyan
    Wang, Le
    Luo, Lei
    Lie, Jianxin
    Gu, Zhaoquan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT IV, 2024, 14853 : 358 - 374
  • [10] Graph representation learning with encoding edges
    Li, Qi
    Cao, Zehong
    Zhong, Jiang
    Li, Qing
    NEUROCOMPUTING, 2019, 361 : 29 - 39