Self-Supervised Learning on Graphs: Contrastive, Generative, or Predictive

被引:140
作者
Wu, Lirong [1 ]
Lin, Haitao [1 ]
Tan, Cheng [1 ]
Gao, Zhangyang [1 ]
Li, Stan Z. [1 ]
机构
[1] Westlake Univ, Sch Engn, AI Lab, Hangzhou 310000, Peoples R China
关键词
Deep learning; self-supervised learning; graph neural networks; unsupervised learning; survey; CONVOLUTIONAL NETWORKS;
D O I
10.1109/TKDE.2021.3131584
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning on graphs has recently achieved remarkable success on a variety of tasks, while such success relies heavily on the massive and carefully labeled data. However, precise annotations are generally very expensive and time-consuming. To address this problem, self-supervised learning (SSL) is emerging as a new paradigm for extracting informative knowledge through well-designed pretext tasks without relying on manual labels. In this survey, we extend the concept of SSL, which first emerged in the fields of computer vision and natural language processing, to present a timely and comprehensive review of existing SSL techniques for graph data. Specifically, we divide existing graph SSL methods into three categories: contrastive, generative, and predictive. More importantly, unlike other surveys that only provide a high-level description of published research, we present an additional mathematical summary of existing works in a unified framework. Furthermore, to facilitate methodological development and empirical comparisons, we also summarize the commonly used datasets, evaluation metrics, downstream tasks, open-source implementations, and experimental study of various algorithms. Finally, we discuss the technical challenges and potential future directions for improving graph self-supervised learning. Latest advances in graph SSL are summarized in a GitHub repository https://github.com/LirongWu/awesome-graph-self-supervised-learning.
引用
收藏
页码:4216 / 4235
页数:20
相关论文
共 90 条
[1]  
Belghazi MI, 2018, PR MACH LEARN RES, V80
[2]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[3]  
BONACICH P, 1987, AM J SOCIOL, V92, P1170, DOI 10.1086/228631
[4]   Bipartite Graph Embedding via Mutual Information Maximization [J].
Cao, Jiangxia ;
Lin, Xixun ;
Guo, Shu ;
Liu, Luchen ;
Liu, Tingwen ;
Wang, Bin .
WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, :635-643
[5]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[6]  
Chen T, 2020, PR MACH LEARN RES, V119
[7]   Beyond triplet loss: a deep quadruplet network for person re-identification [J].
Chen, Weihua ;
Chen, Xiaotang ;
Zhang, Jianguo ;
Huang, Kaiqi .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1320-1329
[8]  
Chuang C.-Y., 2020, ADV NEURAL INFORM PR
[9]  
Devlin J, 2019, Arxiv, DOI [arXiv:1810.04805, 10.48550/arXiv.1810.04805]
[10]   Distinguishing enzyme structures from non-enzymes without alignments [J].
Dobson, PD ;
Doig, AJ .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 330 (04) :771-783