SAug: Structural Imbalance Aware Augmentation for Graph Neural Networks

被引:0
作者
Chen, Ke-Jia [1 ,2 ,3 ]
Mu, Wenhui [2 ]
Liu, Zulong [2 ]
Liu, Zheng [2 ]
机构
[1] Nanjing Univ Posts & Telecommun, Jiangsu Key Lab Big Data Secur & Intelligent Proc, Nanjing, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci & Technol, Nanjing, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph Neural Networks; Graph Representation Learning; Structural Imbalance; Data Augmentation;
D O I
10.1145/3712699
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph machine learning (GML) has made great progress in node classification, link prediction, graph classification, and so on. However, graphs in reality are often structurally imbalanced, that is, only a few hub nodes have a denser local structure and higher influence. The imbalance may compromise the robustness of existing GML models, especially in learning tail nodes. This article proposes a selective graph augmentation method to solve this problem. Firstly, a Pagerank-based sampling strategy is designed to identify hub nodes and tail nodes in the graph. Secondly, a selective augmentation strategy is proposed, which drops the noise neighbors of hub nodes on one side, and discovers the latent neighbors and generates pseudo neighbors for tail nodes on the other side. Also, it can alleviate the structural imbalance between two types of nodes. Finally, a GNN model is retrained on the augmented graph. Extensive experiments demonstrate that the proposed method can significantly improve the backbone GNNs and achieve superior performance to its competitors of graph augmentation methods and hub/tail aware methods.
引用
收藏
页数:22
相关论文
共 48 条
[21]  
Kingma DP, 2014, Arxiv, DOI [arXiv:1312.6114, 10.48550/arXiv.1312.6114]
[22]  
Page L., 1999, The pagerank citation ranking: Bringing order to the web
[23]  
Pei HB, 2020, Arxiv, DOI arXiv:2002.05287
[24]   DeepWalk: Online Learning of Social Representations [J].
Perozzi, Bryan ;
Al-Rfou, Rami ;
Skiena, Steven .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :701-710
[25]  
Qu L, 2021, Arxiv, DOI arXiv:2106.02817
[26]  
Rong Y, 2020, INT C LEARNING REPRE, DOI [10.48550/arXiv.1907.10903, DOI 10.48550/ARXIV.1907.10903]
[27]   Multi-Scale attributed node embedding [J].
Rozemberczki, Benedek ;
Allen, Carl ;
Sarkar, Rik .
JOURNAL OF COMPLEX NETWORKS, 2021, 9 (02) :1-22
[28]   NPI-GNN: Predicting ncRNA-protein interactions with deep graph neural networks [J].
Shen, Zi-Ang ;
Luo, Tao ;
Zhou, Yuan-Ke ;
Yu, Han ;
Du, Pu-Feng .
BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
[29]  
Sun ZQ, 2020, AAAI CONF ARTIF INTE, V34, P222
[30]   Deep Representation Learning for Social Network Analysis [J].
Tan, Qiaoyu ;
Liu, Ninghao ;
Hu, Xia .
FRONTIERS IN BIG DATA, 2019, 2