Heterogeneous Graph Neural Network

被引:1063
作者
Zhang, Chuxu [1 ]
Song, Dongjin [2 ]
Huang, Chao [1 ,3 ]
Swami, Ananthram [4 ]
Chawla, Nitesh V. [1 ]
机构
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] NEC Labs Amer Inc, Princeton, NJ USA
[3] JD Digits, Beijing, Peoples R China
[4] US Army, Res Lab, Adelphi, MD USA
来源
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2019年
基金
美国国家科学基金会;
关键词
Heterogeneous graphs; Graph neural networks; Graph embedding;
D O I
10.1145/3292500.3330961
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Representation learning in heterogeneous graphs aims to pursue a meaningful vector representation for each node so as to facilitate downstream applications such as link prediction, personalized recommendation, node classification, etc. This task, however, is challenging not only because of the demand to incorporate heterogeneous structural (graph) information consisting of multiple types of nodes and edges, but also due to the need for considering heterogeneous attributes or contents (e.g., text or image) associated with each node. Despite a substantial amount of effort has been made to homogeneous (or heterogeneous) graph embedding, attributed graph embedding as well as graph neural networks, few of them can jointly consider heterogeneous structural (graph) information as well as heterogeneous contents information of each node effectively. In this paper, we propose HetGNN, a heterogeneous graph neural network model, to resolve this issue. Specifically, we first introduce a random walk with restart strategy to sample a fixed size of strongly correlated heterogeneous neighbors for each node and group them based upon node types. Next, we design a neural network architecture with two modules to aggregate feature information of those sampled neighboring nodes. The first module encodes "deep" feature interactions of heterogeneous contents and generates content embedding for each node. The second module aggregates content (attribute) embeddings of different neighboring groups (types) and further combines them by considering the impacts of different groups to obtain the ultimate node embedding. Finally, we leverage a graph context loss and a mini-batch gradient descent procedure to train the model in an end-to-end manner. Extensive experiments on several datasets demonstrate that HetGNN can outperform state-of-the-art baselines in various graph mining tasks, i.e., link prediction, recommendation, node classification & clustering and inductive node classification & clustering.
引用
收藏
页码:793 / 803
页数:11
相关论文
共 36 条
[1]  
[Anonymous], 2017, PROC INT C LEARN REP
[2]  
[Anonymous], TKDE
[3]  
[Anonymous], 2015, PROC CVPR IEEE
[4]  
[Anonymous], KDD
[5]  
[Anonymous], WEB C 2018 P WORLD
[6]  
[Anonymous], 2018, TKDE, DOI DOI 10.1109/TKDE.2018.2819980
[7]  
c P., 2018, INT C LEARN REPR
[8]   Heterogeneous Network Embedding via Deep Architectures [J].
Chang, Shiyu ;
Han, Wei ;
Tang, Jiliang ;
Qi, Guo-Jun ;
Aggarwal, Charu C. ;
Huang, Thomas S. .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :119-128
[9]   Task-Guided and Path-Augmented Heterogeneous Network Embedding for Author Identification [J].
Chen, Ting ;
Sun, Yizhou .
WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, :295-304
[10]   metapath2vec: Scalable Representation Learning for Heterogeneous Networks [J].
Dong, Yuxiao ;
Chawla, Nitesh V. ;
Swami, Ananthram .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :135-144