Heterogeneous Graph Neural Network

Cited by: 1082
Authors
Zhang, Chuxu [1]
Song, Dongjin [2]
Huang, Chao [1, 3]
Swami, Ananthram [4]
Chawla, Nitesh V. [1]
Affiliations
[1] Univ Notre Dame, Notre Dame, IN 46556 USA
[2] NEC Labs Amer Inc, Princeton, NJ USA
[3] JD Digits, Beijing, Peoples R China
[4] US Army, Res Lab, Adelphi, MD USA
Source
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2019
Funding
National Science Foundation (USA);
Keywords
Heterogeneous graphs; Graph neural networks; Graph embedding;
DOI
10.1145/3292500.3330961
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Representation learning in heterogeneous graphs aims to learn a meaningful vector representation for each node so as to facilitate downstream applications such as link prediction, personalized recommendation, and node classification. This task is challenging not only because it must incorporate heterogeneous structural (graph) information consisting of multiple types of nodes and edges, but also because it must consider the heterogeneous attributes or contents (e.g., text or image) associated with each node. Although substantial effort has been devoted to homogeneous (or heterogeneous) graph embedding, attributed graph embedding, and graph neural networks, few of these methods can effectively consider heterogeneous structural (graph) information and the heterogeneous content information of each node jointly. In this paper, we propose HetGNN, a heterogeneous graph neural network model, to resolve this issue. Specifically, we first introduce a random walk with restart strategy to sample a fixed-size set of strongly correlated heterogeneous neighbors for each node and group them by node type. Next, we design a neural network architecture with two modules to aggregate feature information of those sampled neighboring nodes. The first module encodes "deep" feature interactions of heterogeneous contents and generates a content embedding for each node. The second module aggregates the content (attribute) embeddings of the different neighboring groups (types) and further combines them by considering the impacts of the different groups to obtain the final node embedding. Finally, we leverage a graph context loss and a mini-batch gradient descent procedure to train the model in an end-to-end manner. Extensive experiments on several datasets demonstrate that HetGNN can outperform state-of-the-art baselines in various graph mining tasks, namely link prediction, recommendation, node classification and clustering, and inductive node classification and clustering.
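The neighbor sampling step described in the abstract (random walk with restart, then grouping by node type) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `sample_neighbors_rwr`, the toy graph, and the values of `restart_prob`, `walk_len`, and `top_k` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import random
from collections import Counter, defaultdict


def sample_neighbors_rwr(adj, node_type, start, restart_prob=0.5,
                         walk_len=200, top_k=2, seed=0):
    """Sample type-grouped neighbors of `start` via random walk with restart.

    Hypothetical helper: all parameter defaults are illustrative assumptions.
    Returns {node type: up to `top_k` most frequently visited nodes}.
    """
    rng = random.Random(seed)
    visits = Counter()
    current = start
    for _ in range(walk_len):
        # Jump back to the start node with probability `restart_prob`
        # (or when the walk is stuck at a node with no neighbors).
        if rng.random() < restart_prob or not adj[current]:
            current = start
            continue
        current = rng.choice(adj[current])
        if current != start:
            visits[current] += 1

    # Keep the most frequently visited nodes of each type, so every
    # node ends up with a bounded number of neighbors per type.
    grouped = defaultdict(list)
    for node, _count in visits.most_common():
        group = grouped[node_type[node]]
        if len(group) < top_k:
            group.append(node)
    return dict(grouped)


# Toy academic graph (assumed for illustration): authors, papers, one venue.
adj = {
    "a1": ["p1", "p2"],
    "a2": ["p2"],
    "p1": ["a1", "v1"],
    "p2": ["a1", "a2", "v1"],
    "v1": ["p1", "p2"],
}
node_type = {"a1": "author", "a2": "author",
             "p1": "paper", "p2": "paper", "v1": "venue"}

groups = sample_neighbors_rwr(adj, node_type, start="a1")
print(groups)
```

Ranking by visit frequency is one simple way to approximate "strongly correlated" neighbors; the per-type cap keeps the neighbor set bounded regardless of how skewed the type distribution around a node is.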
Pages: 793-803
Number of pages: 11