M-HIN: Complex Embeddings for Heterogeneous Information Networks via Metagraphs

被引:15
作者
Fang, Yang [1 ]
Zhao, Xiang [1 ]
Huang, Peixin [1 ]
Xiao, Weidong [1 ]
de Rijke, Maarten [2 ]
机构
[1] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[2] Univ Amsterdam, Amsterdam, Netherlands
来源
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19) | 2019年
关键词
Heterogeneous information network; Representation learning; Network embedding; Metagraph;
D O I
10.1145/3331184.3331281
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To represent a complex network, paths are often employed for capturing relationships among node: random walks for (homogeneous) networks and metapaths for heterogeneous information networks (HINs). However, there is structural (and possibly semantic) information loss when using paths to represent the subgraph between two nodes, since a path is a linear structure and a subgraph often is not. Can we find a better alternative for network embeddings? We offer a novel mechanism to capture the features of HIN nodes via metagraphs, which retains more structural and semantic information than path-oriented models. Inspired by developments in knowledge graph embedding, we propose to construct HIN triplets using nodes and metagraphs between them. Metagraphs are generated by harnessing the GRAMI algorithm, which enumerates frequent subgraph patterns in a HIN. Subsequently, the Hadamard function is applied to encode relationships between nodes and metagraphs, and the probability whether a HIN triplet can be evaluated. Further, to better distinguish between symmetric and asymmetric cases of metagraphs, we introduce a complex embedding scheme that is able to precisely express fine-grained features of HIN nodes. We evaluate the proposed model, M-HIN, on real-life datasets and demonstrate that it significantly and consistently outperforms state-of-the-art models.
引用
收藏
页码:913 / 916
页数:4
相关论文
共 11 条
[1]  
Battaglia PW, 2016, ADV NEUR IN, V29
[2]  
Dai ZJ, 2018, GEOTECH SP, P196
[3]   metapath2vec: Scalable Representation Learning for Heterogeneous Networks [J].
Dong, Yuxiao ;
Chawla, Nitesh V. ;
Swami, Ananthram .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :135-144
[4]   GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph [J].
Elseidy, Mohammed ;
Abdelhamid, Ehab ;
Skiadopoulos, Spiros ;
Kalnis, Panos .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (07) :517-528
[5]  
Fang Y, 2016, PROC INT CONF DATA, P277, DOI 10.1109/ICDE.2016.7498247
[6]   HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning [J].
Fu, Tao-yang ;
Lee, Wang-Chien ;
Lei, Zhen .
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, :1797-1806
[7]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864
[8]  
Huang Zehao, 2017, CORR
[9]  
Perozzi B., 2014, PROC 20 ACM SIGKDD, P701, DOI DOI 10.1145/2623330.2623732
[10]   LINE: Large-scale Information Network Embedding [J].
Tang, Jian ;
Qu, Meng ;
Wang, Mingzhe ;
Zhang, Ming ;
Yan, Jun ;
Mei, Qiaozhu .
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, :1067-1077