mg2vec: Learning Relationship-Preserving Heterogeneous Graph Representations via Metagraph Embedding

Cited by: 23
Authors
Zhang, Wentao [1]
Fang, Yuan [2]
Liu, Zemin [2]
Wu, Min [3]
Zhang, Xinming [1]
Affiliations
[1] University of Science and Technology of China, Hefei 230052, People's Republic of China
[2] Singapore Management University, Singapore 188065, Singapore
[3] Institute for Infocomm Research, Singapore 138632, Singapore
Funding
National Natural Science Foundation of China
Keywords
Task analysis; Semantics; Peer-to-peer computing; Data mining; Toy manufacturing industry; Tools; Companies; Heterogeneous information networks; network embedding; relationship mining
DOI
10.1109/TKDE.2020.2992500
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Heterogeneous information networks (HINs) contain nodes and edges of different semantic types, which lets them model complex real-world data. HIN embedding, which aims to learn low-dimensional node representations that preserve the structural and semantic information of the HIN, has therefore received increasing attention. In this regard, metagraphs, which model common and recurring patterns on HINs, have emerged as a powerful tool to capture semantically rich and often latent relationships. Although metagraphs have been employed for several specific data mining tasks, they have not been thoroughly explored for the more general problem of HIN embedding. In this paper, we leverage metagraphs to learn relationship-preserving HIN embeddings in a self-supervised setting, to support various relationship mining tasks. In particular, we observe that most current approaches under-utilize metagraphs: they apply them only in a pre-processing step, after which the metagraphs no longer guide representation learning. We therefore propose a novel framework, mg2vec, which learns the embeddings of metagraphs and nodes jointly. That is, metagraphs actively participate in the learning process by being mapped to the same embedding space as the nodes. Moreover, metagraphs guide the learning through both first- and second-order constraints on node embeddings, to model not only latent relationships between a pair of nodes but also the individual preferences of each node. Finally, we conduct extensive experiments on three public datasets. The results show that mg2vec significantly outperforms a suite of state-of-the-art baselines in relationship mining tasks, including relationship prediction, search, and visualization.
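The abstract does not state the scoring functions or the loss, so the following is only a minimal illustrative sketch of the idea it describes: metagraphs and nodes live in one embedding space, and metagraphs constrain node embeddings at the pair level (read here as the first-order constraint) and at the single-node level (read here as the second-order constraint). Everything concrete in it is an assumption made for illustration and is not taken from the paper: the softmax over dot products, the elementwise product used to represent a node pair, the weight `beta`, the toy node names, and all function names.

```python
import math
import random


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def first_order_probs(u_vec, v_vec, metagraph_vecs):
    """Distribution over metagraphs for a node pair (u, v).

    Assumption: the pair is represented by the elementwise product
    of the two node embeddings; the paper may use something else.
    """
    pair = [a * b for a, b in zip(u_vec, v_vec)]
    return softmax([dot(m, pair) for m in metagraph_vecs])


def second_order_probs(u_vec, metagraph_vecs):
    """Distribution over metagraphs for a single node u,
    i.e. that node's individual preference over metagraphs."""
    return softmax([dot(m, u_vec) for m in metagraph_vecs])


def joint_loss(nodes, metagraphs, pair_obs, node_obs, beta=0.5):
    """Negative log-likelihood combining both constraints.

    pair_obs: (u, v, metagraph_index) triples observed on the graph.
    node_obs: (u, metagraph_index) pairs observed on the graph.
    beta: hypothetical trade-off weight between the two terms.
    """
    loss = 0.0
    for u, v, m in pair_obs:
        p = first_order_probs(nodes[u], nodes[v], metagraphs)[m]
        loss -= (1.0 - beta) * math.log(p)
    for u, m in node_obs:
        p = second_order_probs(nodes[u], metagraphs)[m]
        loss -= beta * math.log(p)
    return loss


# Toy data: three nodes and three metagraphs in a shared 4-d space.
random.seed(0)
dim = 4
nodes = {
    name: [random.uniform(-1, 1) for _ in range(dim)]
    for name in ("alice", "bob", "carol")
}
metagraphs = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(3)]

pair_obs = [("alice", "bob", 0), ("bob", "carol", 2)]
node_obs = [("alice", 0), ("carol", 1)]

loss = joint_loss(nodes, metagraphs, pair_obs, node_obs)
```

In a real training loop, the node and metagraph vectors would both be updated by gradient descent on such a loss, which is what makes the metagraphs participants in learning rather than a pre-processing artifact.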
Pages: 1317-1329
Number of pages: 13