mg2vec: Learning Relationship-Preserving Heterogeneous Graph Representations via Metagraph Embedding

被引:23
作者
Zhang, Wentao [1 ]
Fang, Yuan [2 ]
Liu, Zemin [2 ]
Wu, Min [3 ]
Zhang, Xinming [1 ]
机构
[1] Univ Sci & Technol China, Hefei 230052, Peoples R China
[2] Singapore Management Univ, Singapore 188065, Singapore
[3] Inst Infocomm Res, Singapore 138632, Singapore
基金
中国国家自然科学基金;
关键词
Task analysis; Semantics; Peer-to-peer computing; Data mining; Toy manufacturing industry; Tools; Companies; Heterogeneous information networks; network embedding; relationship mining;
D O I
10.1109/TKDE.2020.2992500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given that heterogeneous information networks (HIN) encompass nodes and edges belonging to different semantic types, they can model complex data in real-world scenarios. Thus, HIN embedding has received increasing attention, which aims to learn node representations in a low-dimensional space, in order to preserve the structural and semantic information on the HIN. In this regard, metagraphs, which model common and recurring patterns on HINs, emerge as a powerful tool to capture semantic-rich and often latent relationships on HINs. Although metagraphs have been employed to address several specific data mining tasks, they have not been thoroughly explored for the more general HIN embedding. In this paper, we leverage metagraphs to learn relationship-preserving HIN embedding in a self-supervised setting, to support various relationship mining tasks. In particular, we observe that most of the current approaches often under-utilize metagraphs, which are only applied in a pre-processing step and do not actively guide representation learning afterwards. Thus, we propose the novel framework of mg2vec, which learns the embeddings for metagraphs and nodes jointly. That is, metagraphs actively participates in the learning process by mapping themselves to the same embedding space as the nodes do. Moreover, metagraphs guide the learning through both first- and second-order constraints on node embeddings, to model not only latent relationships between a pair of nodes, but also individual preferences of each node. Finally, we conduct extensive experiments on three public datasets. Results show that mg2vec significantly outperforms a suite of state-of-the-art baselines in relationship mining tasks including relationship prediction, search and visualization.
引用
收藏
页码:1317 / 1329
页数:13
相关论文
共 55 条
  • [1] Disease gene classification with metagraph representations
    Ata, Sezin Kircali
    Fang, Yuan
    Wu, Min
    Li, Xiao-Li
    Xiao, Xiaokui
    [J]. METHODS, 2017, 131 : 83 - 92
  • [2] Efficient Subgraph Matching by Postponing Cartesian Products
    Bi, Fei
    Chang, Lijun
    Lin, Xuemin
    Qin, Lu
    Zhang, Wenjie
    [J]. SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 1199 - 1214
  • [3] Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
  • [4] A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications
    Cai, HongYun
    Zheng, Vincent W.
    Chang, Kevin Chen-Chuan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) : 1616 - 1637
  • [5] PME: Projected Metric Embedding on Heterogeneous Networks for Link Prediction
    Chen, Hongxu
    Yin, Hongzhi
    Wang, Weiqing
    Wang, Hao
    Quoc Viet Hung Nguyen
    Li, Xue
    [J]. KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1177 - 1186
  • [6] A Survey on Network Embedding
    Cui, Peng
    Wang, Xiao
    Pei, Jian
    Zhu, Wenwu
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (05) : 833 - 852
  • [7] Dai QY, 2018, AAAI CONF ARTIF INTE, P2167
  • [8] metapath2vec: Scalable Representation Learning for Heterogeneous Networks
    Dong, Yuxiao
    Chawla, Nitesh V.
    Swami, Ananthram
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 135 - 144
  • [9] GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph
    Elseidy, Mohammed
    Abdelhamid, Ehab
    Skiadopoulos, Spiros
    Kalnis, Panos
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (07): : 517 - 528
  • [10] Fang Y., 2011, Proceedings of the International Conference on Web Search and Data Mining, P825