DynG2G: An Efficient Stochastic Graph Embedding Method for Temporal Graphs

Cited by: 5
Authors
Xu, Mengjia [1, 2]
Singh, Apoorva Vikram [3 ]
Karniadakis, George Em [4 ]
Affiliations
[1] Brown Univ, Div Appl Math, Providence, RI 02912 USA
[2] MIT, McGovern Inst Brain Res, Cambridge, MA 02139 USA
[3] Natl Inst Technol, Dept Elect Engn, Silchar 788010, Assam, India
[4] Brown Univ, Div Appl Math, Providence, RI 02912 USA
Keywords
Dynamic graph; graph embedding; multivariate Gaussian distribution; uncertainty quantification
DOI
10.1109/TNNLS.2022.3178706
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Dynamic graph embedding has gained great attention recently due to its capability of learning low-dimensional and meaningful graph representations for complex temporal graphs with high accuracy. However, recent advances mostly focus on learning node embeddings as deterministic "vectors" for static graphs, hence disregarding the key graph temporal dynamics and the evolving uncertainties associated with node embedding in the latent space. In this work, we propose an efficient stochastic dynamic graph embedding method (DynG2G) that applies an inductive feedforward encoder trained with node triplet energy-based ranking loss. Every node per timestamp is encoded as a time-dependent probabilistic multivariate Gaussian distribution in the latent space, and, hence, we are able to quantify the node embedding uncertainty on the fly. We have considered eight different benchmarks that represent diversity in size (from 96 nodes to 87,626 and from 13,398 edges to 4,870,863) as well as diversity in dynamics, from slowly changing temporal evolution to rapidly varying multirate dynamics. We demonstrate through extensive experiments based on these eight dynamic graph benchmarks that DynG2G achieves new state-of-the-art performance in capturing the underlying temporal node embeddings. We also demonstrate that DynG2G can simultaneously predict the evolving node embedding uncertainty, which plays a crucial role in quantifying the intrinsic dimensionality of the dynamical system over time. In particular, we obtain a "universal" relation of the optimal embedding dimension, L_0, versus the effective dimensionality of uncertainty, D_u, and infer that L_0 = D_u for all cases. This, in turn, implies that the uncertainty quantification approach we employ in the DynG2G algorithm correctly captures the intrinsic dimensionality of the dynamics of such evolving graphs despite the diverse nature and composition of the graphs at each timestamp. In addition, this L_0 - D_u correlation provides a clear path to selecting adaptively the optimum embedding size at each timestamp by setting L >= D_u.
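As a rough illustration of what a "node triplet energy-based ranking loss" over Gaussian embeddings can look like, the sketch below scores one triplet of nodes whose embeddings are diagonal Gaussians. The abstract does not specify the energy function or the exact loss form, so both are assumptions here: the energy is taken to be the KL divergence between the two node distributions, and the loss is a square-exponential ranking term. The function names `kl_diag_gaussians` and `triplet_energy_loss` are hypothetical and not taken from the DynG2G code.

```python
import numpy as np


def kl_diag_gaussians(mu_p, var_p, mu_q, var_q):
    """KL(N(mu_p, diag(var_p)) || N(mu_q, diag(var_q))) for diagonal covariances."""
    mu_p, var_p, mu_q, var_q = (
        np.asarray(a, dtype=float) for a in (mu_p, var_p, mu_q, var_q)
    )
    return 0.5 * np.sum(
        var_p / var_q                  # trace term
        + (mu_q - mu_p) ** 2 / var_q   # squared distance between means, scaled by var_q
        - 1.0                          # minus the dimension, one per coordinate
        + np.log(var_q / var_p)        # log-determinant ratio
    )


def triplet_energy_loss(anchor, positive, negative):
    """Assumed square-exponential ranking loss for a single triplet.

    Each argument is a (mean, variance) pair. `positive` is the node that
    should be closer to `anchor` in the graph, `negative` the farther one.
    """
    e_pos = kl_diag_gaussians(*positive, *anchor)  # energy to be pushed down
    e_neg = kl_diag_gaussians(*negative, *anchor)  # energy to be pushed up
    return e_pos ** 2 + np.exp(-e_neg)


# Toy 2-D embeddings: (mean, variance) per node.
anchor = ([0.0, 0.0], [1.0, 1.0])
near = ([0.0, 0.0], [1.0, 1.0])
far = ([3.0, 3.0], [1.0, 1.0])

loss_correct_order = triplet_energy_loss(anchor, near, far)
loss_wrong_order = triplet_energy_loss(anchor, far, near)
```

In this toy setup the loss is small when the nearer node has the lower energy and large when the ordering is reversed, which is the behaviour a ranking loss is meant to reward. The per-node variances are what would carry the embedding uncertainty that the abstract refers to.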
Pages: 985-998
Page count: 14