Learning compact graph representations via an encoder-decoder network

被引:1
作者
Lee, John Boaz [1 ]
Kong, Xiangnan [1 ]
机构
[1] Worcester Polytech Inst, Worcester, MA 01609 USA
基金
美国国家科学基金会;
关键词
Deep learning; Encoder-decoder; Graph classification; Graph representation; RNN;
D O I
10.1007/s41109-019-0157-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Feature representation learning for classification of multiple graphs is a problem with practical applications in many domains. For instance, in chemoinformatics, the learned feature representations of molecular graphs can be used to classify molecules which exhibit anti-cancer properties. In many previous work, including discriminative subgraph mining and graphlet-based approaches, a graph representation is derived by counting the occurrence of various graph sub-structures. However, these representations fail to capture the co-occurrence patterns that are inherently present among different sub-structures. Recently, various methods (e.g., DeepWalk, node2vec) have been proposed to learn representations for nodes in a graph. These methods use node co-occurrence to learn node embeddings. However, these methods fail to capture the co-occurrence relationship between more complex sub-structures in the graph since they were designed primarily for node representation learning. In this work, we study the problem of learning graph representations that can capture the structural and functional similarity of sub-structures (as evidenced by their co-occurrence relationship) in graphs. This is particularly useful when classifying graphs that have very few sub-structures in common. The proposed method uses an encoder-decoder model to predict the random walk sequence along neighboring regions (or sub-structures) in a graph given a random walk along a particular region. The method is unsupervised and can be used to obtain generic feature representations of graphs making it applicable with various types of graphs. We evaluate the learned representations using several real-world datasets on the binary graph classification task. The proposed model is able to achieve superior results against multiple state-of-the-art techniques.
引用
收藏
页数:16
相关论文
共 40 条
  • [1] Ahmed N. K., 2018, PROC INT JOINT C ART
  • [2] Graphlet decomposition: framework, algorithms, and applications
    Ahmed, Nesreen K.
    Neville, Jennifer
    Rossi, Ryan A.
    Duffield, Nick G.
    Willke, Theodore L.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (03) : 689 - 722
  • [3] [Anonymous], 2013, EMNLP
  • [4] [Anonymous], 2009, Advances in neural information processing systems
  • [5] [Anonymous], 2016, P 30 INT C NEURAL IN
  • [6] [Anonymous], 2014, EMPIRICAL EVALUATION
  • [7] Shortest-path kernels on graphs
    Borgwardt, KM
    Kriegel, HP
    [J]. Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 74 - 81
  • [8] Cho K., 2014, P 2014 C EMP METH NA, V4, P1724, DOI [https://doi.org/10.3115/v1/D14-1179, DOI 10.3115/V1/D14-1179]
  • [9] Da San Martino G., 2012, P 2012 SIAM INT C DA, P975, DOI DOI 10.1137/1.9781611972825.84
  • [10] Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules
    Gomez-Bombarelli, Rafael
    Wei, Jennifer N.
    Duvenaud, David
    Hernandez-Lobato, Jose Miguel
    Sanchez-Lengeling, Benjamin
    Sheberla, Dennis
    Aguilera-Iparraguirre, Jorge
    Hirzel, Timothy D.
    Adams, Ryan P.
    Aspuru-Guzik, Alan
    [J]. ACS CENTRAL SCIENCE, 2018, 4 (02) : 268 - 276