Learning compact graph representations via an encoder-decoder network

被引：1

作者：

Lee, John Boaz ^{[1
]}

Kong, Xiangnan ^{[1
]}

机构：

[1] Worcester Polytech Inst, Worcester, MA 01609 USA

来源：

APPLIED NETWORK SCIENCE | 2019年 / 4卷 / 01期

基金：

美国国家科学基金会;

关键词：

Deep learning; Encoder-decoder; Graph classification; Graph representation; RNN;

D O I：

10.1007/s41109-019-0157-9

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Feature representation learning for classification of multiple graphs is a problem with practical applications in many domains. For instance, in chemoinformatics, the learned feature representations of molecular graphs can be used to classify molecules which exhibit anti-cancer properties. In many previous work, including discriminative subgraph mining and graphlet-based approaches, a graph representation is derived by counting the occurrence of various graph sub-structures. However, these representations fail to capture the co-occurrence patterns that are inherently present among different sub-structures. Recently, various methods (e.g., DeepWalk, node2vec) have been proposed to learn representations for nodes in a graph. These methods use node co-occurrence to learn node embeddings. However, these methods fail to capture the co-occurrence relationship between more complex sub-structures in the graph since they were designed primarily for node representation learning. In this work, we study the problem of learning graph representations that can capture the structural and functional similarity of sub-structures (as evidenced by their co-occurrence relationship) in graphs. This is particularly useful when classifying graphs that have very few sub-structures in common. The proposed method uses an encoder-decoder model to predict the random walk sequence along neighboring regions (or sub-structures) in a graph given a random walk along a particular region. The method is unsupervised and can be used to obtain generic feature representations of graphs making it applicable with various types of graphs. We evaluate the learned representations using several real-world datasets on the binary graph classification task. The proposed model is able to achieve superior results against multiple state-of-the-art techniques.

引用

页数：16

共 40 条

[1] Ahmed N. K., 2018, PROC INT JOINT C ART
[2] Graphlet decomposition: framework, algorithms, and applications
Ahmed, Nesreen K.
Neville, Jennifer
Rossi, Ryan A.
Duffield, Nick G.
Willke, Theodore L.
[J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (03) : 689 - 722
[3] [Anonymous], 2013, EMNLP
[4] [Anonymous], 2009, Advances in neural information processing systems
[5] [Anonymous], 2016, P 30 INT C NEURAL IN
[6] [Anonymous], 2014, EMPIRICAL EVALUATION
[7] Shortest-path kernels on graphs
Borgwardt, KM
Kriegel, HP
[J]. Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 74 - 81
[8] Cho K., 2014, P 2014 C EMP METH NA, V4, P1724, DOI [https://doi.org/10.3115/v1/D14-1179, DOI 10.3115/V1/D14-1179]
[9] Da San Martino G., 2012, P 2012 SIAM INT C DA, P975, DOI DOI 10.1137/1.9781611972825.84
[10] Automatic Chemical Design Using a Data-Driven Continuous Representation of Molecules
Gomez-Bombarelli, Rafael
Wei, Jennifer N.
Duvenaud, David
Hernandez-Lobato, Jose Miguel
Sanchez-Lengeling, Benjamin
Sheberla, Dennis
Aguilera-Iparraguirre, Jorge
Hirzel, Timothy D.
Adams, Ryan P.
Aspuru-Guzik, Alan
[J]. ACS CENTRAL SCIENCE, 2018, 4 (02) : 268 - 276

← 1 2 3 4 →