Hierarchical stochastic graphlet embedding for graph-based pattern recognition

被引:11
作者
Dutta, Anjan [1 ]
Riba, Pau [2 ]
Llados, Josep [2 ]
Fornes, Alicia [2 ]
机构
[1] Univ Exeter, Innovat Ctr, Dept Comp Sci, Streatham Campus, Exeter EX4 4RN, Devon, England
[2] Autonomous Univ Barcelona, Comp Vis Ctr, Comp Sci Dept, Edifici O,Campus UAB, Barcelona 08193, Spain
基金
欧盟地平线“2020”;
关键词
Graph embedding; Hierarchical graph; Stochastic graphlets; Graph hashing; Graph classification; EDIT DISTANCE; VECTOR-SPACE; DATABASE; KERNELS;
D O I
10.1007/s00521-019-04642-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite being very successful within the pattern recognition and machine learning community, graph-based methods are often unusable because of the lack of mathematical operations defined in graph domain. Graph embedding, which maps graphs to a vectorial space, has been proposed as a way to tackle these difficulties enabling the use of standard machine learning techniques. However, it is well known that graph embedding functions usually suffer from the loss of structural information. In this paper, we consider the hierarchical structure of a graph as a way to mitigate this loss of information. The hierarchical structure is constructed by topologically clustering the graph nodes and considering each cluster as a node in the upper hierarchical level. Once this hierarchical structure is constructed, we consider several configurations to define the mapping into a vector space given a classical graph embedding, in particular, we propose to make use of the stochastic graphlet embedding (SGE). Broadly speaking, SGE produces a distribution of uniformly sampled low-to-high-order graphlets as a way to embed graphs into the vector space. In what follows, the coarse-to-fine structure of a graph hierarchy and the statistics fetched by the SGE complements each other and includes important structural information with varied contexts. Altogether, these two techniques substantially cope with the usual information loss involved in graph embedding techniques, obtaining a more robust graph representation. This fact has been corroborated through a detailed experimental evaluation on various benchmark graph datasets, where we outperform the state-of-the-art methods.
引用
收藏
页码:11579 / 11596
页数:18
相关论文
共 83 条
[1]  
Adelson Edward H, 1984, RCA Eng., V29, P33
[2]  
Ahuja N, 2010, LECT NOTES COMPUT SC, V6218, P1, DOI 10.1007/978-3-642-14980-1_1
[3]  
Almeida H, 2011, LECT NOTES ARTIF INT, V6911, P44, DOI 10.1007/978-3-642-23780-5_13
[4]  
[Anonymous], P S SSPR
[5]  
[Anonymous], 2005, The dissimilarity representation for pattern recognition-Foundations and applications
[6]  
[Anonymous], 2015, CVPR
[7]  
[Anonymous], 2005, 5 IEEE INT C DATA MI, DOI DOI 10.1109/ICDM.2005.132
[8]  
[Anonymous], 2016, SIGIR
[9]  
[Anonymous], 1984, Graph algorithms and NP-completeness
[10]  
Atwood J., 2016, ADV NEURAL INFORM PR, V29, P1993, DOI DOI 10.5555/3157096.3157320