Learning with Similarity Functions on Graphs using Matchings of Geometric Embeddings

被引:22
作者
Johansson, Fredrik D. [1 ]
Dubhashi, Devdatt [1 ]
机构
[1] Chalmers Univ Technol, Gothenburg, Sweden
来源
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | 2015年
关键词
Matchings; Similarity functions; Graphs; Geometric embeddings; Classification; KERNELS;
D O I
10.1145/2783258.2783341
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop and apply the Balcan Blum Srebro (BBS) theory of classification via similarity functions (which are not necessarily kernels) to the problem of graph classification. First we place the BBS theory into the unifying framework of optimal transport theory. This also opens the way to exploit coupling methods for establishing properties required of a good similarity function as per their definition. Next, we use the approach to the problem of graph classification via geometric embeddings such as the Laplacian, pseudo inverse Laplacian and the Lovasz orthogonal labellings. We consider the similarity function given by optimal and near optimal matchings with respect to Euclidean distance of the corresponding embeddings of the graphs in high dimensions. We use optimal couplings to rigorously establish that this yields a "good" similarity measure in the BBS sense for two well known families of graphs. Further, we show that the similarity yields better classification accuracy in practice, on these families, than matchings of other well-known graph embeddings. Finally we perform an extensive empirical evaluation on benchmark data sets where we show that classifying graphs using matchings of geometric embeddings outperforms the previous state of the art methods.
引用
收藏
页码:467 / 476
页数:10
相关论文
共 45 条
  • [21] Semidefinite programming in combinatorial optimization
    Goemans, MX
    [J]. MATHEMATICAL PROGRAMMING, 1997, 79 (1-3) : 143 - 161
  • [22] Gower J.C., 2004, Procrustes Problems, V3
  • [23] Gutman I, 2004, SERB AC B, P15
  • [24] Haussler David, 1999, Technical Report
  • [25] The Predictive Toxicology Challenge 2000-2001
    Helma, C
    King, RD
    Kramer, S
    Srinivasan, A
    [J]. BIOINFORMATICS, 2001, 17 (01) : 107 - 108
  • [26] Entity Disambiguation in Anonymized Graphs Using Graph Kernels
    Hermansson, Linus
    Kerola, Tommi
    Johansson, Fredrik
    Jethava, Vinay
    Dubhashi, Devdatt
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1037 - 1046
  • [27] Jethava V, 2013, J MACH LEARN RES, V14, P3495
  • [28] Johansson FD, 2014, PR MACH LEARN RES, V32, P694
  • [29] Kar Purushottam., 2011, Proceedings of the 24th International Conference on Neural Information Processing Systems, NIPS11, P1998
  • [30] Kar Purushottam, 2012, ADV NEURAL INFORM PR, P215