Density of states for fast embedding node-attributed graphs

Cited by: 0
Authors
Zhao, Lingxiao [1 ]
Sawlani, Saurabh [2 ]
Akoglu, Leman [1 ]
Affiliations
[1] Carnegie Mellon Univ, Heinz Coll, Pittsburgh, PA 15213 USA
[2] SoundHound Inc, Berlin, Germany
Funding
Andrew Mellon Foundation (USA)
Keywords
Attributed graphs; Spectral embedding; Graph filters; Band-pass; Density of states; MATRICES;
DOI
10.1007/s10115-023-01836-3
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Given a node-attributed graph, how can we efficiently represent it with few numerical features that expressively reflect its topology and attribute information? To tackle this problem, we propose A-DOGE (attributed DOS-based graph embedding), which is based on the density of states (DOS, a.k.a. spectral density). A-DOGE is designed to fulfill a long list of desirable characteristics. Most notably, it capitalizes on efficient approximation algorithms for DOS, which we extend to blend in node labels and attributes for the first time, making it fast and scalable for large attributed graphs and graph databases. Being based on the entire eigenspectrum of a graph, A-DOGE can capture structural and attribute properties at multiple ("glocal") scales. Moreover, it is unsupervised (i.e., agnostic to any specific objective) and lends itself to various interpretations, which makes it suitable for exploratory graph mining tasks. Finally, it processes each graph independently of the others, making it amenable to streaming settings as well as parallelization. Through extensive experiments, we show the efficacy and efficiency of A-DOGE on exploratory graph analysis and graph classification tasks, where it significantly outperforms unsupervised baselines and achieves performance competitive with modern supervised GNNs, while achieving the best trade-off between accuracy and runtime.
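As a rough illustration of what a DOS-based graph feature vector can look like, the sketch below histograms the eigenvalues of the symmetric normalized adjacency matrix and, optionally, weights each eigenvalue by the squared projection of a node-attribute vector onto its eigenvector. This is only an assumption-laden toy: it uses an exact dense eigendecomposition rather than the approximation algorithms the abstract refers to, and the function name `dos_embedding`, the choice of normalized adjacency, and the `bins` parameter are illustrative choices, not details taken from the paper.

```python
import numpy as np

def dos_embedding(adj, attrs=None, bins=10):
    """Toy DOS histogram of a graph (exact eigendecomposition, illustrative only).

    adj   : dense symmetric adjacency matrix (n x n)
    attrs : optional length-n node attribute vector; if given, each eigenvalue
            is weighted by the squared projection of the normalized attribute
            vector onto its eigenvector
    bins  : number of equal-width histogram bins over [-1, 1]
    """
    adj = np.asarray(adj, dtype=float)
    deg = adj.sum(axis=1)

    # D^{-1/2}, leaving isolated nodes (degree 0) at zero.
    inv_sqrt = np.zeros_like(deg)
    nonzero = deg > 0
    inv_sqrt[nonzero] = 1.0 / np.sqrt(deg[nonzero])

    # Symmetric normalized adjacency; its eigenvalues lie in [-1, 1].
    norm_adj = inv_sqrt[:, None] * adj * inv_sqrt[None, :]
    eigvals, eigvecs = np.linalg.eigh(norm_adj)

    if attrs is None:
        # Plain DOS: every eigenvalue carries equal mass 1/n.
        weights = np.full(len(eigvals), 1.0 / len(eigvals))
    else:
        x = np.asarray(attrs, dtype=float)
        norm = np.linalg.norm(x)
        if norm == 0:
            return np.zeros(bins)
        # Attribute-weighted DOS: mass of eigenvalue i is (u_i^T x / ||x||)^2.
        weights = (eigvecs.T @ (x / norm)) ** 2

    edges = np.linspace(-1.0, 1.0, bins + 1)
    # Clip guards against eigenvalues falling just outside [-1, 1] numerically.
    hist, _ = np.histogram(np.clip(eigvals, -1.0, 1.0), bins=edges, weights=weights)
    return hist
```

In both modes the returned vector has a fixed length (`bins`) regardless of graph size and its entries sum to 1 for a non-zero attribute vector, which is what makes such a histogram usable as a per-graph feature vector computed independently for each graph.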
Pages: 2455-2483
Number of pages: 29