Scaling Graph Clustering with Distributed Sketches

被引:1
|
作者
Priest, Benjamin W. [1 ]
Dunton, Alec [2 ]
Sanders, Geoffrey [1 ]
机构
[1] Lawrence Livermore Natl Lab, Ctr Appl Sci Comp, Livermore, CA 94550 USA
[2] Univ Colorado, Dept Appl Math, Boulder, CO 80309 USA
关键词
streaming algorithms; random matrices; graph clustering;
D O I
10.1109/hpec43674.2020.9286252
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The unsupervised learning of community structure, in particular the partitioning vertices into clusters or communities, is a canonical and well-studied problem in exploratory graph analysis. However, like most graph analyses the introduction of immense scale presents challenges to traditional methods. Spectral clustering in distributed memory, for example, requires hundreds of expensive bulk-synchronous communication rounds to compute an embedding of vertices to a few eigenvectors of a graph associated matrix. Furthermore, the whole computation might need to be repeated if the underlying graph changes some low percentage of edge updates. We present a method inspired by spectral clustering where we instead use matrix sketches derived from random dimension-reducing projections. We show that our method produces embeddings that yield performant clustering results given a fully-dynamic stochastic block model stream using both the fast Johnson-Lindenstrauss and CountSketch transforms. We also discuss the effects of stochastic block model parameters upon the required dimensionality of the subsequent embeddings, and show how random projections could significantly improve the performance of graph clustering in distributed memory.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Learning Based Approach for Optimal Clustering of Distributed Program's Call Flow Graph
    Abofathi, Yousef
    Zarei, Bager
    Parsa, Saeed
    INNOVATIONS IN COMPUTING SCIENCES AND SOFTWARE ENGINEERING, 2010, : 207 - 211
  • [32] Median graph computation for graph clustering
    Hlaoui, A
    Wang, SR
    SOFT COMPUTING, 2006, 10 (01) : 47 - 53
  • [33] Auxiliary Graph for Attribute Graph Clustering
    Li, Wang
    Wang, Siwei
    Guo, Xifeng
    Zhou, Zhenyu
    Zhu, En
    ENTROPY, 2022, 24 (10)
  • [34] Graph Learning for Attributed Graph Clustering
    Zhang, Xiaoran
    Xie, Xuanting
    Kang, Zhao
    MATHEMATICS, 2022, 10 (24)
  • [35] Graph Clustering With Graph Capsule Network
    Zhang, Xianchao
    Mu, Jie
    Liu, Han
    Zhang, Xiaotong
    Zong, Linlin
    Wang, Guanglu
    NEURAL COMPUTATION, 2022, 34 (05) : 1256 - 1287
  • [36] Graph Clustering with Graph Neural Networks
    Tsitsulin, Anton
    Palowitch, John
    Perozzi, Bryan
    Mueller, Emmanuel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [37] Median graph computation for graph clustering
    Adel Hlaoui
    Shengrui Wang
    Soft Computing, 2006, 10 : 47 - 53
  • [38] Efficient distributed computation of distance sketches in networks
    Das Sarma, Atish
    Dinitz, Michael
    Pandurangan, Gopal
    DISTRIBUTED COMPUTING, 2015, 28 (05) : 309 - 320
  • [39] Large Graph Clustering Using DCT-Based Graph Clustering
    Tsapanos, Nikolaos
    Tefas, Anastasios
    Nikolaidis, Nikolaos
    Pitas, Ioannis
    2014 IEEE Symposium on Computational Intelligence in Big Data (CIBD), 2014, : 108 - 111
  • [40] GLASS: A Graph Laplacian Autoencoder with Subspace Clustering Regularization for Graph Clustering
    Dengdi Sun
    Liang Liu
    Bin Luo
    Zhuanlian Ding
    Cognitive Computation, 2023, 15 : 803 - 821