Random Walks on Huge Graphs at Cache Efficiency

被引:15
|
作者
Yang, Ke [1 ,2 ,3 ]
Ma, Xiaosong [2 ]
Thirumuruganathan, Saravanan [2 ]
Chen, Kang [1 ,3 ]
Wu, Yongwei [1 ,3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[2] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Ar Rayyan, Qatar
[3] Beijing HaiZhi XingTu Technol Co Ltd, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 28TH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2021 | 2021年
关键词
graph computing; random walk; memory; cache; ANALYTICS; NETWORKS; SYSTEM;
D O I
10.1145/3477132.3483575
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Data-intensive applications dominated by random accesses to large working sets fail to utilize the computing power of modern processors. Graph random walk, an indispensable workhorse for many important graph processing and learning applications, is one prominent case of such applications. Existing graph random walk systems are currently unable to match the GPU-side node embedding training speed. This work reveals that existing approaches fail to effectively utilize the modern CPU memory hierarchy, due to the widely held assumption that the inherent randomness in random walks and the skewed nature of graphs render most memory accesses random. We demonstrate that there is actually plenty of spatial and temporal locality to harvest, by careful partitioning, rearranging, and batching of operations. The resulting system, FlashMob, improves both cache and memory bandwidth utilization by making memory accesses more sequential and regular. We also found that a classical combinatorial optimization problem (and its exact pseudo-polynomial solution) can be applied to complex decision making, for accurate yet efficient data/task partitioning. Our comprehensive experiments over diverse graphs show that our system achieves an order of magnitude performance improvement over the fastest existing system. It processes a 58GB real graph at higher per-step speed than the existing system on a 600KB toy graph fitting in the L2 cache.
引用
收藏
页码:311 / 326
页数:16
相关论文
共 50 条
  • [1] Short random walks on graphs
    Barnes, G
    Feige, U
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 1996, 9 (01) : 19 - 28
  • [2] On the speed of random walks on graphs
    Virág, B
    ANNALS OF PROBABILITY, 2000, 28 (01) : 379 - 394
  • [3] Random Walks on Directed Covers of Graphs
    Gilch, Lorenz A.
    Mueller, Sebastian
    JOURNAL OF THEORETICAL PROBABILITY, 2011, 24 (01) : 118 - 149
  • [4] Deterministic random walks on finite graphs
    Kijima, Shuji
    Koga, Kentaro
    Makino, Kazuhisa
    RANDOM STRUCTURES & ALGORITHMS, 2015, 46 (04) : 739 - 761
  • [5] Random Walks on Directed Covers of Graphs
    Lorenz A. Gilch
    Sebastian Müller
    Journal of Theoretical Probability, 2011, 24 : 118 - 149
  • [6] Reversible random walks on dynamic graphs
    Shimizu, Nobutaka
    Shiraga, Takeharu
    RANDOM STRUCTURES & ALGORITHMS, 2023, 63 (04) : 1100 - 1136
  • [7] Random Walks on Complete Multipartite Graphs
    Chang, Xiao
    Xu, Hao
    PURE AND APPLIED MATHEMATICS QUARTERLY, 2015, 11 (03) : 393 - 402
  • [8] On the norms of the random walks on planar graphs
    Zuk, A
    ANNALES DE L INSTITUT FOURIER, 1997, 47 (05) : 1463 - +
  • [9] RANDOM WALKS ON DENSE GRAPHS AND GRAPHONS
    Petit, Julien
    Lambiotte, Renaud
    Carletti, Timoteo
    SIAM JOURNAL ON APPLIED MATHEMATICS, 2021, 81 (06) : 2323 - 2345
  • [10] Reconstruction of graphs based on random walks
    Wittmann, Dominik M.
    Schmidl, Daniel
    Bloechl, Florian
    Theis, Fabian J.
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (38-40) : 3826 - 3838