Random Walks on Huge Graphs at Cache Efficiency

被引:15
|
作者
Yang, Ke [1 ,2 ,3 ]
Ma, Xiaosong [2 ]
Thirumuruganathan, Saravanan [2 ]
Chen, Kang [1 ,3 ]
Wu, Yongwei [1 ,3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
[2] Hamad Bin Khalifa Univ, Qatar Comp Res Inst, Ar Rayyan, Qatar
[3] Beijing HaiZhi XingTu Technol Co Ltd, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 28TH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2021 | 2021年
关键词
graph computing; random walk; memory; cache; ANALYTICS; NETWORKS; SYSTEM;
D O I
10.1145/3477132.3483575
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Data-intensive applications dominated by random accesses to large working sets fail to utilize the computing power of modern processors. Graph random walk, an indispensable workhorse for many important graph processing and learning applications, is one prominent case of such applications. Existing graph random walk systems are currently unable to match the GPU-side node embedding training speed. This work reveals that existing approaches fail to effectively utilize the modern CPU memory hierarchy, due to the widely held assumption that the inherent randomness in random walks and the skewed nature of graphs render most memory accesses random. We demonstrate that there is actually plenty of spatial and temporal locality to harvest, by careful partitioning, rearranging, and batching of operations. The resulting system, FlashMob, improves both cache and memory bandwidth utilization by making memory accesses more sequential and regular. We also found that a classical combinatorial optimization problem (and its exact pseudo-polynomial solution) can be applied to complex decision making, for accurate yet efficient data/task partitioning. Our comprehensive experiments over diverse graphs show that our system achieves an order of magnitude performance improvement over the fastest existing system. It processes a 58GB real graph at higher per-step speed than the existing system on a 600KB toy graph fitting in the L2 cache.
引用
收藏
页码:311 / 326
页数:16
相关论文
共 50 条
  • [31] Growth Series and Random Walks on Some Hyperbolic Graphs
    Laurent Bartholdi
    Tullio G. Ceccherini-Silberstein
    Monatshefte für Mathematik, 2002, 136 : 181 - 202
  • [32] INTERLACINGS FOR RANDOM WALKS ON WEIGHTED GRAPHS AND THE INTERCHANGE PROCESS
    Dieker, A. B.
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2010, 24 (01) : 191 - 206
  • [33] An Explicit Formula of Hitting Times for Random Walks on Graphs
    Xu, Hao
    Yau, Shing-Tung
    PURE AND APPLIED MATHEMATICS QUARTERLY, 2014, 10 (03) : 567 - 581
  • [34] Protein localization prediction using random walks on graphs
    Xiaohua Xu
    Lin Lu
    Ping He
    Ling Chen
    BMC Bioinformatics, 14
  • [35] ON THE EIGENSPACES OF LAMPLIGHTER RANDOM WALKS AND PERCOLATION CLUSTERS ON GRAPHS
    Lehner, Franz
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 2009, 137 (08) : 2631 - 2637
  • [36] Random walks on graphs with interval weights and precise marginals
    Skulj, Damjan
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2016, 73 : 76 - 86
  • [37] The connection between evolution algebras, random walks and graphs
    Cadavid, Paula
    Rodino Montoya, Mary Luz
    Rodriguez, Pablo M.
    JOURNAL OF ALGEBRA AND ITS APPLICATIONS, 2020, 19 (02)
  • [38] Learning without Recall by Random Walks on Directed Graphs
    Rahimian, M. A.
    Shahrampour, S.
    Jadbabaie, A.
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 5538 - 5543
  • [39] Random Walks with Invariant Loop Probabilities: Stereographic Random Walks
    Montero, Miquel
    ENTROPY, 2021, 23 (06)
  • [40] Analytical results for the distribution of cover times of random walks on random regular graphs
    Tishby, Ido
    Biham, Ofer
    Katzav, Eytan
    JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2022, 55 (01)