GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow

被引:34
作者
Chen, Dan [1 ]
Gui, Chuangyi [1 ]
Zhang, Yi [1 ]
Jin, Hai [1 ]
Zheng, Long [1 ]
Huang, Yu [1 ]
Liao, Xiaofei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab, Clusters & Grid Comp Lab, Wuhan, Peoples R China
来源
SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS | 2022年
关键词
streaming graphs; incremental processing; redundant memory accesses;
D O I
10.1109/SC41404.2022.00050
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Existing streaming graph processing systems typically adopt two phases of refinement and recomputation to ensure the correctness of the incremental computation. However, severe redundant memory accesses exist due to the unnecessary synchronization among independent edge updates. In this paper, we present GraphFly, a high-performance asynchronous streaming graph processing system based on dependency-flows. GraphFly features three key designs: 1) Dependency trees (D-trees), which helps quickly identify independent graph updates with low cost; 2) Dependency-flow based processing model, which exploits the space-time dependent co-scheduling for cache efficiency; 3) Specialized graph data layout, which further reduces memory accesses. We evaluate GraphFly, and the results show that GraphFly significantly outperforms state-of-the-art systems KickStarter and GraphBolt by 5.81x and 1.78x on average, respectively. Also, GraphFly scales well with different sizes of update batch and compute resources.
引用
收藏
页数:14
相关论文
共 43 条
  • [1] [Anonymous], 2014, ANN IEEE ACM INT S C
  • [2] [Anonymous], 2015, Friendster network dataset
  • [3] Backstrom L., 2006, PROC C KNOWL DISCOV, P44, DOI [10.1145/1150402.1150412, DOI 10.1145/1150402.1150412]
  • [4] Beamer S, 2012, INT CONF HIGH PERFOR
  • [5] Boldi P., 2004, P 13 INT C WORLD WID, P595, DOI DOI 10.1145/988672.988752
  • [6] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    [J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [7] Cha M., 2010, 4 INT AAAI C WEBL SO, P10, DOI DOI 10.1609/ICWSM.V4I1.14033
  • [8] Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems
    Chakaravarthy, Venkatesan T.
    Checconi, Fabio
    Petrini, Fabrizio
    Sabharwal, Yogish
    [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [9] Cheng R., 2012, P 7 ACM EUR C COMP S, P85
  • [10] Low-Latency Graph Streaming using Compressed Purely-Functional Trees
    Dhulipala, Laxman
    Blelloch, Guy E.
    Shun, Julian
    [J]. PROCEEDINGS OF THE 40TH ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '19), 2019, : 918 - 934