GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow

被引：34

作者：

Chen, Dan ^{[1
]}

Gui, Chuangyi ^{[1
]}

Zhang, Yi ^{[1
]}

Jin, Hai ^{[1
]}

Zheng, Long ^{[1
]}

Huang, Yu ^{[1
]}

Liao, Xiaofei ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab, Clusters & Grid Comp Lab, Wuhan, Peoples R China

来源：

SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS | 2022年

关键词：

streaming graphs; incremental processing; redundant memory accesses;

D O I：

10.1109/SC41404.2022.00050

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing streaming graph processing systems typically adopt two phases of refinement and recomputation to ensure the correctness of the incremental computation. However, severe redundant memory accesses exist due to the unnecessary synchronization among independent edge updates. In this paper, we present GraphFly, a high-performance asynchronous streaming graph processing system based on dependency-flows. GraphFly features three key designs: 1) Dependency trees (D-trees), which helps quickly identify independent graph updates with low cost; 2) Dependency-flow based processing model, which exploits the space-time dependent co-scheduling for cache efficiency; 3) Specialized graph data layout, which further reduces memory accesses. We evaluate GraphFly, and the results show that GraphFly significantly outperforms state-of-the-art systems KickStarter and GraphBolt by 5.81x and 1.78x on average, respectively. Also, GraphFly scales well with different sizes of update batch and compute resources.

引用

页数：14

共 43 条

[1] [Anonymous], 2014, ANN IEEE ACM INT S C
[2] [Anonymous], 2015, Friendster network dataset
[3] Backstrom L., 2006, PROC C KNOWL DISCOV, P44, DOI [10.1145/1150402.1150412, DOI 10.1145/1150402.1150412]
[4] Beamer S, 2012, INT CONF HIGH PERFOR
[5] Boldi P., 2004, P 13 INT C WORLD WID, P595, DOI DOI 10.1145/988672.988752
[6] The anatomy of a large-scale hypertextual Web search engine
Brin, S
Page, L
[J]. COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
[7] Cha M., 2010, 4 INT AAAI C WEBL SO, P10, DOI DOI 10.1609/ICWSM.V4I1.14033
[8] Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems
Chakaravarthy, Venkatesan T.
Checconi, Fabio
Petrini, Fabrizio
Sabharwal, Yogish
[J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
[9] Cheng R., 2012, P 7 ACM EUR C COMP S, P85
[10] Low-Latency Graph Streaming using Compressed Purely-Functional Trees
Dhulipala, Laxman
Blelloch, Guy E.
Shun, Julian
[J]. PROCEEDINGS OF THE 40TH ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '19), 2019, : 918 - 934

← 1 2 3 4 5 →