vGraph: Memory-Efficient Multicore Graph Processing for Traversal-Centric Algorithms

被引：1

作者：

Jia, Menghan ^{[1
]}

Zhang, Yiming ^{[1
,2
]}

Gan, Xinbiao ^{[1
]}

Li, Dongsheng ^{[1
]}

Xu, Erci ^{[1
]}

Wang, Ruibo ^{[1
]}

Lu, Kai ^{[1
]}

机构：

[1] Natl Univ Def Technol, Changsha, Peoples R China

[2] Xiamen Univ, Xiamen, Peoples R China

来源：

SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS | 2022年

基金：

中国国家自然科学基金;

关键词：

graph processing; traversal-centric algorithms; memory-efficient; multicore; NUMA; ANALYTICS; FRAMEWORK; COMMUNICATION; VERTEX;

D O I：

10.1109/SC41404.2022.00068

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

To lower the monetary/energy cost, single-machine multicore graph processing is gaining increasing attention for a wide range of traversal-centric graph algorithms such as BFS, SSSP, CC, and PageRank, of which the processing is relatively simple and the topology data (vertices and edges) dominates the memory footprint. This paper presents vGRAPH, a NUMA-aware, memory-efficient multicore graph processing system for traversal-centric algorithms. vGRAPH proposes an ultralight NUMA-aware graph preprocessing scheme which eliminates almost all complex preprocessing steps and pipelines perNUMA graph loading and compressing, to effectively reduce inter-NUMA memory accesses while keeping both preprocessing cost and peak memory footprint low. We further optimize vGRAPH with effective HPC techniques including prefetching and work-stealing. Evaluation on a 384GB-memory, four-NUMA machine shows that compared to the state-of-the-art NUMAaware/-unaware systems, vGRAPH can process much larger real-world and synthetic graphs with various traversal-centric algorithms, achieving significantly higher memory efficiency and lower processing time.

引用

页数：14

共 54 条

[1] Abdelaziz I, 2015, PROC VLDB ENDOW, V8, P1881
[2] [Anonymous], 2008, US
[3] [Anonymous], 2013, About us
[4] Beamer S, 2012, INT CONF HIGH PERFOR
[5] Balanced Graph Edge Partition
Bourse, Florian
Lelarge, Marc
Vojnovic, Milan
[J]. PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1456 - 1465
[6] Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems
Chakaravarthy, Venkatesan T.
Checconi, Fabio
Murali, Prakash
Petrini, Fabrizio
Sabharwal, Yogish
[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) : 2031 - 2045
[7] G-Miner: An Efficient Task-Oriented Graph Mining System
Chen, Hongzhi
Liu, Miao
Zhao, Yunjian
Yan, Xiao
Yan, Da
Cheng, James
[J]. EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE, 2018,
[8] Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs
Da Yan
Cheng, James
Yi Lu
Ng, Wilfred
[J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (14): : 1981 - 1992
[9] Dathathri R, 2018, ACM SIGPLAN NOTICES, V53, P752, DOI [10.1145/3192366.3192404, 10.1145/3296979.3192404]
[10] Erwig M, 2000, NETWORKS, V36, P156, DOI 10.1002/1097-0037(200010)36:3<156::AID-NET2>3.0.CO

← 1 2 3 4 5 6 →