vGraph: Memory-Efficient Multicore Graph Processing for Traversal-Centric Algorithms

被引:1
作者
Jia, Menghan [1 ]
Zhang, Yiming [1 ,2 ]
Gan, Xinbiao [1 ]
Li, Dongsheng [1 ]
Xu, Erci [1 ]
Wang, Ruibo [1 ]
Lu, Kai [1 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
[2] Xiamen Univ, Xiamen, Peoples R China
来源
SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS | 2022年
基金
中国国家自然科学基金;
关键词
graph processing; traversal-centric algorithms; memory-efficient; multicore; NUMA; ANALYTICS; FRAMEWORK; COMMUNICATION; VERTEX;
D O I
10.1109/SC41404.2022.00068
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To lower the monetary/energy cost, single-machine multicore graph processing is gaining increasing attention for a wide range of traversal-centric graph algorithms such as BFS, SSSP, CC, and PageRank, of which the processing is relatively simple and the topology data (vertices and edges) dominates the memory footprint. This paper presents vGRAPH, a NUMA-aware, memory-efficient multicore graph processing system for traversal-centric algorithms. vGRAPH proposes an ultralight NUMA-aware graph preprocessing scheme which eliminates almost all complex preprocessing steps and pipelines perNUMA graph loading and compressing, to effectively reduce inter-NUMA memory accesses while keeping both preprocessing cost and peak memory footprint low. We further optimize vGRAPH with effective HPC techniques including prefetching and work-stealing. Evaluation on a 384GB-memory, four-NUMA machine shows that compared to the state-of-the-art NUMAaware/-unaware systems, vGRAPH can process much larger real-world and synthetic graphs with various traversal-centric algorithms, achieving significantly higher memory efficiency and lower processing time.
引用
收藏
页数:14
相关论文
共 54 条
  • [1] Abdelaziz I, 2015, PROC VLDB ENDOW, V8, P1881
  • [2] [Anonymous], 2008, US
  • [3] [Anonymous], 2013, About us
  • [4] Beamer S, 2012, INT CONF HIGH PERFOR
  • [5] Balanced Graph Edge Partition
    Bourse, Florian
    Lelarge, Marc
    Vojnovic, Milan
    [J]. PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1456 - 1465
  • [6] Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems
    Chakaravarthy, Venkatesan T.
    Checconi, Fabio
    Murali, Prakash
    Petrini, Fabrizio
    Sabharwal, Yogish
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) : 2031 - 2045
  • [7] G-Miner: An Efficient Task-Oriented Graph Mining System
    Chen, Hongzhi
    Liu, Miao
    Zhao, Yunjian
    Yan, Xiao
    Yan, Da
    Cheng, James
    [J]. EUROSYS '18: PROCEEDINGS OF THE THIRTEENTH EUROSYS CONFERENCE, 2018,
  • [8] Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs
    Da Yan
    Cheng, James
    Yi Lu
    Ng, Wilfred
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (14): : 1981 - 1992
  • [9] Dathathri R, 2018, ACM SIGPLAN NOTICES, V53, P752, DOI [10.1145/3192366.3192404, 10.1145/3296979.3192404]
  • [10] Erwig M, 2000, NETWORKS, V36, P156, DOI 10.1002/1097-0037(200010)36:3<156::AID-NET2>3.0.CO