A Survey of Distributed Graph Algorithms on Massive Graphs

被引:5
作者
Meng, Lingkai [1 ]
Shao, Yu [2 ]
Yuan, Long [3 ]
Lai, Longbin [4 ]
Cheng, Peng [2 ]
Li, Xue [4 ]
Yu, Wenyuan [4 ]
Zhang, Wenjie [5 ]
Lin, Xuemin [1 ]
Zhou, Jingren [4 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] East China Normal Univ, Shanghai, Peoples R China
[3] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
[4] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
[5] Univ New South Wales, Sydney, NSW, Australia
基金
中国国家自然科学基金;
关键词
Distributed processing; graph algorithms; big data; LABEL PROPAGATION ALGORITHM; CORE DECOMPOSITION; PARALLEL ALGORITHM; FRAMEWORK; CENTRALITY; SCALE; BFS; COMPUTATION; SIMRANK; VERTEX;
D O I
10.1145/3694966
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been proposed. While many efforts have been devoted to analyzing these, with most analyzing them based on programming models, less research focuses on understanding their challenges in distributed environments. Applying graph tasks to distributed environments is not easy, often facing numerous challenges through our analysis, including parallelism, load balancing, communication overhead, and bandwidth. In this article, we provide an extensive overview of the current state-of-the-art in this field by outlining the challenges and solutions of distributed graph algorithms. We first conduct a systematic analysis of the inherent challenges in distributed graph processing, followed by presenting an overview of existing general solutions. Subsequently, we survey the challenges highlighted in recent distributed graph processing papers and the strategies adopted to address them. Finally, we discuss the current research trends and identify potential future opportunities.
引用
收藏
页数:39
相关论文
共 180 条
[31]   Scaling Graph Traversal to 281 Trillion Edges with 40 Million Cores [J].
Cao, Huanqi ;
Wang, Yuanwei ;
Wang, Haojie ;
Lin, Heng ;
Ma, Zixuan ;
Yin, Wanwang ;
Chen, Wenguang .
PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022, :234-245
[32]   Brief Announcement: An Improved Distributed Approximate Single-Source Shortest Paths Algorithm [J].
Cao, Nairen ;
Fineman, Jeremy T. ;
Russell, Katina .
PROCEEDINGS OF THE 2021 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING (PODC '21), 2021, :493-496
[33]  
Carbone P, 2015, IEEE Data(base) Engineering Bulletin, V36, P28, DOI DOI 10.1109/IC2EW.2016.56
[34]  
Censor-Hillel Keren, 2022, PODC'22: Proceedings of the 2022 ACM Symposium on Principles of Distributed Computing, P271, DOI 10.1145/3519270.3538434
[35]   On Distributed Listing of Cliques [J].
Censor-Hillel, Keren ;
Le Gall, Francois ;
Leitersdorf, Dean .
PROCEEDINGS OF THE 39TH SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, PODC 2020, 2020, :474-482
[36]   Fast distributed algorithms for testing graph properties [J].
Censor-Hillel, Keren ;
Fischer, Eldar ;
Schwartzman, Gregory ;
Vasudev, Yadu .
DISTRIBUTED COMPUTING, 2019, 32 (01) :41-57
[37]   Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems [J].
Chakaravarthy, Venkatesan T. ;
Checconi, Fabio ;
Murali, Prakash ;
Petrini, Fabrizio ;
Sabharwal, Yogish .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) :2031-2045
[38]   Distributed approximate k-core decomposition and min-max edge orientation: Breaking the diameter barrier [J].
Chan, T-H Hubert ;
Sozio, Mauro ;
Sun, Bintao .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2021, 147 :87-99
[39]   Near-optimal Distributed Triangle Enumeration via Expander Decompositions [J].
Chang, Yi-Jun ;
Pettie, Seth ;
Saranurak, Thatchaphol ;
Zhang, Hengjie .
JOURNAL OF THE ACM, 2021, 68 (03)
[40]   Improved Distributed Expander Decomposition and Nearly Optimal Triangle Enumeration [J].
Chang, Yi-Jun ;
Saranurak, Thatchaphol .
PROCEEDINGS OF THE 2019 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING (PODC '19), 2019, :66-73