A Survey of Distributed Graph Algorithms on Massive Graphs

被引:5
作者
Meng, Lingkai [1 ]
Shao, Yu [2 ]
Yuan, Long [3 ]
Lai, Longbin [4 ]
Cheng, Peng [2 ]
Li, Xue [4 ]
Yu, Wenyuan [4 ]
Zhang, Wenjie [5 ]
Lin, Xuemin [1 ]
Zhou, Jingren [4 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] East China Normal Univ, Shanghai, Peoples R China
[3] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
[4] Alibaba Grp, Hangzhou, Zhejiang, Peoples R China
[5] Univ New South Wales, Sydney, NSW, Australia
基金
中国国家自然科学基金;
关键词
Distributed processing; graph algorithms; big data; LABEL PROPAGATION ALGORITHM; CORE DECOMPOSITION; PARALLEL ALGORITHM; FRAMEWORK; CENTRALITY; SCALE; BFS; COMPUTATION; SIMRANK; VERTEX;
D O I
10.1145/3694966
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been proposed. While many efforts have been devoted to analyzing these, with most analyzing them based on programming models, less research focuses on understanding their challenges in distributed environments. Applying graph tasks to distributed environments is not easy, often facing numerous challenges through our analysis, including parallelism, load balancing, communication overhead, and bandwidth. In this article, we provide an extensive overview of the current state-of-the-art in this field by outlining the challenges and solutions of distributed graph algorithms. We first conduct a systematic analysis of the inherent challenges in distributed graph processing, followed by presenting an overview of existing general solutions. Subsequently, we survey the challenges highlighted in recent distributed graph processing papers and the strategies adopted to address them. Finally, we discuss the current research trends and identify potential future opportunities.
引用
收藏
页数:39
相关论文
共 180 条
[1]   ASK-GraphView: A large scale graph visualization system [J].
Abello, James ;
van Ham, Frank ;
Krishnan, Neeraj .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (05) :669-676
[2]  
Ahmadi Mohamad, 2018, LEIBNIZ INT P INFORM, V121
[3]   Increasing the Parallelism of Graph Coloring via Shortcutting [J].
Alabandi, Ghadeer ;
Powers, Evan ;
Burtscher, Martin .
PROCEEDINGS OF THE 25TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING (PPOPP '20), 2020, :262-275
[4]   KTMiner: Distributed k-truss detection in big graphs [J].
Alemi, Mehdi ;
Haghighi, Hassan .
INFORMATION SYSTEMS, 2019, 83 :195-216
[5]   Distributed Evaluation of Subgraph Queries Using Worst-case Optimal Low-Memory Dataflows [J].
Ammar, Khaled ;
McSherry, Frank ;
Salihoglu, Semih ;
Joglekar, Manas .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (06) :691-704
[6]   Foundations of Modern Query Languages for Graph Databases [J].
Angles, Renzo ;
Arenas, Marcelo ;
Barcelo, Pablo ;
Hogan, Aidan ;
Reutter, Juan ;
Vrgoc, Domagoj .
ACM COMPUTING SURVEYS, 2017, 50 (05)
[7]  
[Anonymous], 2012, Hadoop: The Definitive Guide
[8]  
[Anonymous], Althamer, Pawel, Brodno 2000, retrieved from https://artmuseum.pl/en/filmoteka/praca/althamer-pawel-brodno-2000 (accessed January 15, 2019).
[9]  
Arifuzzaman S, 2017, Arxiv, DOI arXiv:1706.05151
[10]   Erlang [J].
Armstrong, Joe .
COMMUNICATIONS OF THE ACM, 2010, 53 (09) :68-75