Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems

被引:17
作者
Besta, Maciej [1 ]
Fischer, Marc [2 ]
Kalavri, Vasiliki [3 ]
Kapralov, Michael [4 ]
Hoefler, Torsten [1 ]
机构
[1] Swiss Fed Inst Technol, CH-8092 Zurich, Switzerland
[2] PRODYNA Schweiz AG, CH-4052 Basel, Switzerland
[3] Boston Univ, Boston, MA 02215 USA
[4] Ecole Polytech Fed Lausanne, Sch Comp & Commun Sci, CH-1015 Lausanne, Switzerland
基金
欧洲研究理事会;
关键词
Heuristic algorithms; Taxonomy; Analytical models; Data models; Computational modeling; Distributed databases; Social networking (online); Streaming graphs; dynamic graphs; evolving graphs; streaming graph processing; dynamic graph processing; evolving graph processing; online graph processing; graph streaming frameworks; graph databases; ALGORITHMS;
D O I
10.1109/TPDS.2021.3131677
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graphs processing has become an important part of various areas of computing, including machine learning, medical applications, social network analysis, computational sciences, and others. A growing amount of the associated graph processing workloads are dynamic, with millions of edges added or removed per second. Graph streaming frameworks are specifically crafted to enable the processing of such highly dynamic workloads. Recent years have seen the development of many such frameworks. However, they differ in their general architectures (with key details such as the support for the concurrent execution of graph updates and queries, or the incorporated graph data organization), the types of updates and workloads allowed, and many others. To facilitate the understanding of this growing field, we provide the first analysis and taxonomy of dynamic and streaming graph processing. We focus on identifying the fundamental system designs and on understanding their support for concurrency, and for different graph updates as well as analytics workloads. We also crystallize the meaning of different concepts associated with streaming graph processing, such as dynamic, temporal, online, and time-evolving graphs, edge-centric processing, models for the maintenance of updates, and graph databases. Moreover, we provide a bridge with the very rich landscape of graph streaming theory by giving a broad overview of recent theoretical related advances, and by discussing which graph streaming models and settings could be helpful in developing more powerful streaming frameworks and designs. We also outline graph streaming workloads and research challenges.
引用
收藏
页码:1860 / 1876
页数:17
相关论文
共 166 条
[1]   Sprouter: Dynamic Graph Processing over Data Streams at Scale [J].
Abughofa, Tariq ;
Zulkernine, Farhana .
DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 :321-328
[2]   Parallel Batch-Dynamic Graph Connectivity [J].
Acar, Umut A. ;
Anderson, Daniel ;
Blelloch, Guy E. ;
Dhulipala, Laxman .
SPAA'19: PROCEEDINGS OF THE 31ST ACM SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURESS, 2019, 2019, :381-392
[3]  
Agarwal Rachit, 2015, Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation: NSDI '15, P337
[4]   Evolutionary Network Analysis: A Survey [J].
Aggarwal, Charu ;
Subbian, Karthik .
ACM COMPUTING SURVEYS, 2014, 47 (01)
[5]  
Aguilera Marcos K., 2007, Operating Systems Review, V41, P159, DOI 10.1145/1323293.1294278
[6]   A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing [J].
Ahn, Junwhan ;
Hong, Sungpack ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :105-117
[7]  
Ahn K.J., 2012, PODS, P5, DOI DOI 10.1145/2213556.2213560
[8]  
Akidau T, 2015, PROC VLDB ENDOW, V8, P1792
[9]  
Ammar Khaled, 2016, SIGMOD PHD S, P7
[10]  
Anderson JC., 2010, CouchDB: The Definitive Guide