Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries

被引:18
作者
Besta, Maciej [1 ]
Gerstenberger, Robert [1 ]
Peter, Emanuel [1 ]
Fischer, Marc [3 ]
Podstawski, Michal [4 ]
Barthels, Claude [1 ]
Alonso, Gustavo [2 ]
Hoefler, Torsten [1 ]
机构
[1] Swiss Fed Inst Technol, Inst Comp Platforms, OAT S 17,Andreasstr 5, CH-8092 Zurich, Switzerland
[2] Swiss Fed Inst Technol, Inst Comp Platforms, Dep Comp Sci, STF J 420,Stampfenbachstr 114, CH-8092 Zurich, Switzerland
[3] PRODYNA Schweiz AG, Grosspeteranlage 29, CH-4052 Basel, Switzerland
[4] Future Proc, Bojkowska 37a, PL-44100 Gliwice, Poland
关键词
Graphs; Graph Databases; NoSQL Stores; Graph Database Management Systems; Graph Models; Data Layout; Graph Queries; Graph Transactions; Graph Representations; RDF; Labeled Property Graph; Triple Stores; Key-Value Stores; RDBMS; Wide-Column Stores; Document Stores; HIGH-PERFORMANCE; RDF; PARALLEL; COMMUNICATION; CHALLENGES; BENCHMARK;
D O I
10.1145/3604932
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Numerous irregular graph datasets, for example social networks or web graphs, may contain even trillions of edges. Often, their structure changes over time and they have domain-specific rich data associated with vertices and edges. Graph database systems such as Neo4j enable storing, processing, and analyzing such large, evolving, and rich datasets. Due to the sheer size and irregularity of such datasets, these systems face unique design challenges. To facilitate the understanding of this emerging domain, we present the first survey and taxonomy of graph database systems. We focus on identifying and analyzing fundamental categories of these systems (e.g., document stores, tuple stores, native graph database systems, or object-oriented systems), the associated graph models (e.g., Resource Description Framework or Labeled Property Graph), data organization techniques (e.g., storing graph data in indexing structures or dividing data into records), and different aspects of data distribution and query execution (e.g., support for sharding and Atomicity, Consistency, Isolation, Durability). Fifty-one graph database systems are presented and compared, including Neo4j, OrientDB, and Virtuoso. We outline graph database queries and relationships with associated domains (NoSQL stores, graph streaming, and dynamic graph algorithms). Finally, we outline future research and engineering challenges related to graph databases.
引用
收藏
页数:40
相关论文
共 214 条
[1]  
Abadi D., 2007, VLDB, P411
[2]   Effective Partitioning and Multiple RDF Indexing for Database Triple Store [J].
Abburua, Sunitha ;
Golla, Suresh Babu .
ENGINEERING JOURNAL-THAILAND, 2015, 19 (05) :139-154
[3]   EmptyHeaded: A Relational Engine for Graph Processing [J].
Aberger, Christopher R. ;
Lamb, Andrew ;
Tu, Susan ;
Noetzli, Andres ;
Olukotun, Kunle ;
Re, Christopher .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2017, 42 (04)
[4]   A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing [J].
Ahn, Junwhan ;
Hong, Sungpack ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :105-117
[5]  
Amazon, Amazon Neptune
[6]   Survey of graph database models [J].
Angles, Renzo ;
Gutierrez, Claudio .
ACM COMPUTING SURVEYS, 2008, 40 (01)
[7]   G-CORE A Core for Future Graph Query Languages [J].
Angles, Renzo ;
Arenas, Marcelo ;
Barcelo, Pablo ;
Boncz, Peter ;
Fletcher, George ;
Gutierrez, Claudio ;
Lindaaker, Tobias ;
Paradies, Marcus ;
Plantikow, Stefan ;
Sequeda, Juan ;
van Rest, Oskar ;
Voigt, Hannes .
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2018, :1421-1432
[8]   Foundations of Modern Query Languages for Graph Databases [J].
Angles, Renzo ;
Arenas, Marcelo ;
Barcelo, Pablo ;
Hogan, Aidan ;
Reutter, Juan ;
Vrgoc, Domagoj .
ACM COMPUTING SURVEYS, 2017, 50 (05)
[9]   The Linked Data Benchmark Council: a Graph and RDF industry benchmarking effort [J].
Angles, Renzo ;
Boncz, Peter ;
Larriba-Pey, Josep ;
Fundulaki, Irini ;
Neumann, Thomas ;
Erling, Orri ;
Neubauer, Peter ;
Martinez-Bazan, Norbert ;
Kotsev, Venelin ;
Toma, Ioan .
SIGMOD RECORD, 2014, 43 (01) :27-31
[10]  
Angles Renzo., 2018, Graph Data Management, p1s32, DOI [10.1007/978-3-319-96193-4_1, DOI 10.1007/978-3-319-96193-4_1]