Persistence of RDF Data into NoSQL: A Survey and a Reference Architecture

Cited by: 1
Authors
Zambom Santana, Luiz Henrique [1]
Mello, Ronaldo dos Santos [1]
Affiliations
[1] Univ Fed Santa Catarina, Dept Informat & Estat, BR-88040900 Florianopolis, SC, Brazil
Keywords
Resource description framework; NoSQL databases; Data models; Indexing; Benchmark testing; Scalability; NoSQL; RDF; SPARQL; Semantic Web; Data management; Databases; Clouds
DOI
10.1109/TKDE.2020.2994521
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
RDF is increasingly being considered in a broad range of information management scenarios. Governments, large corporations, startups, and other organizations around the world are using RDF as a data model to represent and share knowledge. However, RDF still faces a long evolutionary path, with multiple challenges, before it reaches the scale of the most recent Big Data intensive applications (e.g., Smart Cities, Sensor Networks, eHealth, Internet of Things). In this survey, we review the use of NoSQL databases for storing large RDF graphs by revisiting the latest surveys and expanding their findings with updated proposals, shedding light on aspects such as model mapping between RDF and NoSQL, triple indexing and partitioning, graph fragmentation, and data caching. Moreover, we explain how the surveyed works extended RDF capabilities so that datasets can benefit from the scalability, schemaless data, and better overall performance of NoSQL databases. The survey summarizes the current state of the art, discusses open problems, and proposes a Reference Architecture (RA). To the best of our knowledge, this is the first survey that focuses solely on papers that use one or more NoSQL systems for RDF persistence.
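As a rough illustration of two of the aspects named in the abstract (mapping RDF onto a NoSQL data model, and triple indexing and partitioning), the sketch below stores triples in a key-value layout with three permutation indexes and assigns each subject to a partition by hashing. It is not taken from the surveyed paper: the class `TripleIndex`, the function `partition_for`, the SPO/POS/OSP layout, and the sample triples are all assumptions chosen for illustration, and plain Python dictionaries stand in for a real NoSQL store.

```python
from collections import defaultdict
from zlib import crc32

# Hypothetical sample triples (subject, predicate, object); not from the paper.
TRIPLES = [
    ("ex:alice", "foaf:knows", "ex:bob"),
    ("ex:alice", "foaf:name", '"Alice"'),
    ("ex:bob", "foaf:name", '"Bob"'),
]


class TripleIndex:
    """In-memory stand-in for a key-value store with three permutation indexes."""

    # Each index reorders the (s, p, o) positions of a triple.
    ORDERS = {"spo": (0, 1, 2), "pos": (1, 2, 0), "osp": (2, 0, 1)}

    def __init__(self):
        self.tables = {name: defaultdict(set) for name in self.ORDERS}

    def add(self, s, p, o):
        triple = (s, p, o)
        for name, order in self.ORDERS.items():
            first, second, third = (triple[i] for i in order)
            # Key: the first two components; value: the set of third components.
            self.tables[name][(first, second)].add(third)

    def objects(self, s, p):
        """Answer the pattern (s, p, ?o) from the SPO index."""
        return sorted(self.tables["spo"].get((s, p), ()))

    def subjects(self, p, o):
        """Answer the pattern (?s, p, o) from the POS index."""
        return sorted(self.tables["pos"].get((p, o), ()))


def partition_for(subject, n_partitions):
    """Hash-partition by subject so triples sharing a subject are co-located."""
    return crc32(subject.encode("utf-8")) % n_partitions


index = TripleIndex()
for s, p, o in TRIPLES:
    index.add(s, p, o)

print(index.objects("ex:alice", "foaf:knows"))
print(index.subjects("foaf:name", '"Bob"'))
```

Keeping several permutations trades extra storage for the ability to answer different triple patterns with a single key lookup; partitioning by subject is only one possible choice, and other schemes would suit other workloads.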
Pages: 1370-1389
Page count: 20