gStore: a graph-based SPARQL query engine

被引:0
作者
Lei Zou
M. Tamer Özsu
Lei Chen
Xuchuan Shen
Ruizhe Huang
Dongyan Zhao
机构
[1] Peking University,Institute of Computer Science and Technology
[2] University of Waterloo,David R. Cheriton School of Computer Science
[3] Hong Kong University of Science and Technology,Department of Computer Science and Engineering
来源
The VLDB Journal | 2014年 / 23卷
关键词
RDF; SPARQL; Graph database; Graph matching ; Aggregate query;
D O I
暂无
中图分类号
学科分类号
摘要
We address efficient processing of SPARQL queries over RDF datasets. The proposed techniques, incorporated into the gStore system, handle, in a uniform and scalable manner, SPARQL queries with wildcards and aggregate operators over dynamic RDF datasets. Our approach is graph based. We store RDF data as a large graph and also represent a SPARQL query as a query graph. Thus, the query answering problem is converted into a subgraph matching problem. To achieve efficient and scalable query processing, we develop an index, together with effective pruning rules and efficient search algorithms. We propose techniques that use this infrastructure to answer aggregation queries. We also propose an effective maintenance algorithm to handle online updates over RDF repositories. Extensive experiments confirm the efficiency and effectiveness of our solutions.
引用
收藏
页码:565 / 590
页数:25
相关论文
共 45 条
[1]  
Abadi DJ(2009)SW-Store: a vertically partitioned DBMS for semantic web data management VLDB J. 18 385-406
[2]  
Marcus A(1981)Using semi-joins to solve relational queries J. ACM 28 25-40
[3]  
Madden S(1984)Signature files: an access method for documents and its analytical performance evaluation ACM Trans. Inf. Syst. 2 267-288
[4]  
Hollenbach K(2001)Using IEEE Data Eng. Bull. 24 28-34
[5]  
Bernstein PA(2005)-grams in a DBMS for approximate string processing J. Web Semant. 3 158-182
[6]  
Chiu D-MW(1993)LUBM: a benchmark for OWL knowledge base systems J. Comput. Syst. Sci. 47 45-76
[7]  
Faloutsos C(1997)B-trees with inserts and deletes: why free-at-empty is better than merge-at-half IEICE Trans. Inf. Syst. E80–D 1-12
[8]  
Christodoulakis S(2008)False drop analysis of set retrieval with signature files Proc. VLDB Endow. 1 647-659
[9]  
Gravano L(2010)RDF-3X: a RISC-style engine for RDF VLDB J. 19 91-113
[10]  
Ipeirotis PG(2010)The RDF-3X engine for scalable management of RDF data Proc. VLDB Endow. 1 256-263