gStore: a graph-based SPARQL query engine

被引:114
作者
Zou, Lei [1 ]
Oezsu, M. Tamer [2 ]
Chen, Lei [3 ]
Shen, Xuchuan [1 ]
Huang, Ruizhe [1 ]
Zhao, Dongyan [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
[2] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
[3] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
RDF; SPARQL; Graph database; Graph matching; Aggregate query; RDF;
D O I
10.1007/s00778-013-0337-7
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We address efficient processing of SPARQL queries over RDF datasets. The proposed techniques, incorporated into the gStore system, handle, in a uniform and scalable manner, SPARQL queries with wildcards and aggregate operators over dynamic RDF datasets. Our approach is graph based. We store RDF data as a large graph and also represent a SPARQL query as a query graph. Thus, the query answering problem is converted into a subgraph matching problem. To achieve efficient and scalable query processing, we develop an index, together with effective pruning rules and efficient search algorithms. We propose techniques that use this infrastructure to answer aggregation queries. We also propose an effective maintenance algorithm to handle online updates over RDF repositories. Extensive experiments confirm the efficiency and effectiveness of our solutions.
引用
收藏
页码:565 / 590
页数:26
相关论文
共 34 条
[1]   SW-Store: a vertically partitioned DBMS for Semantic Web data management [J].
Abadi, Daniel J. ;
Marcus, Adam ;
Madden, Samuel R. ;
Hollenbach, Kate .
VLDB JOURNAL, 2009, 18 (02) :385-406
[2]  
Abadi Daniel J., 2007, Proceedings of the 33rd International Conference on Very Large Data Bases, VLDB'07, P411
[3]  
Atre Medha., 2010, WWW, P41, DOI DOI 10.1145/1772690.1772696
[4]   USING SEMI-JOINS TO SOLVE RELATIONAL QUERIES [J].
BERNSTEIN, PA ;
CHIU, DMW .
JOURNAL OF THE ACM, 1981, 28 (01) :25-40
[5]   Storing RDF as a graph [J].
Bönström, V ;
Hinze, A ;
Schweppe, H .
FIRST LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2003, :27-36
[6]  
Broekstra J, 2002, LECT NOTES COMPUT SC, V2342, P54
[7]  
Cormen T., 2001, Introduction to Algorithms
[8]  
Deppisch U., 1986, P 9 ANN INT ACM SIGI, P77, DOI DOI 10.1145/253168.253189
[9]   SIGNATURE FILES - AN ACCESS METHOD FOR DOCUMENTS AND ITS ANALYTICAL PERFORMANCE EVALUATION [J].
FALOUTSOS, C ;
CHRISTODOULAKIS, S .
ACM TRANSACTIONS ON OFFICE INFORMATION SYSTEMS, 1984, 2 (04) :267-288
[10]  
Gravano L., 2001, IEEE DATA ENG, V24, P28