Processing SPARQL queries over distributed RDF graphs

被引:1
作者
Peng Peng
Lei Zou
M. Tamer Özsu
Lei Chen
Dongyan Zhao
机构
[1] Peking University,Institute of Computer Science and Technology
[2] University of Waterloo,David R. Cheriton School of Computer Science
[3] Hong Kong University of Science and Technology,Department of Computer Science and Engineering
来源
The VLDB Journal | 2016年 / 25卷
关键词
RDF; SPARQL; RDF graph; Distributed queries;
D O I
暂无
中图分类号
学科分类号
摘要
We propose techniques for processing SPARQL queries over a large RDF graph in a distributed environment. We adopt a “partial evaluation and assembly” framework. Answering a SPARQL query Q is equivalent to finding subgraph matches of the query graph Q over RDF graph G. Based on properties of subgraph matching over a distributed graph, we introduce local partial match as partial answers in each fragment of RDF graph G. For assembly, we propose two methods: centralized and distributed assembly. We analyze our algorithms from both theoretically and experimentally. Extensive experiments over both real and benchmark RDF repositories of billions of triples confirm that our method is superior to the state-of-the-art methods in both the system’s performance and scalability.
引用
收藏
页码:243 / 268
页数:25
相关论文
共 86 条
[1]  
Abadi DJ(2009)SW-store: a vertically partitioned DBMS for semantic web data management VLDB J. 18 385-406
[2]  
Marcus A(1976)System R: relational approach to database management ACM Trans. Database Syst. 1 97-137
[3]  
Madden S(2012)Partial evaluation for distributed XPath query processing and beyond ACM Trans. Database Syst. 37 32-570
[4]  
Hollenbach K(1999)The parametrized complexity of some fundamental problems in coding theory SIAM J. Comput. 29 545-289
[5]  
Astrahan MM(2000)The complexity of counting graph homomorphisms Random Struct. Algorithms 17 260-275
[6]  
Blasgen HW(2010)Graph pattern matching: from intractable to polynomial time Proc. VLDB Endow. 3 264-1315
[7]  
Chamberlin DD(2012)Performance guarantees for distributed reachability queries Proc. VLDB Endow. 5 1304-1094
[8]  
Eswaran KP(2014)Distributed graph simulation: impossibility and possibility Proc. VLDB Endow. 7 1083-182
[9]  
Gray JN(2005)LUBM: a benchmark for OWL knowledge base systems J. Web Semant. 3 158-1134
[10]  
Griffiths PP(2011)Scalable SPARQL querying of large RDF graphs Proc. VLDB Endow. 4 1123-1327