Accelerating Partial Evaluation in Distributed SPARQL Query Evaluation

被引:16
作者
Peng, Peng [1 ]
Zou, Lei [2 ,3 ,4 ]
Guan, Runyu [1 ]
机构
[1] Hunan Univ, Changsha, Hunan, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
[3] Beijing Inst Big Data Res, Beijing, Peoples R China
[4] Natl Engn Lab Big Data Anal Technol & Applicat PK, Beijing, Peoples R China
来源
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019) | 2019年
关键词
KNOWLEDGE-BASE; RDF;
D O I
10.1109/ICDE.2019.00019
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partial evaluation has recently been used for processing SPARQL queries over a large resource description framework (RDF) graph in a distributed environment. However, the previous approach is inefficient when dealing with complex queries. In this study, we further improve the "partial evaluation and assembly" framework for answering SPARQL queries over a distributed RDF graph, while providing performance guarantees. Our key idea is to explore the intrinsic structural characteristics of partial matches to filter out irrelevant partial results while providing performance guarantees on the data shipment and the response time. We also propose an efficient assembly algorithm to utilize the characteristics of partial matches to merge them and form final results. To improve the efficiency of finding partial matches further, we propose an optimization that communicates variables' candidates among sites to avoid redundant computations. In addition, although our approach is partitioning-tolerant, different partitioning strategies result in different performances, and we evaluate different partitioning strategies for our approach. Experiments over both real and synthetic RDF datasets confirm the superiority of our approach.
引用
收藏
页码:112 / 123
页数:12
相关论文
共 24 条
[1]   A Survey and Experimental Comparison of Distributed SPARQL Engines for Very Large RDF Data [J].
Abdelaziz, Ibrahim ;
Harbi, Razen ;
Khayyat, Zuhair ;
Kalnis, Panos .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (13) :2049-2060
[2]   Distributed Graph Simulation: Impossibility and Possibility [J].
Fan, Wenfei ;
Wang, Xin ;
Wu, Yinghui ;
Deng, Dong .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (12) :1083-1094
[3]   Performance Guarantees for Distributed Reachability Queries [J].
Fan, Wenfei ;
Wang, Xin ;
Wul, Yinghui .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11) :1304-1315
[4]  
Goasdoué F, 2015, PROC INT CONF DATA, P771, DOI 10.1109/ICDE.2015.7113332
[5]   LUBM: A benchmark for OWL knowledge base systems [J].
Guo, YB ;
Pan, ZX ;
Heflin, J .
JOURNAL OF WEB SEMANTICS, 2005, 3 (2-3) :158-182
[6]   Distributed Set Reachability [J].
Gurajada, Sairam ;
Theobald, Martin .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :1247-1261
[7]   DREAM: Distributed RDF Engine with Adaptive Query Planner and Minimal Communication [J].
Hammoud, Mohammad ;
Rabbou, Dania Abed ;
Nouri, Reza ;
Beheshti, Seyed-Mehdi-Reza ;
Sakr, Sherif .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2015, 8 (06) :654-665
[8]  
Harbi R, 2015, PROC VLDB ENDOW, V8, P1848
[9]   Accelerating SPARQL queries by exploiting hash-based locality and adaptive partitioning [J].
Harbi, Razen ;
Abdelaziz, Ibrahim ;
Kalnis, Panos ;
Mamoulis, Nikos ;
Ebrahim, Yasser ;
Sahli, Majed .
VLDB JOURNAL, 2016, 25 (03) :355-380
[10]   Stylus: A Strongly-Typed Store for Serving Massive RDF Data [J].
He, Liang ;
Shao, Bin ;
Li, Yatao ;
Xia, Huanhuan ;
Xiao, Yanghua ;
Chen, Enhong ;
Chen, Liang Jeff .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (02) :203-216