An empirical evaluation of cost-based federated SPARQL query processing engines

被引:5
作者
Qudus, Umair [1 ]
Saleem, Muhammad [2 ]
Ngomo, Axel-Cyrille Ngonga [3 ]
Lee, Young-Koo [1 ]
机构
[1] Kyung Hee Univ, DKE, Seoul, South Korea
[2] AKSW, Leipzig, Germany
[3] Univ Paderborn, Paderborn, Germany
基金
新加坡国家研究基金会;
关键词
SPARQL; benchmarking; cost-based; cost-free; federated; querying; OPTIMIZATION;
D O I
10.3233/SW-200420
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Finding a good query plan is key to the optimization of query runtime. This holds in particular for cost-based federation engines, which make use of cardinality estimations to achieve this goal. A number of studies compare SPARQL federation engines across different performance metrics, including query runtime, result set completeness and correctness, number of sources selected and number of requests sent. Albeit informative, these metrics are generic and unable to quantify and evaluate the accuracy of the cardinality estimators of cost-based federation engines. To thoroughly evaluate cost-based federation engines, the effect of estimated cardinality errors on the overall query runtime performance must be measured. In this paper, we address this challenge by presenting novel evaluation metrics targeted at a fine-grained benchmarking of cost-based federated SPARQL query engines. We evaluate five cost-based federated SPARQL query engines using existing as well as novel evaluation metrics by using LargeRDFBench queries. Our results provide a detailed analysis of the experimental outcomes that reveal novel insights, useful for the development of future cost-based federated SPARQL query processing engines.
引用
收藏
页码:843 / 868
页数:26
相关论文
共 50 条
[1]   Lusail: A System for Querying Linked Data at Scale [J].
Abdelazizu, Ibrahim ;
Mansouru, Essam ;
Ouzzaniu, Mourad ;
Aboulnagau, Ashraf ;
Kalnisu, Panos .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 11 (04) :485-498
[2]   Diefficiency Metrics: Measuring the Continuous Efficiency of Query Processing Approaches [J].
Acosta, Maribel ;
Vidal, Maria-Esther ;
Sure-Vetter, York .
SEMANTIC WEB - ISWC 2017, PT II, 2017, 10588 :3-19
[3]  
Acosta M, 2011, LECT NOTES COMPUT SC, V7031, P18, DOI 10.1007/978-3-642-25073-6_2
[4]  
Alexander K., 2010, LINK DAT WEB WORKSH, V538
[5]  
[Anonymous], 2012, 1 INT WORKSHOP ONTOL
[6]  
[Anonymous], 2003, Robust Regression and Outlier Detection
[7]  
Bizer C, 2009, INT J SEMANT WEB INF, V5, P1, DOI 10.4018/jswis.2009040101
[8]  
Buil-Aranda C, 2013, LECT NOTES COMPUT SC, V8219, P277, DOI 10.1007/978-3-642-41338-4_18
[9]  
Charalambidis A., 2015, P 11 INT C SEM SYST, P121, DOI [10.1145/2814864.2814886, DOI 10.1145/2814864]
[10]   IGUANA: A Generic Framework for Benchmarking the Read-Write Performance of Triple Stores [J].
Conrads, Felix ;
Lehmann, Jens ;
Saleem, Muhammad ;
Morsey, Mohamed ;
Ngomo, Axel-Cyrille Ngonga .
SEMANTIC WEB - ISWC 2017, PT II, 2017, 10588 :48-65