A fine-grained evaluation of SPARQL endpoint federation systems

被引:37
作者
Saleem, Muhammad [1 ]
Khan, Yasar [2 ]
Hasnain, Ali [2 ]
Ermilov, Ivan [1 ]
Ngomo, Axel-Cyrille Ngonga [1 ]
机构
[1] Univ Leipzig, IFI AKSW, PO 100920, D-04009 Leipzig, Germany
[2] Natl Univ Ireland, Insight Ctr Data Analyt, Galway, Ireland
关键词
SPARQL federation; Web of Data; RDF;
D O I
10.3233/SW-150186
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Web of Data has grown enormously over the last years. Currently, it comprises a large compendium of interlinked and distributed datasets from multiple domains. Running complex queries on this compendium often requires accessing data from different endpoints within one query. The abundance of datasets and the need for running complex query has thus motivated a considerable body of work on SPARQL query federation systems, the dedicated means to access data distributed over the Web of Data. However, the granularity of previous evaluations of such systems has not allowed deriving of insights concerning their behavior in different steps involved during federated query processing. In this work, we perform extensive experiments to compare state-of-the-art SPARQL endpoint federation systems using the comprehensive performance evaluation framework FedBench. In addition to considering the tradition query runtime as an evaluation criterion, we extend the scope of our performance evaluation by considering criteria, which have not been paid much attention to in previous studies. In particular, we consider the number of sources selected, the total number of SPARQL ASK requests used, the completeness of answers as well as the source selection time. Yet, we show that they have a significant impact on the overall query runtime of existing systems. Moreover, we extend FedBench to mirror a highly distributed data environment and assess the behavior of existing systems by using the same performance criteria. As the result we provide a detailed analysis of the experimental outcomes that reveal novel insights for improving current and future SPARQL federation systems.
引用
收藏
页码:493 / 518
页数:26
相关论文
共 38 条
[1]  
Acosta M, 2011, LECT NOTES COMPUT SC, V7031, P18, DOI 10.1007/978-3-642-25073-6_2
[2]  
Akar Z., 2012, CEUR Workshop Proceedings, V937
[3]  
Amorim F., 2013, JOIN REORDERING BUSH
[4]  
Basca C, 2010, 6 INT WORKSH SCAL SE, P64
[5]  
Betz H., 2012, CEUR Workshop Proceedings, V905
[6]   Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[7]  
Görlitz O, 2011, STUD COMPUT INTELL, V331, P109
[8]  
Gorlitz O., 2011, CEUR WORKSHOP P, V782
[9]   LUBM: A benchmark for OWL knowledge base systems [J].
Guo, YB ;
Pan, ZX ;
Heflin, J .
JOURNAL OF WEB SEMANTICS, 2005, 3 (2-3) :158-182
[10]   An Overview on Execution Strategies for Linked Data Queries [J].
Olaf Hartig .
Datenbank-Spektrum, 2013, 13 (2) :89-99