Towards Efficient SPARQL Query Processing on RDF Data

被引:0
作者
刘畅 [1 ]
王昊奋 [1 ]
俞勇 [1 ]
徐林昊 [2 ]
机构
[1] Department of Computer Science and Engineering, Shanghai Jiao Tong University
[2] IBM China Research Lab
关键词
resource description framework (RDF) query engine; SPARQL; optimization;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Efficient support for querying large-scale resource description framework (RDF) triples plays an important role in semantic web data management. This paper presents an efficient RDF query engine to evaluate SPARQL queries, where the inverted index structure is employed for indexing the RDF triples. A set of operators on the inverted index was developed for query optimization and evaluation. Then a main-tree-shaped optimization algorithm was developed that transforms a SPARQL query graph into the op-timal query plan by effectively reducing the search space to determine the optimal joining order. The opti-mization collects a set of RDF statistics for estimating the execution cost of the query plan. Finally the opti-mal query plan is evaluated using the defined operators for answering the given SPARQL query. Extensive tests were conducted on both synthetic and real datasets containing up to 100 million triples to evaluate this approach with the results showing that this approach can answer most queries within 1 s and is extremely efficient and scalable in comparison with previous best state-of-the-art RDF stores.
引用
收藏
页码:613 / 622
页数:10
相关论文
共 50 条
[21]   Simulation and Evaluation of Decentralized SPARQL Query Processing [J].
Zhou, Jing ;
Huang, Qi ;
Yan, Wei .
2014 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI 2014), 2014, :39-44
[22]   An Efficient Distributed SPARQL Query Processing Scheme Considering Communication Costs in Spark Environments [J].
Lim, Jongtae ;
Kim, Byounghoon ;
Lee, Hyeonbyeong ;
Choi, Dojin ;
Bok, Kyoungsoo ;
Yoo, Jaesoo .
APPLIED SCIENCES-BASEL, 2022, 12 (01)
[23]   Mapping Spatiotemporal Data to RDF: A SPARQL Endpoint for Brussels [J].
Vaisman, Alejandro ;
Chentout, Kevin .
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2019, 8 (08)
[24]   Towards Empty Answers in SPARQL: Approximating Querying with RDF Embedding [J].
Wang, Meng ;
Wang, Ruijie ;
Liu, Jun ;
Chen, Yihe ;
Zhang, Lei ;
Qi, Guilin .
SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 :513-529
[25]   Querying RDF and OWL Data Source using SPARQL [J].
Kumar, Naveen ;
Kumar, Suresh .
2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
[26]   Adaptive mechanism for distributed query processing and data loading using the RDF data in the cloud [J].
Dharmaraj, Chandrasekaran Ranichandra ;
Tripathy, BalaKrushna .
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2018, 31 (15)
[27]   Leon: A Distributed RDF Engine for Multi-query Processing [J].
Guo, Xintong ;
Gao, Hong ;
Zou, Zhaonian .
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 :742-759
[28]   Graysearch: Transforming SPARQL to query humanities data [J].
Schweizer, Tobias ;
Geer, Benjamin .
SEMANTIC WEB, 2021, 12 (02) :379-400
[29]   Keyword search over schema-less RDF datasets by SPARQL query compilation [J].
Izquierdo, Yenier T. ;
Garcia, Grettel M. ;
Menendez, Elisa ;
Leme, Luiz Andre P. P. ;
Neves, Angelo ;
Lemos, Melissa ;
Finamore, Anna Carolina ;
Oliveira, Carlos ;
Casanova, Marco A. .
INFORMATION SYSTEMS, 2021, 102
[30]   A Cache-based Method for SPARQL Query Processing [J].
Saeedi, Alieh ;
Naghibzadeh, Mahmoud .
2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, :292-296