Towards Efficient SPARQL Query Processing on RDF Data

被引:0
作者
刘畅 [1 ]
王昊奋 [1 ]
俞勇 [1 ]
徐林昊 [2 ]
机构
[1] Department of Computer Science and Engineering, Shanghai Jiao Tong University
[2] IBM China Research Lab
关键词
resource description framework (RDF) query engine; SPARQL; optimization;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Efficient support for querying large-scale resource description framework (RDF) triples plays an important role in semantic web data management. This paper presents an efficient RDF query engine to evaluate SPARQL queries, where the inverted index structure is employed for indexing the RDF triples. A set of operators on the inverted index was developed for query optimization and evaluation. Then a main-tree-shaped optimization algorithm was developed that transforms a SPARQL query graph into the op-timal query plan by effectively reducing the search space to determine the optimal joining order. The opti-mization collects a set of RDF statistics for estimating the execution cost of the query plan. Finally the opti-mal query plan is evaluated using the defined operators for answering the given SPARQL query. Extensive tests were conducted on both synthetic and real datasets containing up to 100 million triples to evaluate this approach with the results showing that this approach can answer most queries within 1 s and is extremely efficient and scalable in comparison with previous best state-of-the-art RDF stores.
引用
收藏
页码:613 / 622
页数:10
相关论文
共 50 条
[41]   SShare: a simulator for studying and evaluating decentralized SPARQL query processing [J].
Zhou, Jing ;
Huang, Qi ;
Xie, Weifeng ;
Qu, Zhiguo .
PERSONAL AND UBIQUITOUS COMPUTING, 2015, 19 (07) :1087-1097
[42]   SPARQL Query-Builder for Medical Temporal Data [J].
Vcelak, Petr ;
Kryl, Martin ;
Kleckova, Jana .
2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
[43]   CORNER: A Completeness Reasoner for SPARQL Queries Over RDF Data Sources [J].
Darari, Fariz ;
Prasojo, Radityo Eko ;
Nutt, Werner .
SEMANTIC WEB: ESWC 2014 SATELLITE EVENTS, 2014, 8798 :310-314
[44]   Comparison and Analysis of RDF Data Using SPARQL, HIVE, PIG in Hadoop [J].
Chandel, Anshul ;
Garg, Deepak .
COMPUTING AND NETWORK SUSTAINABILITY, 2017, 12 :361-369
[45]   Towards an automatic SPARQL query generation from ontology competency questions [J].
Benhocine K. ;
Hansali A. ;
Zemmouchi-Ghomari L. ;
Ghomari A.R. .
International Journal of Computers and Applications, 2022, 44 (10) :971-980
[46]   Querying incomplete information in RDF with SPARQL [J].
Nikolaou, Charalampos ;
Koubarakis, Manolis .
ARTIFICIAL INTELLIGENCE, 2016, 237 :138-171
[47]   TOWARDS AN EFFICIENT RDF DATASET SLICING [J].
Marx, Edgard ;
Soru, Tommaso ;
Shekarpour, Saeedeh ;
Auer, Soren ;
Ngomo, Axel-Cyrille Ngonga ;
Breitman, Karin .
INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2013, 7 (04) :455-477
[48]   HyPSo: Hybrid Partitioning for Big RDF Storage and Query Processing [J].
Chawla, Tanvi ;
Singh, Girdhari ;
Pilli, Emmanuel S. .
PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, :188-194
[49]   MDX2SPARQL: Semantic query mapping of OLAP query language to SPARQL [J].
Boumhidi, Haytem ;
Nfaoui, El Habib ;
Oubenaalla, Younes .
2018 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV2018), 2018,
[50]   Query Optimization for massive RDF data based on Spark [J].
Li, Shaohui ;
Shen, Derong ;
Kou, Yue ;
Yang, Dan .
2018 4TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM 2018), 2018, :219-224