Towards Efficient SPARQL Query Processing on RDF Data

被引:0
作者
刘畅 [1 ]
王昊奋 [1 ]
俞勇 [1 ]
徐林昊 [2 ]
机构
[1] Department of Computer Science and Engineering, Shanghai Jiao Tong University
[2] IBM China Research Lab
关键词
resource description framework (RDF) query engine; SPARQL; optimization;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Efficient support for querying large-scale resource description framework (RDF) triples plays an important role in semantic web data management. This paper presents an efficient RDF query engine to evaluate SPARQL queries, where the inverted index structure is employed for indexing the RDF triples. A set of operators on the inverted index was developed for query optimization and evaluation. Then a main-tree-shaped optimization algorithm was developed that transforms a SPARQL query graph into the op-timal query plan by effectively reducing the search space to determine the optimal joining order. The opti-mization collects a set of RDF statistics for estimating the execution cost of the query plan. Finally the opti-mal query plan is evaluated using the defined operators for answering the given SPARQL query. Extensive tests were conducted on both synthetic and real datasets containing up to 100 million triples to evaluate this approach with the results showing that this approach can answer most queries within 1 s and is extremely efficient and scalable in comparison with previous best state-of-the-art RDF stores.
引用
收藏
页码:613 / 622
页数:10
相关论文
共 50 条
  • [31] Keyword search over schema-less RDF datasets by SPARQL query compilation
    Izquierdo, Yenier T.
    Garcia, Grettel M.
    Menendez, Elisa
    Leme, Luiz Andre P. P.
    Neves, Angelo
    Lemos, Melissa
    Finamore, Anna Carolina
    Oliveira, Carlos
    Casanova, Marco A.
    [J]. INFORMATION SYSTEMS, 2021, 102
  • [32] Quality-Driven Query Processing over Federated RDF Data Sources
    Heling, Lars
    [J]. SEMANTIC WEB: ESWC 2019 SATELLITE EVENTS, 2019, 11762 : 209 - 219
  • [33] Combining Vertex-Centric Graph Processing with SPARQL for Large-Scale RDF Data Analytics
    Abdelaziz, Ibrahim
    Harbi, Razen
    Salihoglu, Semih
    Kalnis, Panos
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (12) : 3374 - 3388
  • [34] Parallel Processing SPARQL Theta Join on Large Scale RDF Graphs
    Wang, Tao
    Yuan, Pingpeng
    Liao, Xiaofei
    Jin, Hai
    [J]. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018,
  • [35] VEDAS: an efficient GPU alternative for store and query of large RDF data sets
    Makpaisit, Pisit
    Chantrapornchai, Chantana
    [J]. JOURNAL OF BIG DATA, 2021, 8 (01)
  • [36] VEDAS: an efficient GPU alternative for store and query of large RDF data sets
    Pisit Makpaisit
    Chantana Chantrapornchai
    [J]. Journal of Big Data, 8
  • [37] Towards the Novel Reasoning among Particles in PSO by the Use of RDF and SPARQL
    Fister, Iztok, Jr.
    Yang, Xin-She
    Ljubic, Karin
    Fister, Dusan
    Brest, Janez
    Fister, Iztok
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [38] Efficient RDF querying based query translation
    Tong, Qiang
    Cheng, Jing-Wei
    Zhang, Fu
    Zhang, Li-Li
    Ma, Zong-Min
    [J]. Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2015, 45 (05): : 1550 - 1558
  • [39] An empirical evaluation of cost-based federated SPARQL query processing engines
    Qudus, Umair
    Saleem, Muhammad
    Ngomo, Axel-Cyrille Ngonga
    Lee, Young-Koo
    [J]. SEMANTIC WEB, 2021, 12 (06) : 843 - 868
  • [40] SShare: a simulator for studying and evaluating decentralized SPARQL query processing
    Jing Zhou
    Qi Huang
    Weifeng Xie
    Zhiguo Qu
    [J]. Personal and Ubiquitous Computing, 2015, 19 : 1087 - 1097