A Scalable Approach for Distributed Reasoning over Large-scale OWL Datasets

被引:3
作者
Mohamed, Heba [1 ,2 ]
Fathalla, Said [1 ,2 ]
Lehmann, Jens [1 ,3 ]
Jabeen, Hajira [4 ]
机构
[1] Univ Bonn, Smart Data Analyt SDA, Bonn, Germany
[2] Univ Alexandria, Fac Sci, Alexandria, Egypt
[3] Fraunhofer IAIS, Dresden Lab, NetMedia Dept, St Augustin, Germany
[4] Univ Cologne, Cluster Excellence Plant Sci CEPLAS, Cologne, Germany
来源
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KEOD), VOL 2 | 2021年
关键词
Big Data; Distributed Computing; In-Memory Computation; Parallel Reasoning; OWL Horst Rules; OWL Axioms; WEBPIE;
D O I
10.5220/0010656800003064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the tremendous increase in the volume of semantic data on the Web, reasoning over such an amount of data has become a challenging task. On the other hand, the traditional centralized approaches are no longer feasible for large-scale data due to the limitations of software and hardware resources. Therefore, horizontal scalability is desirable. We develop a scalable distributed approach for RDFS and OWL Horst Reasoning over large-scale OWL datasets. The eminent feature of our approach is that it combines an optimized execution strategy, pre-shuffling method, and duplication elimination strategy. thus achieving an efficient distributed reasoning mechanism. We implemented our approach as open-source in Apache Spark using Resilient Distributed Datasets (RDD) as a parallel programming model. As a use case, our approach is used by the SANSA framework for large-scale semantic reasoning over OWL datasets. The evaluation results have shown the strength of the proposed approach for both data and node scalability.
引用
收藏
页码:51 / 60
页数:10
相关论文
共 14 条
  • [1] Al-Ajlan Ajlan, 2015, International Journal of Machine Learning and Computing, V5, P106, DOI 10.7763/IJMLC.2015.V5.492
  • [2] Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark
    Gu, Rong
    Wang, Shanyong
    Wang, Fangfang
    Yuan, Chunfeng
    Huang, Yihua
    [J]. 2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2015, : 700 - 709
  • [3] LUBM: A benchmark for OWL knowledge base systems
    Guo, YB
    Pan, ZX
    Heflin, J
    [J]. JOURNAL OF WEB SEMANTICS, 2005, 3 (2-3): : 158 - 182
  • [4] Hayes P., 2004, RDF semantics
  • [5] Heino Norman, 2012, The Semantic Web. 11th International Semantic Web Conference (ISWC 2012). Proceedings, P133, DOI 10.1007/978-3-642-35176-1_9
  • [6] Kim JM, 2015, INT CONF BIG DATA, P79, DOI 10.1109/35021BIGCOMP.2015.7072815
  • [7] Liu H, 2017, PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), P1, DOI [10.1109/INTMAG.2017.8007847, 10.1109/ITNEC.2017.8284747]
  • [8] Mohamed H., 2020, IN PRESS
  • [9] Completeness, decidability and complexity of entailment for RDF Schema and a semantic extension involving the OWL vocabulary
    ter Horst, HJ
    [J]. JOURNAL OF WEB SEMANTICS, 2005, 3 (2-3): : 79 - 115
  • [10] Tilotma S., 2012, International Journal of Emerging Technology and Advanced Engineering, V2, P271