Using superimposed multidimensional schemas and OLAP patterns for RDF data analysis

被引:4
作者
Hilal, Median [1 ]
Schuetz, Christoph G. [1 ]
Schrefl, Michael [1 ]
机构
[1] Johannes Kepler Univ Linz, Dept Business Informat Data & Knowledge Engn, A-4040 Linz, Austria
来源
OPEN COMPUTER SCIENCE | 2018年 / 8卷 / 01期
关键词
Linked Open Data; Self-Service Business Intelligence; Multidimensional Modeling;
D O I
10.1515/comp-2018-0003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The foundations for traditional data analysis are Online Analytical Processing (OLAP) systems that operate on multidimensional (MD) data. The Resource Description Framework (RDF) serves as the foundation for the publication of a growing amount of semantic web data still largely untapped by companies for data analysis. Most RDF data sources, however, do not correspond to the MD modeling paradigm and, as a consequence, elude traditional OLAP. The complexity of RDF data in terms of structure, semantics, and query languages renders RDF data analysis challenging for a typical analyst not familiar with the underlying data model or the SPARQL query language. Hence, conducting RDF data analysis is not a straightforward task. We propose an approach for the definition of superimposed MD schemas over arbitrary RDF datasets and show how to represent the superimposed MD schemas using well-known semantic web technologies. On top of that, we introduce OLAP patterns for RDF data analysis, which are recurring, domain-independent elements of data analysis. Analysts may compose queries by instantiating a pattern using only the MD concepts and business terms. Upon pattern instantiation, the corresponding SPARQL query over the source data can be automatically generated, sparing analysts from technical details and fostering self-service capabilities.
引用
收藏
页码:18 / 37
页数:20
相关论文
共 26 条
  • [1] Self-Service Business Intelligence
    Alpar, Paul
    Schulz, Michael
    [J]. BUSINESS & INFORMATION SYSTEMS ENGINEERING, 2016, 58 (02) : 151 - 155
  • [2] Barisic A, 2014, SOFTWARE DESIGN DEV, P2120, DOI 10.4018/978-1-4666-4301-7.ch098
  • [3] Linked Data - The Story So Far
    Bizer, Christian
    Heath, Tom
    Berners-Lee, Tim
    [J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) : 1 - 22
  • [4] RDF Analytics: Lenses over Semantic Graphs
    Colazzo, Dario
    Goasdoue, Francois
    Manolescu, Ioana
    Roatis, Alexandra
    [J]. WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 467 - 477
  • [5] Erxleben F, 2014, LECT NOTES COMPUT SC, V8796, P50, DOI 10.1007/978-3-319-11964-9_4
  • [6] Etcheverry L., 2015, ABS151206080 CORR
  • [7] Etcheverry L, 2017, J DATA SEMANT, V6, P199, DOI 10.1007/s13740-017-0082-y
  • [8] Etcheverry Lorena, 2012, P COLD 2012 CERS WS, P905
  • [9] The dimensional fact model: A conceptual model for data warehouses
    Golfarelli, M
    Maio, D
    Rizzi, S
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 1998, 7 (2-3) : 215 - 247
  • [10] Hilal M., 2016, LNCS, P245