Using Semantic Web Technologies for Exploratory OLAP: A Survey

被引:82
作者
Abello, Alberto [1 ]
Romero, Oscar [1 ]
Pedersen, Torben Bach [2 ]
Berlanga, Rafael [3 ]
Nebot, Victoria [4 ]
Jose Aramburu, Maria [3 ]
Simitsis, Alkis
机构
[1] Polytech Univ Catalonia BarcelonaTech, Barcelona, Spain
[2] Aalborg Univ, Aalborg, Denmark
[3] Univ Jaume 1, Castellon de La Plana, Spain
[4] Univ Jaume 1, TKBG, Castellon de La Plana, Spain
关键词
Business Intelligence; data warehousing; OLAP; ETL; Semantic Web; reasoning; DESCRIPTION LOGICS; CONCEPTUAL DESIGN; DATA WAREHOUSES; ETL PROCESSES; ONTOLOGIES; FRAMEWORK; TIME;
D O I
10.1109/TKDE.2014.2330822
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the convergence of some of the most influential technologies in the last few years, namely data warehousing (DW), on-line analytical processing (OLAP), and the Semantic Web (SW). OLAP is used by enterprises to derive important business-critical knowledge from data inside the company. However, the most interesting OLAP queries can no longer be answered on internal data alone, external data must also be discovered (most often on the web), acquired, integrated, and (analytically) queried, resulting in a new type of OLAP, exploratory OLAP. When using external data, an important issue is knowing the precise semantics of the data. Here, SW technologies come to the rescue, as they allow semantics (ranging from very simple to very complex) to be specified for web-available resources. SW technologies do not only support capturing the "passive" semantics, but also support active inference and reasoning on the data. The paper first presents a characterization of DW/OLAP environments, followed by an introduction to the relevant SW foundation concepts. Then, it describes the relationship of multidimensional (MD) models and SW technologies, including the relationship between MD models and SW formalisms. Next, the paper goes on to survey the use of SW technologies for data modeling and data provisioning, including semantic data annotation and semantic-aware extract, transform, and load (ETL) processes. Finally, all the findings are discussed and a number of directions for future research are outlined, including SW support for intelligent MD querying, using SW technologies for providing context to data warehouses, and scalability issues.
引用
收藏
页码:571 / 588
页数:18
相关论文
共 57 条
[1]   Fusion Cubes: Towards Self-Service Business Intelligence [J].
Abello, Alberto ;
Darmont, Jerome ;
Etcheverry, Lorena ;
Golfarelli, Matteo ;
Mazon, Jose-Norberto ;
Naumann, Felix ;
Pedersen, Torben Bach ;
Rizzi, Stefano ;
Trujillo, Juan ;
Vassiliadis, Panos ;
Vossen, Gottfried .
INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2013, 9 (02) :66-88
[2]   Ontology driven search of compound IDs [J].
Abello, Alberto ;
Romero, Oscar .
KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) :191-216
[3]  
Anderlik Stefan, 2012, Conceptual Modeling. Proceedings 31st International Conference, ER 2012, P88, DOI 10.1007/978-3-642-34002-4_7
[4]  
[Anonymous], 2002, The Description Logic Handbook
[5]  
[Anonymous], 2013, The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling
[6]  
[Anonymous], 2009, Scopus, DOI [DOI 10.1145/1516360.1516362, 10.1145/1516360.1516362]
[7]   The DL-Lite Family and Relations [J].
Artale, Alessandro ;
Calvanese, Diego ;
Kontchakov, Roman ;
Zakharyaschev, Michael .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2009, 36 :1-69
[8]   MIRSOFT: mediator for integrating and reconciling sources using ontological functional dependencies [J].
Bakhtouchi, Abdelghani ;
Bellatreche, Ladjel ;
Jean, Stephane ;
Ait-Ameur, Yamine .
INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2012, 8 (01) :72-110
[9]  
Berlanga R, 2012, BUSINESS INTELLIGENCE APPLICATIONS AND THE WEB: MODELS, SYSTEMS AND TECHNOLOGIES, P310, DOI 10.4018/978-1-61350-038-5.ch014
[10]  
Berthold H., 2010, P EDBT ICDT WORKSH