The contribution of linked open data to augment a traditional data warehouse

被引:8
作者
Berkani, Nabila [1 ]
Bellatreche, Ladjel [2 ]
Khouri, Selma [1 ]
Ordonez, Carlos [3 ]
机构
[1] Ecole Natl Super Informat, Lab Commun Syst Informat, BP 68M, Algiers 16309, Algeria
[2] Poitiers Univ, ISAE ENSMA, LIAS, Poitiers, France
[3] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
关键词
Linked open data; Traditional DW augmentation; Value; METRICS; ONTOLOGY; QUALITY; DESIGN; MODEL; ETL;
D O I
10.1007/s10844-020-00594-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The arrival of Big Data has contributed positively to the evolution of the data warehouse (DW ) technology. This gives birth of augmented DW s that aim at maximizing the effectiveness of existing ones. Various augmentation scenarios have been proposed and adopted by firms and industry covering several aspects such as new data sources (e.g., Linked Open Data (LOD), social, stream and IoT data), data ingestion, advanced deployment infrastructures, programming paradigms, data visualization. These scenarios allow companies reaching valuable decisions. By examining traditional DW s, we realized that they do not fulfill all decision-maker requirements since data sources alimenting a target DW are not rich enough to capture Big Data. The arrival of LOD era is an excellent opportunity to enrich traditional DW s with a new V dimension: Value. In this paper, we first conceptualize the variety of internal and external sources and study its effect on the ETL phase to ease the value capturing. Secondly, a Value-driven approach for the DW design is discussed. Thirdly, three realistic scenarios for integrating LOD in the DW landscape are given. Finally, experiments are conducted showing the added value by augmenting the existing DW environment with LOD.
引用
收藏
页码:397 / 421
页数:25
相关论文
共 40 条
[1]   Using Semantic Web Technologies for Exploratory OLAP: A Survey [J].
Abello, Alberto ;
Romero, Oscar ;
Pedersen, Torben Bach ;
Berlanga, Rafael ;
Nebot, Victoria ;
Jose Aramburu, Maria ;
Simitsis, Alkis .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (02) :571-588
[2]   Towards an Ontology of Value Ascription [J].
Andersson, Birger ;
Guarino, Nicola ;
Johannesson, Paul ;
Livieri, Barbara .
FORMAL ONTOLOGY IN INFORMATION SYSTEMS, 2016, 283 :331-344
[3]   QETL: An approach to on-demand ETL from non-owned data sources [J].
Baldacci, Lorenzo ;
Golfarelli, Matteo ;
Graziani, Simone ;
Rizzi, Stefano .
DATA & KNOWLEDGE ENGINEERING, 2017, 112 :17-37
[4]   Enhancing data quality in data warehouse environments [J].
Ballou, DP ;
Tayi, GK .
COMMUNICATIONS OF THE ACM, 1999, 42 (01) :73-78
[5]   DataSynapse: A Social Data Curation Foundry [J].
Beheshti, Amin ;
Benatallah, Boualem ;
Tabebordbar, Alireza ;
Motahari-Nezhad, Hamid Reza ;
Barukh, Moshe Chai ;
Nouri, Reza .
DISTRIBUTED AND PARALLEL DATABASES, 2019, 37 (03) :351-384
[6]   CoreKG: a Knowledge Lake Service [J].
Beheshti, Amin ;
Benatallah, Boualem ;
Nouri, Reza ;
Tabebordbar, Alireza .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (12) :1942-1945
[7]  
Berkani N., 2019, DOLAP
[8]   A Variety-Sensitive ETL Processes [J].
Berkani, Nabila ;
Bellatreche, Ladjel .
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II, 2017, 10439 :201-216
[9]   A Value-Added Approach to Design BI Applications [J].
Berkani, Nabila ;
Bellatreche, Ladjel ;
Benatallah, Boualem .
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2016, 2016, 9829 :361-375
[10]  
Berro Alain, 2015, 17th International Conference on Enterprise Information Systems (ICEIS 2015). Proceedings, P271