A Variety-Sensitive ETL Processes

被引:2
作者
Berkani, Nabila [1 ]
Bellatreche, Ladjel [2 ]
机构
[1] Ecole Natl Super Informat ESI, Algiers, Algeria
[2] Poitiers Univ, ENSMA, ISAE, LIAS, Poitiers, France
来源
DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2017, PT II | 2017年 / 10439卷
关键词
DESIGN; WAREHOUSES;
D O I
10.1007/978-3-319-64471-4_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, small, medium and large companies need advanced data integration techniques supported by tools to analyse data in order to deliver real-time alerts and trigger automated actions, etc. In the context of rapidly technology changing, these techniques have to consider two main issues: (a) the variety of the huge amount of data sources (ex. traditional, semantic, and graph databases) and (b) the variety of storage platforms, where a data integration system may have several stores, where one hosts a particular type. These issues directly impact the efficiency and the deployment flexibility of ETL (Extract, Transform, Load). In this paper, we consider these issues. Firstly, thanks to Model Driven Engineering, we make generic different types of data sources. This genericity allows overloading the ETL operators. To show the benefit of this genericity, several examples of instantiation are described covering relational, semantic and graph databases. Secondly, a Web-service-driven approach for orchestrating the ETL flows is given. Thirdly, we present a fusion procedure that merges the set of heterogeneous instances and deployed according their favorite stores. Finally, our finding is validated through a proof of concept tool using the LUBM benchmark and YAGO KB and deployed in Oracle RDF Semantic Graph 12c.
引用
收藏
页码:201 / 216
页数:16
相关论文
共 28 条
[1]  
[Anonymous], JISBD
[2]  
[Anonymous], 2003, DESCRIPTION LOGIC HD
[3]  
[Anonymous], 2002, P ACM SIGACT SIGMOD, DOI DOI 10.1145/543613.543644
[4]   Towards a conceptualization of ETL and physical storage of semantic data warehouses as a service [J].
Berkani, Nabila ;
Bellatreche, Ladjel ;
Khouri, Selma .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (04) :915-931
[5]  
Calvanese D, 1998, SPRING INT SER ENG C, P229
[6]  
Casati F., 2007, Proceedings of the 33rd international conference on Very large data bases, VLDB '07, P1128
[7]  
Craig I., 2002, INTERPRETATION OBJEC, DOI [10.1007/978-1-4471-0199-4, DOI 10.1007/978-1-4471-0199-4]
[8]  
Dong X.L., 2013, PVLDB, V6, P118
[9]  
El Akkaoui Zineb, 2012, Data Warehousing and Knowledge Discovery. Proceedings of the 14th International Conference, DaWaK 2012, P1, DOI 10.1007/978-3-642-32584-7_1
[10]  
Jean S, 2013, LECT NOTES COMPUT SC, V8217, P499, DOI 10.1007/978-3-642-41924-9_44