A semantic approach to ETL technologies

被引:32
作者
Bergamaschi, Sonia [1 ]
Guerra, Francesco [1 ]
Orsini, Mirko [1 ]
Sartori, Claudio [2 ]
Vincini, Maurizio [1 ]
机构
[1] Univ Modena & Reggio Emilia, I-41121 Modena, Italy
[2] Univ Bologna, I-40136 Bologna, Italy
关键词
Semantic ETL; Semantic mapping; Cluster analysis; Data analysis; Semantic data integration; Data warehouse; ANNOTATION; VALIDATION; DESIGN;
D O I
10.1016/j.datak.2011.03.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data warehouse architectures rely on extraction, transformation and loading (ETL) processes for the creation of an updated, consistent and materialized view of a set of data sources. In this paper, we support these processes by proposing a tool that: (1) allows the semi-automatic definition of inter-attribute semantic mappings, by identifying the parts of the data source schemas which are related to the data warehouse schema, thus supporting the extraction process; and (2) groups the attribute values semantically related thus defining a transformation function for populating with homogeneous values the data warehouse. Our proposal couples and extends the functionalities of two previously developed systems: the MOMIS integration system and the RELEVANT data analysis system. The system has been experimented within a real scenario concerning the creation of a data warehouse for enterprises working in the beverage and food logistic area. The results showed that the coupled system supports effectively the extraction and transformation processes. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:717 / 731
页数:15
相关论文
共 33 条
[1]  
ALBRECHT A, 2008, MANAGING ETL PROCESS, P12
[2]   Synthesizing, an integrated ontology [J].
Beneventano, D ;
Bergamaschi, S ;
Guerra, F ;
Vincini, M .
IEEE INTERNET COMPUTING, 2003, 7 (05) :42-51
[3]  
BENEVENTANO D, 2007, GETTING THALIA BENCH
[4]  
BERGAMASCHI S, 1997, ICDE, P578
[5]  
Bergamaschi S, 2007, LECT NOTES COMPUT SC, V4805, P27
[6]   Extracting relevant attribute values for improved search [J].
Bergamaschi, Sonia ;
Guerra, Francesco ;
Orsini, Mirko ;
Sartori, Claudio .
IEEE INTERNET COMPUTING, 2007, 11 (05) :26-35
[7]   The Emerging Web of Linked Data [J].
Bizer, Christian .
IEEE INTELLIGENT SYSTEMS, 2009, 24 (05) :87-92
[8]  
Cleuziou G., 2004, P 16 EUR C ART INT E, P440
[9]  
DESSLOCH S, 2008, ORCHID INTEGRATING S, P1307
[10]  
Eckerson W., 2003, Evaluating ETL and data integration platforms