Data Integration Patterns for Data Warehouse Automation

被引:5
作者
Tomingas, Kalle [1 ]
Kliimask, Margus [2 ]
Tammet, Tanel [1 ]
机构
[1] Tallinn Univ Technol, EE-19086 Tallinn, Estonia
[2] Eliko Competence Ctr, EE-12618 Tallinn, Estonia
来源
NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS II | 2015年 / 312卷
关键词
data warehouse; etl; data mappings; template based sql generation; abstract syntax patterns; metadata management;
D O I
10.1007/978-3-319-10518-5_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper presents a mapping-based and metadata-driven modular data transformation framework designed to solve extract-transform-load (ETL) automation, impact analysis, data quality and integration problems in data warehouse environments. We introduce a declarative mapping formalization technique, an abstract expression pattern concept and a related template engine technology for flexible ETL code generation and execution. The feasibility and efficiency of the approach is demonstrated on the pattern detection and data lineage analysis case studies using large real life SQL corpuses.
引用
收藏
页码:41 / 55
页数:15
相关论文
共 18 条
  • [1] [Anonymous], 11179 ISOIEC
  • [2] Behrend A., 2010, OPTIMIZED INCREMENTA
  • [3] Boehm M., 2009, GCIP EXPLOITING GENE
  • [4] Bohm M., 2008, ICEIS
  • [5] Dessloch S., 2008, IEEE 24 INT C DAT EN
  • [6] GRAnD: A goal-oriented approach to requirement analysis in data warehouses
    Giorgini, Paolo
    Rizzi, Stefano
    Garzetti, Maddalena
    [J]. DECISION SUPPORT SYSTEMS, 2008, 45 (01) : 4 - 21
  • [7] Haas L.M., 2005, P ACM SIGMOD INT C M, P805
  • [8] JUN T, 2009, FITA 2009, P620, DOI DOI 10.1109/IFITA.2009.48
  • [9] Papastefanatos G, 2010, LECT NOTES COMPUT SC, V5968, P55
  • [10] Patil P.S., 2011, DATA INTEGRATION PRO