A Novel Agent-based Parallel ETL System for Massive Data

被引:0
作者
Chen, Gang [1 ]
An, Baoran [1 ]
Liu, Yan [2 ]
机构
[1] China Acad Engn Phys, Inst Comp Applicat, Mianyang 621900, Peoples R China
[2] China Acad Engn Phys, Grad Coll, Mianyang 621900, Peoples R China
来源
PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC) | 2016年
关键词
Parallel ETL; Massive Data; Multi-Agent System; Data Warehouse;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Business Intelligence (BI) systems are crucial for enterprise improvement. They consolidate heterogeneous data from distributed sources and input high-quality data into strategic indicators. An essential component of the data consolidation is Extraction, Transformation and Loading (ETL) which are responsible for extracting data from heterogeneous sources, transforming, restructuring and integrating them into homogenous data warehouse. Due to the deficiency of traditional ETL, the entire ETL component for massive data has decreased performance. Aiming at this challenge, we propose a novel workflow framework for parallel ETL execution based on multi-agent system. The purpose of the system is to utilize a parallel strategy to improve the efficiency of ETL process. Through research, we find that some ETL activities are often executed on the same priority or using the same input data. Based on this discovery, this paper presents a parallel ETL framework based on agent theory and multi-thread techniques. The experimental results show that the proposed approach can greatly improve the efficiency of ETL process.
引用
收藏
页码:3942 / 3948
页数:7
相关论文
共 19 条
  • [1] Bentayeb F., 2009, PERSONNALISATION DAT, P7
  • [2] Boussaid O., 2003, P 10 ISPE INT C CONC, P49
  • [3] Chaudhuri S., 1997, SIGMOD Record, V26, P65, DOI 10.1145/248603.248616
  • [4] Earls A. R., 2012, STATE ETL EXTRACT TR
  • [5] A proposed model for data warehouse ETL processes
    El-Sappagh, Shaker H. Ali
    Hendawi, Abdeltawab M. Ahmed
    El Bastawissy, Ali Hamed
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2011, 23 (02) : 91 - 104
  • [6] The use of Carin language and algorithms for information integration:: The Picsel system
    Goasdoué, F
    Lattès, V
    Rousset, MC
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2000, 9 (04) : 383 - 401
  • [7] Inmon B., 1996, BUILDING DATA WAREHO, P401
  • [8] Kakish K., 2012, PROC C INF SYST APPL, P1
  • [9] Kimbal R., 1996, DATA WAREHOUSE TOOLK
  • [10] Kimball R., DATA WAREHOUSE ETL T