A Systematic Literature Review on Big Data Extraction, Transformation and Loading (ETL)

被引:8
作者
Nwokeji, Joshua C. [1 ]
Matovu, Richard [1 ]
机构
[1] Gannon Univ, Erie, PA 16541 USA
来源
INTELLIGENT COMPUTING, VOL 2 | 2021年 / 284卷
关键词
ETL; Big data; Big data ETL; Systematic literature review;
D O I
10.1007/978-3-030-80126-7_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data analytics plays a vital role in contemporary organizations, through analytics, organizations are able to derive knowledge and intelligence from data to support strategic decisions. An important step in data analytics is data integration, during which historic data is gathered from various sources and integrated into a centralized repository called data warehouse. Although there are various approaches for data integration, Extract Transform and Load (ETL) has become one of the most efficient and popular approach. Over the decades, ETL has been applied to a wide range of domains such as finance, health and telecom to mention but a few. As the popularity and use of ETL grow, it becomes important to analyze and identify the trends in the research and practice of ETL. In this paper, we perform a systematic literature review to identify and analyze: (1) Approaches used to implement existing ETL solutions (2) Quality attributes to be considered while adopting any ETL approach. (3) The depth of coverage in ETL research and practice with regards to the application domains, frequency publications and geographical locations of papers. (4) The prevailing challenges in developing ETL solutions. Furthermore, we discuss the implications of our findings to ETL researchers and practitioners.
引用
收藏
页码:308 / 324
页数:17
相关论文
共 19 条
  • [1] [Anonymous], 2011, P ACM
  • [2] [Anonymous], 2007, ENGINEERING
  • [3] Aqlan F, 2020, PROC FRONT EDUC CONF
  • [4] Aqlan F, 2018, PROC FRONT EDUC CONF
  • [5] Towards a Semantic Extract-Transform-Load (ETL) framework for Big Data Integration
    Bansal, Srividya K.
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 521 - 528
  • [6] Dayal U., 2009, Proceedings of EDBT'09, Saint Petersburg, Russia, P1, DOI [10.1145/1516360.1516362, DOI 10.1145/1516360.1516362]
  • [7] Deb Nath R.P., 2015, Proceedings of the ACM Eighteenth International Workshop on Data Warehousing and OLAP, New York, NY, USA, P15, DOI [DOI 10.1145/2811222.2811229, 10.1145/2811222.2811229]
  • [8] A BPMN-Based Design and Maintenance Framework for ETL Processes
    El Akkaoui, Zineb
    Zimanyi, Esteban
    Mazon, Jose-Norberto
    Trujillo, Juan
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2013, 9 (03) : 46 - 72
  • [9] Freitas A., 2012, EXTENDED SEMANTIC WE, P43
  • [10] Big Data: Promises and Problems
    Gudivada, Venkat N.
    Baeza-Yates, Ricardo
    Raghavan, Vijay V.
    [J]. COMPUTER, 2015, 48 (03) : 20 - 23