An Event-Driven Serverless ETL Pipeline on AWS

被引:7
作者
Pogiatzis, Antreas [1 ]
Samakovitis, Georgios [1 ]
机构
[1] Univ Greenwich, Sch Comp & Math Sci, Old Royal Naval Coll, Pk Row, London SE10 9LS, England
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 01期
关键词
serverless; FaaS; event-driven; distributed; AWS; ETL; architecture;
D O I
10.3390/app11010191
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This work presents an event-driven Extract, Transform, and Load (ETL) pipeline serverless architecture and provides an evaluation of its performance over a range of dataflow tasks of varying frequency, velocity, and payload size. We design an experiment while using generated tabular data throughout varying data volumes, event frequencies, and processing power in order to measure: (i) the consistency of pipeline executions; (ii) reliability on data delivery; (iii) maximum payload size per pipeline; and, (iv) economic scalability (cost of chargeable tasks). We run 92 parameterised experiments on a simple AWS architecture, thus avoiding any AWS-enhanced platform features, in order to allow for unbiased assessment of our model's performance. Our results indicate that our reference architecture can achieve time-consistent data processing of event payloads of more than 100 MB, with a throughput of 750 KB/s across four event frequencies. It is also observed that, although the utilisation of an SQS queue for data transfer enables easy concurrency control and data slicing, it becomes a bottleneck on large sized event payloads. Finally, we develop and discuss a candidate pricing model for our reference architecture usage.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [41] Towards event-driven simulation of static sphere packings
    Gonzalez, S.
    Risso, D.
    Soto, R.
    POWDERS AND GRAINS 2009, 2009, 1145 : 301 - 304
  • [42] Toward Event-Driven Mechanism for Load Profile Generation
    Conus, Gilbert
    Lilis, Georgios
    Zanjani, Nastaran Asadi
    Kayal, Maher
    2017 22ND IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2017,
  • [43] Smart home architecture based on event-driven DPWS~+
    陈鹏
    支小莉
    Advances in Manufacturing, 2011, 15 (05) : 386 - 390
  • [44] The Event-Driven Wireless Sensor Networks of Coal Mine
    Liu Xiaoyang
    Qiao Zhi
    Lv Hongjie
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 322 - 326
  • [45] Event-Driven Distributed Kalman-Consensus Filter with Limited Memory Information
    Yang, Chunxi
    Zhu, Jie
    Zhai, Chi
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 690 - 696
  • [46] A framework and a performance assessment for serverless MapReduce on AWS Lambda
    Gimenez-Alventosa, V
    Molto, German
    Caballer, Miguel
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 97 : 259 - 274
  • [47] Defining the semantics of reactive components in event-driven workflow execution with event histories
    Geppert, A
    Tombros, D
    Dittrich, KR
    INFORMATION SYSTEMS, 1998, 23 (3-4) : 235 - 252
  • [48] Stable algorithm for event detection in event-driven particle dynamics: logical states
    Severin Strobl
    Marcus N. Bannerman
    Thorsten Pöschel
    Computational Particle Mechanics, 2016, 3 : 383 - 388
  • [49] A time-driven and event-driven approach for substation feeder incident analysis
    Chen, Chao-Rong
    Chang, Chi-Juin
    Lee, Cheng-Hung
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2016, 74 : 9 - 15
  • [50] An Event-Driven Approach for E-Services System Design
    Liu, Yan
    Liu, Qin
    Zhuang, Mingguang
    Wang, Qingling
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 8414 - 8417