An Event-Driven Serverless ETL Pipeline on AWS

被引:7
作者
Pogiatzis, Antreas [1 ]
Samakovitis, Georgios [1 ]
机构
[1] Univ Greenwich, Sch Comp & Math Sci, Old Royal Naval Coll, Pk Row, London SE10 9LS, England
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 01期
关键词
serverless; FaaS; event-driven; distributed; AWS; ETL; architecture;
D O I
10.3390/app11010191
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This work presents an event-driven Extract, Transform, and Load (ETL) pipeline serverless architecture and provides an evaluation of its performance over a range of dataflow tasks of varying frequency, velocity, and payload size. We design an experiment while using generated tabular data throughout varying data volumes, event frequencies, and processing power in order to measure: (i) the consistency of pipeline executions; (ii) reliability on data delivery; (iii) maximum payload size per pipeline; and, (iv) economic scalability (cost of chargeable tasks). We run 92 parameterised experiments on a simple AWS architecture, thus avoiding any AWS-enhanced platform features, in order to allow for unbiased assessment of our model's performance. Our results indicate that our reference architecture can achieve time-consistent data processing of event payloads of more than 100 MB, with a throughput of 750 KB/s across four event frequencies. It is also observed that, although the utilisation of an SQS queue for data transfer enables easy concurrency control and data slicing, it becomes a bottleneck on large sized event payloads. Finally, we develop and discuss a candidate pricing model for our reference architecture usage.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [31] ED-FreEst: Event-Driven Frequency Estimation
    Guo, Ruiming
    Bhandari, Ayush
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 867 - 871
  • [32] An Event-driven Clustering Routing Algorithm in mobile CRSNs
    Han, Kun
    Tan, Long
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 655 - 660
  • [33] Overload behaviour and protection of event-driven web servers
    Voigt, T
    WEB ENGINEERING AND PEER TO PEER COMPUTING, 2002, 2376 : 147 - 157
  • [34] REST: Relational Event-driven Stock Trend Forecasting
    Xu, Wentao
    Liu, Weiqing
    Xu, Chang
    Bian, Jiang
    Yin, Jian
    Liu, Tie-Yan
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 1 - 10
  • [35] Event-Driven Distributed Kalman-Consensus Filter with Limited Memory Information
    Yang, Chunxi
    Zhu, Jie
    Zhai, Chi
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 690 - 696
  • [36] Towards event-driven simulation of static sphere packings
    Gonzalez, S.
    Risso, D.
    Soto, R.
    POWDERS AND GRAINS 2009, 2009, 1145 : 301 - 304
  • [37] Event-driven visual attention for the humanoid robot iCub
    Rea, Francesco
    Metta, Giorgio
    Bartolozzi, Chiara
    FRONTIERS IN NEUROSCIENCE, 2013, 7
  • [38] Event-Driven Test Script Methodology for SOA System
    Lee, Youngkon
    COMMUNICATION AND NETWORKING, PT I, 2011, 265 : 23 - 30
  • [39] Event-driven PID control of autonomous quadrotor helicopters
    Ye, Qian
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2018, 12 (02) : 159 - 164
  • [40] Balancing parallelization and asynchronization in event-driven programs with OpenMP
    Fan, Xing
    Sinnen, Oliver
    Giacaman, Nasser
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (04)