A framework and a performance assessment for serverless MapReduce on AWS Lambda

被引:37
作者
Gimenez-Alventosa, V [1 ]
Molto, German [1 ]
Caballer, Miguel [1 ]
机构
[1] Univ Politecn Valencia, Ctr Mixto CSIC, I3M, Camino Vera S-N, E-46022 Valencia, Spain
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2019年 / 97卷
关键词
MapReduce; Serverless; Cloud computing; Elasticity;
D O I
10.1016/j.future.2019.02.057
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
MapReduce is one of the most widely used programming models for analysing large-scale datasets, i.e. Big Data. In recent years, serverless computing and, in particular, Functions as a Service (FaaS) has surged as an execution model in which no explicit management of servers (e.g. virtual machines) is performed by the user. Instead, the Cloud provider dynamically allocates resources to the function invocations and fine-grained billing is introduced depending on the execution time and allocated memory, as exemplified by AWS Lambda. In this article, a high-performant serverless architecture has been created to execute MapReduce jobs on AWS Lambda using Amazon S3 as the storage backend. In addition, a thorough assessment has been carried out to study the suitability of AWS Lambda as a platform for the execution of High Throughput Computing jobs. The results indicate that AWS Lambda provides a convenient computing platform for general-purpose applications that fit within the constraints of the service (15 min of maximum execution time, 3008 MB of RAM and 512 MB of disk space) but it exhibits an inhomogeneous performance behaviour that may jeopardise adoption for tightly coupled computing jobs. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:259 / 274
页数:16
相关论文
共 11 条
  • [1] [Anonymous], LAMBDA
  • [2] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [3] Figiela Kamil, 2018, CONCURRENCY COMPUT
  • [4] Glikson Alex, 2017, C 10 ACM INT SYST ST
  • [5] J David, 2008, TR0807 HARV COMP SCI
  • [6] Occupy the Cloud: Distributed Computing for the 99%
    Jonas, Eric
    Pu, Qifan
    Venkataraman, Shivaram
    Stoica, Ion
    Recht, Benjamin
    [J]. PROCEEDINGS OF THE 2017 SYMPOSIUM ON CLOUD COMPUTING (SOCC '17), 2017, : 445 - 451
  • [7] Lee Hyungro, 2018, 2018 IEEE 11 INT C C
  • [8] Mitigating Resource Contention and Heterogeneity in Public Clouds for Scientific Modeling Services
    Lloyd, Wes
    Pallickara, Shrideep
    David, Olaf
    Arabi, Mazdak
    Rojas, Ken
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2017), 2017, : 159 - 166
  • [9] Pavlo Andrew, 2009, SIGMOD 2009
  • [10] Wang Laing, 2018, ANN TECHN C