Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services

被引:1
作者
Bento, Andre [1 ]
Araujo, Filipe [1 ]
Barbosa, Raul [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
关键词
Cloud services; Microservices; Availability modeling; Cost-effectiveness; Multi-objective optimization; Autoscaling; MULTIOBJECTIVE OPTIMIZATION; MICROSERVICES;
D O I
10.1007/s10723-023-09718-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud services have become increasingly popular for developing large-scale applications due to the abundance of resources they offer. The scalability and accessibility of these resources have made it easier for organizations of all sizes to develop and implement sophisticated and demanding applications to meet demand instantly. As monetary fees are involved in the use of the cloud, one of the challenges for application developers and operators is to balance their budget constraints with crucial quality attributes, such as availability. Industry standards usually default to simplified solutions that cannot simultaneously consider competing objectives. Our research addresses this challenge by proposing a Cost-Availability Aware Scaling (CAAS) approach that uses multi-objective optimization of availability and cost. We evaluate CAAS using two open-source microservices applications, yielding improved results compared to the industry standard CPU-based Autoscaler (AS). CAAS can find optimal system configurations with higher availability, between 1 and 2 nines on average, and reduced costs, 6% on average, with the first application, and 1 nine of availability on average, and reduced costs up to 18% on average, with the second application. The gap in the results between our model and the default AS suggests that operators can significantly improve the operation of their applications.
引用
收藏
页数:19
相关论文
共 47 条
  • [1] Adan I., 2002, QUEUEING THEORY
  • [2] An Automated Task Scheduling Model Using Non-Dominated Sorting Genetic Algorithm II for Fog-Cloud Systems
    Ali, Ismail M. M.
    Sallam, Karam M. M.
    Moustafa, Nour
    Chakraborty, Ripon
    Ryan, Michael
    Choo, Kim-Kwang Raymond
    [J]. IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (04) : 2294 - 2308
  • [3] Amazon Web Services Inc., 2022, Amazon compute service level agreement
  • [4] Amazon Web Services Inc., Amazon cloudwatch metrics for Amazon EC2 auto scaling
  • [5] Amdahl G.M., 1967, PROC SPRING JOINT CO, P483, DOI [10.1145/1465482.1465560, DOI 10.1145/1465482.1465560]
  • [6] Chamulteon: Coordinated Auto-Scaling of Micro-Services
    Bauer, Andre
    Lesch, Veronika
    Versluis, Laurens
    Ilyushkin, Alexey
    Herbst, Nikolas
    Kounev, Samuel
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 2015 - 2025
  • [7] Bauer E., 2012, RELIABILITY AVAILABI
  • [8] Cloud computing in construction industry: Use cases, benefits and challenges
    Bello, Sururah A.
    Oyedele, Lukumon O.
    Akinade, Olugbenga O.
    Bilal, Muhammad
    Delgado, Juan Manuel Davila
    Akanbi, Lukman A.
    Ajayi, Anuoluwapo O.
    Owolabi, Hakeem A.
    [J]. AUTOMATION IN CONSTRUCTION, 2021, 122
  • [9] Bento Andre, 2022, 2022 IEEE 21st International Symposium on Network Computing and Applications (NCA), P45, DOI 10.1109/NCA57778.2022.10013618
  • [10] Pymoo: Multi-Objective Optimization in Python']Python
    Blank, Julian
    Deb, Kalyanmoy
    [J]. IEEE ACCESS, 2020, 8 : 89497 - 89509