An adaptive auto-scaling framework for cloud resource provisioning

Cited by: 11
Authors
Chouliaras, Spyridon [1]
Sotiriadis, Stelios [1]
Affiliations
[1] Birkbeck Univ London, Dept Comp Sci & Informat Syst, London, England
Source
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2023 / Vol. 148
Keywords
Auto-scaling; Resource provisioning; Cloud computing; Convolutional neural networks; K-means; MODEL; PERFORMANCE;
DOI
10.1016/j.future.2023.05.017
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Cloud computing emerged as a technology that offers scalable access to computing resources in conjunction with low maintenance costs. In this domain, cloud users utilize virtualized resources to benefit from on-demand and long-term pricing strategies. Although the latter is the more cost-efficient solution, it requires accurate estimations of future workload demands, which is a challenging task. Furthermore, clouds offer threshold-based auto-scaling rules that need to be manually controlled by the users according to application requirements. Still, tuning scaling parameters is not trivial, since it is mainly based on static scaling rules that may lead to unreasonable costs and quality of service violations. In this work we introduce ADA-RP, an adaptive auto-scaling framework for reliable resource provisioning in the cloud. ADA-RP uses historical time series data for training K-means and convolutional neural networks (CNN) to categorize future workload demands as High, Medium or Low based on CPU utilization. We auto-scale cloud resources in real-time based on the predicted workload demand to reduce costs and improve application performance. The experimental analysis is based on TPC-C runs on MySQL containers deployed on the Google Cloud Platform. Experimental results are promising, demonstrating the ability of ADA-RP (i) to reduce MySQL deployment costs by 48% in a single-tenant environment, and (ii) to double the executed queries per second in a multi-tenant environment considering users' budget requirements. © 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
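The abstract describes categorizing workload demand as High, Medium or Low based on CPU utilization using K-means. The following is a minimal sketch of that labeling idea only, not the authors' implementation: it runs a plain one-dimensional K-means over a hypothetical CPU utilization history and names the three clusters by centroid order. The function names (`kmeans_1d`, `label_workload`), the deterministic initialization, and the sample values are assumptions made for illustration; the CNN prediction step and the actual scaling actions of ADA-RP are not covered.

```python
from statistics import mean


def kmeans_1d(values, k=3, iterations=50):
    """Plain 1-D k-means (Lloyd's algorithm) with deterministic initialization."""
    ordered = sorted(values)
    # Assumption: spread initial centroids evenly over the sorted data so the
    # result is reproducible; random initialization is the more common choice.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centroids[i]))
            clusters[nearest].append(v)
        # Keep the previous centroid if a cluster ends up empty.
        updated = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
        if updated == centroids:
            break
        centroids = updated
    return sorted(centroids)


def label_workload(cpu, centroids, names=("Low", "Medium", "High")):
    """Assign a CPU utilization sample to the label of its nearest centroid."""
    nearest = min(range(len(centroids)), key=lambda i: abs(cpu - centroids[i]))
    return names[nearest]


# Hypothetical CPU utilization history in percent (illustrative values only).
history = [8, 12, 10, 15, 45, 50, 55, 48, 85, 90, 95, 88]
centroids = kmeans_1d(history)
labels = [label_workload(v, centroids) for v in history]
print(centroids)
print(labels)
```

In a setting like the one the abstract outlines, a predicted label of this kind would then drive a scaling decision, replacing a hand-tuned static threshold; how ADA-RP maps labels to resources is not specified in this record.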
Pages: 173-183
Number of pages: 11