An adaptive auto-scaling framework for cloud resource provisioning

被引:11
作者
Chouliaras, Spyridon [1 ]
Sotiriadis, Stelios [1 ]
机构
[1] Birkbeck Univ London, Dept Comp Sci & Informat Syst, London, England
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2023年 / 148卷
关键词
Auto-scaling; Resource provisioning; Cloud computing; Convolutional neural networks; K-means; MODEL; PERFORMANCE;
D O I
10.1016/j.future.2023.05.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Cloud computing emerged as a technology that offers scalable access to computing resources in conjunction with low maintenance costs. In this domain, cloud users utilize virtualized resources to benefit from on-demand and long-term pricing strategies. Although the latter consists of a more cost-efficient solution, it requires accurate estimations of future workload demands, which is a challenging task. Furthermore, clouds offer threshold-based auto-scaling rules that need to be manually controlled by the users according to application requirements. Still, tuning scaling parameters is not trivial, since it is mainly based on static scaling rules that may lead to unreasonable costs and quality of service violations. In this work we introduce ADA-RP, an adaptive auto-scaling framework for reliable resource provisioning in the cloud. ADA-RP uses historical time series data for training K-means and convolutional neural networks (CNN) to categorize future workload demands as High, Medium or Low based on CPU utilization. We auto-scale cloud resources in real-time based on the predicted workload demand to reduce costs and improve application performance. The experimental analysis is based on TPC-C runs on MySQL containers deployed on the Google Cloud Platform. Experimental results are prosperous, demonstrating the ability of ADA-RP (i) to reduce MySQL deployment costs by 48% in a single-tenant environment, and (ii) to double the executed queries per second in a multi-tenant environment considering user's budget requirements.& COPY; 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:173 / 183
页数:11
相关论文
共 40 条
[1]  
Ali-Eldin A, 2012, IEEE IFIP NETW OPER, P204, DOI 10.1109/NOMS.2012.6211900
[2]   A Comparison of Reinforcement Learning Techniques for Fuzzy Cloud Auto-Scaling [J].
Arabnejad, Hamid ;
Pahl, Claus ;
Jamshidi, Pooyan ;
Estrada, Giovani .
2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, :64-73
[3]   Federated intelligence of anomaly detection agent in IoTMD-enabled Diabetes Management Control System [J].
Astillo, Philip Virgil ;
Duguma, Daniel Gerbi ;
Park, Hoonyong ;
Kim, Jiyoon ;
Kim, Bonam ;
You, Ilsun .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 128 :395-405
[4]   Applying reinforcement learning towards automating resource allocation and application scalability in the cloud [J].
Barrett, Enda ;
Howley, Enda ;
Duggan, Jim .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (12) :1656-1674
[6]   Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications' QoS [J].
Calheiros, Rodrigo N. ;
Masoumi, Enayat ;
Ranjan, Rajiv ;
Buyya, Rajkumar .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2015, 3 (04) :449-458
[7]   Optimization of Resource Provisioning Cost in Cloud Computing [J].
Chaisiri, Sivadon ;
Lee, Bu-Sung ;
Niyato, Dusit .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2012, 5 (02) :164-177
[8]   Real-Time Anomaly Detection of NoSQL Systems Based on Resource Usage Monitoring [J].
Chouliaras, Spyridon ;
Sotiriadis, Stelios .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (09) :6042-6049
[9]   Reliability-aware server consolidation for balancing energy-lifetime tradeoff in virtualized cloud datacenters [J].
Deng, Wei ;
Liu, Fangming ;
Jin, Hai ;
Liao, Xiaofei ;
Liu, Haikun .
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2014, 27 (04) :623-642
[10]  
Dong XS, 2017, INT CONF BIG DATA, P119, DOI 10.1109/BIGCOMP.2017.7881726