Proactive auto-scaling for cloud environments using temporal convolutional neural networks

被引:11
作者
Golshani, Ehsan [1 ,2 ]
Ashtiani, Mehrdad [1 ,2 ]
机构
[1] Iran Univ Sci & Technol, Cloud Comp Ctr, Sch Comp Engn, Tehran, Iran
[2] Iran Univ Sci & Technol, Sch Comp Engn, Hengam St, Tehran 1684613114, Iran
关键词
Cloud computing; Auto-scaling; Resources provisioning; Dynamic scalability; Multi-criteria decision making; PREDICTION;
D O I
10.1016/j.jpdc.2021.04.006
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Auto-scaling systems can dynamically scale the required resources for cloud-based services at runtime. This is an effective mechanism, enabling services to adapt to environmental changes. These systems establish the foundation for achieving elasticity in the modern cloud computing paradigm. Given the dynamic and uncertain nature of the shared cloud infrastructure, cloud auto-scaling systems are one of the most complex and sophisticated created artifacts, aiming to achieve self-aware, self-adaptive, and dependable runtime scaling. To find an effective solution to this problem, an accurate prediction of the required amount of workload as well as the system metrics for future time periods are needed. Various solutions have already been proposed to tackle this problem. Many solutions make use of machine learning, statistical, and ensemble methods. In this paper, we view the auto-scaling problem as a sequence model and apply the convolutional neural networks to predict the future workload of cloud services. Also, by using neural networks, we obtain a mapping between the predicted workload as well as the real-time and future amounts of the required resources. We have also proposed a decision-making mechanism that takes into account different and sometimes conflicting user criteria resulting in the best-compromised decision. To this aim, we have used TOPSIS as a multi-criteria decision-making method for the decision-making component. In the evaluation section, we have examined the amount of prediction error, the amount of service level agreement violations, as well as the amount of resources' under-utilization. Evaluations demonstrate that the proposed approach for predicting the workload shows a 4 percent improvement over the existing approaches. (C) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:119 / 141
页数:23
相关论文
共 29 条
[1]  
Almeida V., 2002, Performance Evaluation Review, V30, P3, DOI 10.1145/588160.588162
[2]  
[Anonymous], 2015, P INT C LEARN REPR
[3]  
Arlitt M.F., 1996, PROC ACM SIGMETRICS, P126
[4]  
Bai S, ARXIV
[5]  
Benifa J, 2018, MOB NETW APPL, V23, P1
[6]   Prediction of wind pressure coefficients on building surfaces using artificial neural networks [J].
Bre, Facundo ;
Gimenez, Juan M. ;
Fachinotti, Victor D. .
ENERGY AND BUILDINGS, 2018, 158 :1429-1441
[7]  
BrockwellRichard P.J., 2016, INTRO TIME SERIES FO
[8]   Metamorphic Testing: A Review of Challenges and Opportunities [J].
Chen, Tsong Yueh ;
Kuo, Fei-Ching ;
Liu, Huai ;
Poon, Pak-Lok ;
Towey, Dave ;
Tse, T. H. ;
Zhou, Zhi Quan .
ACM COMPUTING SURVEYS, 2018, 51 (01)
[9]   A Proactive Cloud Scaling Model Based on Fuzzy Time Series and SLA Awareness [J].
Dang Tran ;
Nhuan Tran ;
Giang Nguyen ;
Binh Minh Nguyen .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 :365-374
[10]   An optimizing BP neural network algorithm based on genetic algorithm [J].
Ding, Shifei ;
Su, Chunyang ;
Yu, Junzhao .
ARTIFICIAL INTELLIGENCE REVIEW, 2011, 36 (02) :153-162