Deep Learning-Based Autoscaling Using Bidirectional Long Short-Term Memory for Kubernetes

被引:44
作者
Dang-Quang, Nhat-Minh [1 ]
Yoo, Myungsik [2 ]
机构
[1] Soongsil Univ, Dept Informat Commun Convergence Technol, Seoul 06978, South Korea
[2] Soongsil Univ, Sch Elect Engn, Seoul 06978, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 09期
关键词
cloud computing; web application; autoscaling; time-series analysis; resource estimation; Kubernetes; deep neural network; WEB APPLICATIONS; TRENDS;
D O I
10.3390/app11093835
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Presently, the cloud computing environment attracts many application developers to deploy their web applications on cloud data centers. Kubernetes, a well-known container orchestration for deploying web applications on cloud systems, offers an automatic scaling feature to meet clients' ever-changing demands with the reactive approach. This paper proposes a system architecture based on Kubernetes with a proactive custom autoscaler using a deep neural network model to handle the workload during run time dynamically. The proposed system architecture is designed based on the Monitor-Analyze-Plan-Execute (MAPE) loop. The main contribution of this paper is the proactive custom autoscaler, which focuses on the analysis and planning phases. In analysis phase, Bidirectional Long Short-term Memory (Bi-LSTM) is applied to predict the number of HTTP workloads in the future. In the planning phase, a cooling-down time period is implemented to mitigate the oscillation problem. In addition, a resource removal strategy is proposed to remove a part of the resources when the workload decreases, so that the autoscaler can handle it faster when the burst of workload happens. Through experiments with two different realistic workloads, the Bi-LSTM model achieves better accuracy not only than the Long Short-Term Memory model but also than the state-of-the-art statistical auto-regression integrated moving average model in terms of short- and long-term forecasting. Moreover, it offers 530 to 600 times faster prediction speed than ARIMA models with different workloads. Furthermore, as compared to the LSTM model, the Bi-LSTM model performs better in terms of resource provision accuracy and elastic speedup. Finally, it is shown that the proposed proactive custom autoscaler outperforms the default horizontal pod autoscaler (HPA) of the Kubernetes in terms of accuracy and speed when provisioning and de-provisioning resources.
引用
收藏
页数:25
相关论文
共 37 条
[1]   Elasticity in Cloud Computing: State of the Art and Research Challenges [J].
Al-Dhuraibi, Yahya ;
Paraiso, Fawaz ;
Djarallah, Nabil ;
Merle, Philippe .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2018, 11 (02) :430-447
[2]   Autonomic Vertical Elasticity of Docker Containers with ELASTICDOCKER [J].
Al-Dhuraibi, Yahya ;
Paraiso, Fawaz ;
Djarallah, Nabil ;
Merle, Philippe .
2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, :472-479
[3]  
[Anonymous], 2012, P 13 ANN C INT SPEEC
[4]   Auto-scaling web applications in clouds: A cost-aware approach [J].
Aslanpour, Mohammad Sadegh ;
Ghobaei-Arani, Mostafa ;
Toosi, Adel Nadjaran .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 95 :26-41
[5]   Chameleon: A Hybrid, Proactive Auto-Scaling Mechanism on a Level-Playing Field [J].
Bauer, Andre ;
Herbst, Nikolas ;
Spinner, Simon ;
Ali-Eldin, Ahmed ;
Kounev, Samuel .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (04) :800-813
[6]   On the Value of Service Demand Estimation for Auto-scaling [J].
Bauer, Andre ;
Grohmann, Johannes ;
Herbst, Nikolas ;
Kounev, Samuel .
MEASUREMENT, MODELLING AND EVALUATION OF COMPUTING SYSTEMS, MMB 2018, 2018, 10740 :142-156
[7]   Predicting Cloud Resource Utilization [J].
Borkowski, Michael ;
Schulte, Stefan ;
Hochreiner, Christoph .
2016 IEEE/ACM 9TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2016, :37-42
[8]   Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications' QoS [J].
Calheiros, Rodrigo N. ;
Masoumi, Enayat ;
Ranjan, Rajiv ;
Buyya, Rajkumar .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2015, 3 (04) :449-458
[9]  
Cheng HB, 2006, LECT NOTES ARTIF INT, V3918, P765
[10]  
Ciptaningtyas HT, 2017, INT CONF INFORM COMM, P193, DOI 10.1109/ICTS.2017.8265669