Learning Predictive Autoscaling Policies for Cloud-hosted Microservices Using Trace-driven Modeling

Cited by: 15
Authors
Abdullah, Muhammad [1]
Iqbal, Waheed [1]
Erradi, Abdelkarim [2]
Bukhari, Faisal [1]
Affiliations
[1] Univ Punjab, Punjab Univ Coll Informat Technol, Lahore, Pakistan
[2] Qatar Univ, Dept Comp Sci & Engn, Coll Engn, Doha, Qatar
Source
11TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM 2019) | 2019
Keywords
Cloud computing; Predictive autoscaling; SLO violations; Microservices; Response time guarantees
DOI
10.1109/CloudCom.2019.00028
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Autoscaling methods are important for ensuring response time guarantees for cloud-hosted microservices. Most existing state-of-the-art autoscaling methods use rule-based reactive policies with static thresholds defined either on monitored resource consumption metrics, such as CPU and memory utilization, or on application-level metrics, such as response time. However, it is challenging to determine the threshold values that minimize both resource consumption and performance violations. Predictive autoscaling methods can help address this challenge, but they require considerable time to collect enough performance traces, covering the different resource provisioning possibilities for a target infrastructure, to train a useful predictive autoscaling model. In this paper, we tackle this problem by proposing a system that models the response time of microservices through stress testing and then uses trace-driven simulation to learn a predictive autoscaling model that automatically satisfies response time requirements. The proposed solution reduces the need to collect performance traces for learning a predictive autoscaling model. Our experimental evaluation on the AWS cloud, using a microservice under realistic dynamic workloads, validates the proposed solution. The results show that the proposed autoscaling method satisfies the response time requirement at only 4.5% extra cost compared to the reactive autoscaling method.
Pages: 119-126
Number of pages: 8