Adaptive processing rate based container provisioning for meshed Micro-services in Kubernetes Clouds

被引:0
作者
Hang Wu
Zhicheng Cai
Yamin Lei
Jian Xu
Rajkumar Buyya
机构
[1] Nanjing University of Science and Technology,School of Computer Science and Engineering
[2] School of Computing and Information Systems,Cloud Computing and Distributed Systems (CLOUDS) Laboratory
[3] The University of Melbourne,undefined
来源
CCF Transactions on High Performance Computing | 2022年 / 4卷
关键词
Kubernetes; Container; Micro-services; Provisioning; Jackson queuing network;
D O I
暂无
中图分类号
学科分类号
摘要
More and more applications are organized in the form of meshed micro-services which can be deployed on the popular container orchestration platform Kubernetes. Designing appropriate container auto-scaling methods for such applications in Kubernetes is beneficial to reducing costs and guaranteeing Quality of Services (QoS). However, most existing resource provisioning methods focus on a service without considering interactions among meshed services. Meanwhile, synchronous calls among services have different impacts on the processing ability of containers as the proportion of different business type’s requests changes which is not considered in existing methods too. Therefore, in this article, an adaptive queuing model and queuing-length aware Jackson queuing network based method is proposed. It adjusts the processing rate of containers according to the ratio of synchronous calls and considers queuing tasks when calculating the impact of bottleneck tiers to others. Experiments are performed on a real Kubernetes cluster, which illustrate that the proposal obtains the lowest percentage of Service Level Agreement (SLA)-violations (decreasing about 6.33%-12.29%) with about 0.9% additional costs compared with existing methods of Kubernetes and other latest methods.
引用
收藏
页码:165 / 181
页数:16
相关论文
共 58 条
  • [1] Abdullah M(2020)Diminishing returns and deep learning for adaptive CPU resource allocation of containers IEEE Trans. Netw. Serv. Manag. 17 2052-2063
  • [2] Iqbal W(2017)Stochastic resource provisioning for containerized multi-tier web services in clouds IEEE Trans. Parallel Distrib. Syst. 28 2060-2073
  • [3] Bukhari F(2018)Elasticity in cloud computing: state of the art and research challenges IEEE Trans. Serv. Comput. 11 430-447
  • [4] Erradi A(2013)Applying reinforcement learning towards automating resource allocation and application scalability in the cloud Concurr. Comput. Pract. Exp. 25 1656-1674
  • [5] Adam O(2015)Sla-based optimisation of virtualised resource for multi-tier web applications in cloud data centres Enterp. Inf. Syst. 9 743-767
  • [6] Lee YC(2021)Inverse queuing model based feedback control for elastic container provisioning of web systems in Kubernetes IEEE Trans. Comput. 32 e5926-50
  • [7] Zomaya AY(2020)Unequal-interval based loosely coupled control method for auto-scaling heterogeneous cloud resources for web applications Concurr. Comput. Pract. Exp. 41 23-592
  • [8] Al-Dhuraibi Y(2011)Cloudsim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms Softw. Pract. Exp. 12 559-521
  • [9] Paraiso F(2014)A review of auto-scaling techniques for elastic applications in cloud environments J. Grid Comput. 47 505-1845
  • [10] Djarallah N(2017)Containercloudsim: an environment for modeling and simulation of containers in cloud data centers Softw. Pract. Exp. 51 4-869