Adaptive processing rate based container provisioning for meshed Micro-services in Kubernetes Clouds

被引:0
作者
Hang Wu
Zhicheng Cai
Yamin Lei
Jian Xu
Rajkumar Buyya
机构
[1] Nanjing University of Science and Technology,School of Computer Science and Engineering
[2] School of Computing and Information Systems,Cloud Computing and Distributed Systems (CLOUDS) Laboratory
[3] The University of Melbourne,undefined
来源
CCF Transactions on High Performance Computing | 2022年 / 4卷
关键词
Kubernetes; Container; Micro-services; Provisioning; Jackson queuing network;
D O I
暂无
中图分类号
学科分类号
摘要
More and more applications are organized in the form of meshed micro-services which can be deployed on the popular container orchestration platform Kubernetes. Designing appropriate container auto-scaling methods for such applications in Kubernetes is beneficial to reducing costs and guaranteeing Quality of Services (QoS). However, most existing resource provisioning methods focus on a service without considering interactions among meshed services. Meanwhile, synchronous calls among services have different impacts on the processing ability of containers as the proportion of different business type’s requests changes which is not considered in existing methods too. Therefore, in this article, an adaptive queuing model and queuing-length aware Jackson queuing network based method is proposed. It adjusts the processing rate of containers according to the ratio of synchronous calls and considers queuing tasks when calculating the impact of bottleneck tiers to others. Experiments are performed on a real Kubernetes cluster, which illustrate that the proposal obtains the lowest percentage of Service Level Agreement (SLA)-violations (decreasing about 6.33%-12.29%) with about 0.9% additional costs compared with existing methods of Kubernetes and other latest methods.
引用
收藏
页码:165 / 181
页数:16
相关论文
共 58 条
  • [11] Merle P(2018)Auto-scaling web applications in clouds: a taxonomy and survey ACM Comput. Surv. 53 1830-103
  • [12] Barrett E(2009)Wikipedia workload analysis for decentralized hosting Comput. Netw. 30 855-208
  • [13] Howley E(2019)Integrating concurrency control in n-tier application scaling management in the cloud IEEE Trans. Parallel Distrib. Syst. 155 91-undefined
  • [14] Duggan J(2019)Brownoutcon: a software system based on brownout and containers for energy-efficient cloud computing J. Syst. Softw. 47 198-undefined
  • [15] Bi J(2016)CTP: a scheduling strategy to smooth response time fluctuations in multi-tier website system Microprocess. Microsyst. undefined undefined-undefined
  • [16] Yuan H(2020)A cost-efficient container orchestration strategy in kubernetes-based cloud computing infrastructures with heterogeneous resources ACM Trans. Internet Technol. undefined undefined-undefined
  • [17] Tie M(undefined)undefined undefined undefined undefined-undefined
  • [18] Tan W(undefined)undefined undefined undefined undefined-undefined
  • [19] Cai Z(undefined)undefined undefined undefined undefined-undefined
  • [20] Buyya R(undefined)undefined undefined undefined undefined-undefined