Adaptive processing rate based container provisioning for meshed Micro-services in Kubernetes Clouds

被引:2
作者
Wu, Hang [1 ]
Cai, Zhicheng [1 ]
Lei, Yamin [1 ]
Xu, Jian [1 ]
Buyya, Rajkumar [2 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Vic, Australia
基金
中国国家自然科学基金;
关键词
Kubernetes; Container; Micro-services; Provisioning; Jackson queuing network; RESOURCE-ALLOCATION; SIMULATION;
D O I
10.1007/s42514-022-00096-x
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
More and more applications are organized in the form of meshed micro-services which can be deployed on the popular container orchestration platform Kubernetes. Designing appropriate container auto-scaling methods for such applications in Kubernetes is beneficial to reducing costs and guaranteeing Quality of Services (QoS). However, most existing resource provisioning methods focus on a service without considering interactions among meshed services. Meanwhile, synchronous calls among services have different impacts on the processing ability of containers as the proportion of different business type's requests changes which is not considered in existing methods too. Therefore, in this article, an adaptive queuing model and queuing-length aware Jackson queuing network based method is proposed. It adjusts the processing rate of containers according to the ratio of synchronous calls and considers queuing tasks when calculating the impact of bottleneck tiers to others. Experiments are performed on a real Kubernetes cluster, which illustrate that the proposal obtains the lowest percentage of Service Level Agreement (SLA)-violations (decreasing about 6.33%-12.29%) with about 0.9% additional costs compared with existing methods of Kubernetes and other latest methods.
引用
收藏
页码:165 / 181
页数:17
相关论文
共 36 条
[1]   Diminishing Returns and Deep Learning for Adaptive CPU Resource Allocation of Containers [J].
Abdullah, Muhammad ;
Iqbal, Waheed ;
Bukhari, Faisal ;
Erradi, Abdelkarim .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2020, 17 (04) :2052-2063
[2]   Stochastic Resource Provisioning for Containerized Multi-Tier Web Services in Clouds [J].
Adam, Omer ;
Lee, Young Choon ;
Zomaya, Albert Y. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (07) :2060-2073
[3]   Elasticity in Cloud Computing: State of the Art and Research Challenges [J].
Al-Dhuraibi, Yahya ;
Paraiso, Fawaz ;
Djarallah, Nabil ;
Merle, Philippe .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2018, 11 (02) :430-447
[4]  
AlibabaCloud, CONT SERV KUB
[5]  
Amazon, AMAZ ELAST KUB SERV
[6]  
Arlitt M.F., 1996, PROC ACM SIGMETRICS, P126
[7]   A comparitive study of predictive models for cloud infrastructure management [J].
Balaji, Mahesh ;
Rao, G. Subrahmanya V. R. K. ;
Kumar, Ch. Aswani .
2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, :923-926
[8]   A Discrete-Time Feedback Controller for Containerized Cloud Applications [J].
Baresi, Luciano ;
Guinea, Sam ;
Leva, Alberto ;
Quattrocchi, Giovanni .
FSE'16: PROCEEDINGS OF THE 2016 24TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON FOUNDATIONS OF SOFTWARE ENGINEERING, 2016, :217-228
[9]   Applying reinforcement learning towards automating resource allocation and application scalability in the cloud [J].
Barrett, Enda ;
Howley, Enda ;
Duggan, Jim .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2013, 25 (12) :1656-1674
[10]   SLA-based optimisation of virtualised resource for multi-tier web applications in cloud data centres [J].
Bi, Jing ;
Yuan, Haitao ;
Tie, Ming ;
Tan, Wei .
ENTERPRISE INFORMATION SYSTEMS, 2015, 9 (07) :743-767