AutoMan: Resource-efficient provisioning with tail latency guarantees for microservices

被引：5

作者：

Cai, Binlei ^{[1
,2
]}

Wang, Bin ^{[1
,2
]}

Yang, Meihong ^{[1
,2
]}

Guo, Qin ^{[3
]}

机构：

[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan, Jinan 250103, Peoples R China

[2] Shandong Prov Key Lab Comp Networks, Jinan 250103, Peoples R China

[3] Shandong Jianzhu Univ, Sch Sci, Jinan 250101, Peoples R China

来源：

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2023年 / 143卷

基金：

美国国家科学基金会;

关键词：

Cloud computing; Microservices; Resource management; Reinforcement learning; Quality of service; Tail latency; ALLOCATION;

D O I：

10.1016/j.future.2023.01.014

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Modern user-facing services are progressively evolving from large monolithic applications to complex graphs of loosely-coupled microservices. While microservice architecture greatly improves the efficiency of development and operation, it also complicates resource allocation and performance guarantee due to complex dependencies across different microservices. To prevent resource wastage and ensure user satisfaction, we present AutoMan, a learning-driven resource manager for microservices that enables much more efficient resource provisioning while guaranteeing the end-to-end tail latency Service Level Objective (SLO). AutoMan leverages a multi-agent deep deterministic policy gradient (MADDPG)-based method to capture the dependencies across different microservices and to allocate a proper amount of resources to each microservice subject to the target end-to-end tail latency SLO. During runtime, it further proactively identifies the critical microservices responsible for performance anomaly by deriving partial SLOs mathematically, and performs dynamic reprovisioning to mitigate the potential SLO violations. Testbed experiments show that AutoMan can save CPU and memory resources by up to 49.6% and 29.1% on average, while guaranteeing the same end-to-end tail latency objective.(c) 2023 Elsevier B.V. All rights reserved.

引用

页码：61 / 75

页数：15

共 50 条

[1] Less Provisioning: A Hybrid Resource Scaling Engine for Long-Running Services With Tail Latency Guarantees
Cai, Binlei
Li, Keqiu
Zhao, Laiping
Zhang, Rongqi
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (03) : 1941 - 1957
[2] A Resource-Efficient Predictive Resource Provisioning System in Cloud Systems
Shen, Haiying
Chen, Liuhua
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3886 - 3900
[3] Erms: Efficient Resource Management for Shared Microservices with SLA Guarantees
Luo, Shutian
Xu, Huanle
Ye, Kejiang
Xu, Guoyao
Zhang, Liping
He, Jian
Yang, Guodong
Xu, Chengzhong
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 1, ASPLOS 2023, 2023, : 62 - 77
[4] HCloud: Resource-Efficient Provisioning in Shared Cloud Systems
Delimitrou, Christina
Kozyrakis, Christos
ACM SIGPLAN NOTICES, 2016, 51 (04) : 473 - 488
[5] ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster
Song, Chenghao
Xu, Minxian
Ye, Kejiang
Wu, Huaming
Gill, Sukhpal Singh
Buyya, Rajkumar
Xu, Chengzhong
SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 197 - 211
[6] Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency Guarantees
Cai, Binlei
Zhang, Rongqi
Zhao, Laiping
Li, Keqiu
PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
[7] Continual flow pipelines: Achieving resource-efficient latency tolerance
Srinivasan, ST
Rajwar, R
Akkary, H
Gandhi, A
Upton, M
IEEE MICRO, 2004, 24 (06) : 62 - 73
[8] Towards Resource-Efficient Cloud Systems: Avoiding Over-Provisioning in Demand-Prediction Based Resource Provisioning
Chen, Liuhua
Shen, Haiying
2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 184 - 193
[9] Achieving Resource-Efficient Survivable Provisioning in Service Differentiated WDM Mesh Networks
Ni, Wenda
Zheng, Xiaoping
Zhu, Chulei
Li, Yanhe
Guo, Yili
Zhang, Hanyi
JOURNAL OF LIGHTWAVE TECHNOLOGY, 2008, 26 (13-16) : 2831 - 2839
[10] Resource-efficient and sustainable
Konstruktion, 2016, 68 (03):

← 1 2 3 4 5 →