AutoMan: Resource-efficient provisioning with tail latency guarantees for microservices

被引:5
|
作者
Cai, Binlei [1 ,2 ]
Wang, Bin [1 ,2 ]
Yang, Meihong [1 ,2 ]
Guo, Qin [3 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Shandong Comp Sci Ctr, Natl Supercomp Ctr Jinan, Jinan 250103, Peoples R China
[2] Shandong Prov Key Lab Comp Networks, Jinan 250103, Peoples R China
[3] Shandong Jianzhu Univ, Sch Sci, Jinan 250101, Peoples R China
基金
美国国家科学基金会;
关键词
Cloud computing; Microservices; Resource management; Reinforcement learning; Quality of service; Tail latency; ALLOCATION;
D O I
10.1016/j.future.2023.01.014
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Modern user-facing services are progressively evolving from large monolithic applications to complex graphs of loosely-coupled microservices. While microservice architecture greatly improves the efficiency of development and operation, it also complicates resource allocation and performance guarantee due to complex dependencies across different microservices. To prevent resource wastage and ensure user satisfaction, we present AutoMan, a learning-driven resource manager for microservices that enables much more efficient resource provisioning while guaranteeing the end-to-end tail latency Service Level Objective (SLO). AutoMan leverages a multi-agent deep deterministic policy gradient (MADDPG)-based method to capture the dependencies across different microservices and to allocate a proper amount of resources to each microservice subject to the target end-to-end tail latency SLO. During runtime, it further proactively identifies the critical microservices responsible for performance anomaly by deriving partial SLOs mathematically, and performs dynamic reprovisioning to mitigate the potential SLO violations. Testbed experiments show that AutoMan can save CPU and memory resources by up to 49.6% and 29.1% on average, while guaranteeing the same end-to-end tail latency objective.(c) 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:61 / 75
页数:15
相关论文
共 50 条
  • [1] Less Provisioning: A Hybrid Resource Scaling Engine for Long-Running Services With Tail Latency Guarantees
    Cai, Binlei
    Li, Keqiu
    Zhao, Laiping
    Zhang, Rongqi
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (03) : 1941 - 1957
  • [2] A Resource-Efficient Predictive Resource Provisioning System in Cloud Systems
    Shen, Haiying
    Chen, Liuhua
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (12) : 3886 - 3900
  • [3] Erms: Efficient Resource Management for Shared Microservices with SLA Guarantees
    Luo, Shutian
    Xu, Huanle
    Ye, Kejiang
    Xu, Guoyao
    Zhang, Liping
    He, Jian
    Yang, Guodong
    Xu, Chengzhong
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 1, ASPLOS 2023, 2023, : 62 - 77
  • [4] HCloud: Resource-Efficient Provisioning in Shared Cloud Systems
    Delimitrou, Christina
    Kozyrakis, Christos
    ACM SIGPLAN NOTICES, 2016, 51 (04) : 473 - 488
  • [5] ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster
    Song, Chenghao
    Xu, Minxian
    Ye, Kejiang
    Wu, Huaming
    Gill, Sukhpal Singh
    Buyya, Rajkumar
    Xu, Chengzhong
    SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I, 2023, 14419 : 197 - 211
  • [6] Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency Guarantees
    Cai, Binlei
    Zhang, Rongqi
    Zhao, Laiping
    Li, Keqiu
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,
  • [7] Continual flow pipelines: Achieving resource-efficient latency tolerance
    Srinivasan, ST
    Rajwar, R
    Akkary, H
    Gandhi, A
    Upton, M
    IEEE MICRO, 2004, 24 (06) : 62 - 73
  • [8] Towards Resource-Efficient Cloud Systems: Avoiding Over-Provisioning in Demand-Prediction Based Resource Provisioning
    Chen, Liuhua
    Shen, Haiying
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 184 - 193
  • [9] Achieving Resource-Efficient Survivable Provisioning in Service Differentiated WDM Mesh Networks
    Ni, Wenda
    Zheng, Xiaoping
    Zhu, Chulei
    Li, Yanhe
    Guo, Yili
    Zhang, Hanyi
    JOURNAL OF LIGHTWAVE TECHNOLOGY, 2008, 26 (13-16) : 2831 - 2839
  • [10] Resource-efficient and sustainable
    Konstruktion, 2016, 68 (03):