ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster

被引:4
|
作者
Song, Chenghao [1 ]
Xu, Minxian [1 ]
Ye, Kejiang [1 ]
Wu, Huaming [2 ]
Gill, Sukhpal Singh [3 ]
Buyya, Rajkumar [4 ]
Xu, Chengzhong [5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Queen Mary Univ London, London, England
[4] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Australia
[5] Univ Macau, State Key Lab IoTSC, Taipa, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
Microservice; Chain; Reinforcement learning; Kubernetes; Scaling;
D O I
10.1007/978-3-031-48421-6_14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The trend towards transitioning from monolithic applications to microservices has been widely embraced in modern distributed systems and applications. This shift has resulted in the creation of lightweight, fine-grained, and self-contained microservices. Multiple microservices can be linked together via calls and inter-dependencies to form complex functions. One of the challenges in managing microservices is provisioning the optimal amount of resources for microservices in the chain to ensure application performance while improving resource usage efficiency. This paper presents ChainsFormer, a framework that analyzes microservice inter-dependencies to identify critical chains and nodes, and provision resources based on reinforcement learning. To analyze chains, ChainsFormer utilizes light-weight machine learning techniques to address the dynamic nature of microservice chains and workloads. For resource provisioning, a reinforcement learning approach is used that combines vertical and horizontal scaling to determine the amount of allocated resources and the number of replicates. We evaluate the effectiveness of ChainsFormer using realistic applications and traces on a real testbed based on Kubernetes. Our experimental results demonstrate that ChainsFormer can reduce response time by up to 26% and improve processed requests per second by 8% compared with state-of-the-art techniques.
引用
收藏
页码:197 / 211
页数:15
相关论文
共 50 条
  • [1] Latency-Aware Kubernetes Scheduling for Microservices Orchestration at the Edge
    Centofanti, C.
    Tiberti, W.
    Marotta, A.
    Graziosi, F.
    Cassioli, D.
    2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 426 - 431
  • [2] Latency-aware Traffic Provisioning for Content Delivery Networks
    Hei, Jinghao
    Than, Huiyou
    Zhang, Pengfei
    Tan, Haisheng
    2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 11 - 18
  • [3] CLAP: Cost and Latency-Aware Placement of microservices on the computing continuum
    Rao, Kunal
    Coviello, Giuseppe
    Chakradhar, Srimat
    2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW 2024, 2024, : 68 - 77
  • [4] Latency-aware VNF Chain Deployment with Efficient Resource Reuse at Network Edge
    Jin, Panpan
    Fei, Xincai
    Zhang, Qixia
    Liu, Fangming
    Li, Bo
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 267 - 276
  • [5] Latency-aware decentralized resource management for IoT applications
    Avasalcai, Cosmin
    Dustdar, Schahram
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS (IOT'18), 2018,
  • [6] Energy and Latency-aware Resource Reconfiguration in Fog Environments
    Godinho, Noe
    Silva, Henrique
    Curado, Marilia
    Paquete, Luis
    2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2020,
  • [7] Latency-aware Virtualized Network Function provisioning for distributed edge clouds
    Son, Jungmin
    Buyya, Rajkumar
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 : 24 - 31
  • [8] AutoMan: Resource-efficient provisioning with tail latency guarantees for microservices
    Cai, Binlei
    Wang, Bin
    Yang, Meihong
    Guo, Qin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 61 - 75
  • [9] Latency-Aware Container Scheduling in Edge Cluster Upgrades: A Deep Reinforcement Learning Approach
    Cui, Hanshuai
    Tang, Zhiqing
    Lou, Jiong
    Jia, Weijia
    Zhao, Wei
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2530 - 2543
  • [10] PRESTO: a latency-aware power-capping orchestrator for cloud-native microservices
    Brondolin, Rolando
    Santambrogio, Marco D.
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS (ACSOS 2020), 2020, : 11 - 20