ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster

Cited by: 4

Authors:

Song, Chenghao [1]

Xu, Minxian [1]

Ye, Kejiang [1]

Wu, Huaming [2]

Gill, Sukhpal Singh [3]

Buyya, Rajkumar [4]

Xu, Chengzhong [5]

Affiliations:

[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China

[2] Tianjin Univ, Tianjin, Peoples R China

[3] Queen Mary Univ London, London, England

[4] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Australia

[5] Univ Macau, State Key Lab IoTSC, Taipa, Macau, Peoples R China

Source:

SERVICE-ORIENTED COMPUTING, ICSOC 2023, PT I | 2023 / Vol. 14419

Funding:

National Natural Science Foundation of China;

Keywords:

Microservice; Chain; Reinforcement learning; Kubernetes; Scaling;

DOI:

10.1007/978-3-031-48421-6_14

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

The trend towards transitioning from monolithic applications to microservices has been widely embraced in modern distributed systems and applications. This shift has resulted in the creation of lightweight, fine-grained, and self-contained microservices. Multiple microservices can be linked together via calls and inter-dependencies to form complex functions. One of the challenges in managing microservices is provisioning the optimal amount of resources for microservices in the chain to ensure application performance while improving resource usage efficiency. This paper presents ChainsFormer, a framework that analyzes microservice inter-dependencies to identify critical chains and nodes, and provisions resources based on reinforcement learning. To analyze chains, ChainsFormer utilizes lightweight machine learning techniques to address the dynamic nature of microservice chains and workloads. For resource provisioning, a reinforcement learning approach is used that combines vertical and horizontal scaling to determine the amount of allocated resources and the number of replicas. We evaluate the effectiveness of ChainsFormer using realistic applications and traces on a real testbed based on Kubernetes. Our experimental results demonstrate that ChainsFormer can reduce response time by up to 26% and improve processed requests per second by 8% compared with state-of-the-art techniques.

Pages: 197-211

Page count: 15

50 items in total

[1] Latency-Aware Kubernetes Scheduling for Microservices Orchestration at the Edge
Centofanti, C.
Tiberti, W.
Marotta, A.
Graziosi, F.
Cassioli, D.
2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 426 - 431
[2] Latency-aware Traffic Provisioning for Content Delivery Networks
Hei, Jinghao
Than, Huiyou
Zhang, Pengfei
Tan, Haisheng
2022 8TH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS, BIGCOM, 2022, : 11 - 18
[3] CLAP: Cost and Latency-Aware Placement of microservices on the computing continuum
Rao, Kunal
Coviello, Giuseppe
Chakradhar, Srimat
2024 IEEE 24TH INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW 2024, 2024, : 68 - 77
[4] Latency-aware VNF Chain Deployment with Efficient Resource Reuse at Network Edge
Jin, Panpan
Fei, Xincai
Zhang, Qixia
Liu, Fangming
Li, Bo
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2020, : 267 - 276
[5] Latency-aware decentralized resource management for IoT applications
Avasalcai, Cosmin
Dustdar, Schahram
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS (IOT'18), 2018,
[6] Energy and Latency-aware Resource Reconfiguration in Fog Environments
Godinho, Noe
Silva, Henrique
Curado, Marilia
Paquete, Luis
2020 IEEE 19TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2020,
[7] Latency-aware Virtualized Network Function provisioning for distributed edge clouds
Son, Jungmin
Buyya, Rajkumar
JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 : 24 - 31
[8] AutoMan: Resource-efficient provisioning with tail latency guarantees for microservices
Cai, Binlei
Wang, Bin
Yang, Meihong
Guo, Qin
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 61 - 75
[9] Latency-Aware Container Scheduling in Edge Cluster Upgrades: A Deep Reinforcement Learning Approach
Cui, Hanshuai
Tang, Zhiqing
Lou, Jiong
Jia, Weijia
Zhao, Wei
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2530 - 2543
[10] PRESTO: a latency-aware power-capping orchestrator for cloud-native microservices
Brondolin, Rolando
Santambrogio, Marco D.
2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMIC COMPUTING AND SELF-ORGANIZING SYSTEMS (ACSOS 2020), 2020, : 11 - 20
