ChainsFormer: A Chain Latency-Aware Resource Provisioning Approach for Microservices Cluster

被引:4
|
作者
Song, Chenghao [1 ]
Xu, Minxian [1 ]
Ye, Kejiang [1 ]
Wu, Huaming [2 ]
Gill, Sukhpal Singh [3 ]
Buyya, Rajkumar [4 ]
Xu, Chengzhong [5 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[2] Tianjin Univ, Tianjin, Peoples R China
[3] Queen Mary Univ London, London, England
[4] Univ Melbourne, Sch Comp & Informat Syst, Cloud Comp & Distributed Syst CLOUDS Lab, Melbourne, Australia
[5] Univ Macau, State Key Lab IoTSC, Taipa, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
Microservice; Chain; Reinforcement learning; Kubernetes; Scaling;
D O I
10.1007/978-3-031-48421-6_14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The trend towards transitioning from monolithic applications to microservices has been widely embraced in modern distributed systems and applications. This shift has resulted in the creation of lightweight, fine-grained, and self-contained microservices. Multiple microservices can be linked together via calls and inter-dependencies to form complex functions. One of the challenges in managing microservices is provisioning the optimal amount of resources for microservices in the chain to ensure application performance while improving resource usage efficiency. This paper presents ChainsFormer, a framework that analyzes microservice inter-dependencies to identify critical chains and nodes, and provision resources based on reinforcement learning. To analyze chains, ChainsFormer utilizes light-weight machine learning techniques to address the dynamic nature of microservice chains and workloads. For resource provisioning, a reinforcement learning approach is used that combines vertical and horizontal scaling to determine the amount of allocated resources and the number of replicates. We evaluate the effectiveness of ChainsFormer using realistic applications and traces on a real testbed based on Kubernetes. Our experimental results demonstrate that ChainsFormer can reduce response time by up to 26% and improve processed requests per second by 8% compared with state-of-the-art techniques.
引用
收藏
页码:197 / 211
页数:15
相关论文
共 50 条
  • [21] Demand Response Aware Cluster Resource Provisioning for Parallel Applications
    Wang, Chen
    de Groot, Martin
    2012 IEEE THIRD INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2012, : 127 - 132
  • [22] Practice of Alibaba cloud on elastic resource provisioning for large-scale microservices cluster
    Xu, Minxian
    Yang, Lei
    Wang, Yang
    Gao, Chengxi
    Wen, Linfeng
    Xu, Guoyao
    Zhang, Liping
    Ye, Kejiang
    Xu, Chengzhong
    SOFTWARE-PRACTICE & EXPERIENCE, 2024, 54 (01): : 39 - 57
  • [23] Latency-Aware Resource-Efficient Virtual Network Embedding in Software Defined Networking
    Yan, Zihui
    Wei, Ning
    Jin, Qizhen
    Zhou, Xiaobo
    2019 28TH WIRELESS AND OPTICAL COMMUNICATIONS CONFERENCE (WOCC), 2019, : 82 - 86
  • [24] Offloading Demand Prediction-Driven Latency-Aware Resource Reservation in Edge Networks
    Zhang, Jianhui
    Wang, Jiacheng
    Yuan, Zhongyin
    Zhang, Wanqing
    Liu, Liming
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (15) : 13826 - 13836
  • [25] Latency-Aware Fair Scheduling for Spatial Reuse in WLANs: A Lyapunov Optimization Approach
    Kotera, Shunnosuke
    Yin, Bo
    Yamamoto, Koji
    Nishio, Takayuki
    Morikura, Masahiro
    Abeysekera, Hirantha
    2021 IEEE 18TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC), 2021,
  • [26] An efficient latency aware resource provisioning in cloud assisted mobile edge framework
    Mulinti, Rajasekhar Bandapalle
    Nagendra, M.
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2021, 14 (03) : 1044 - 1057
  • [27] Stringer: Balancing Latency and Resource Usage in Service Function Chain Provisioning
    Chua, Freddy C.
    Ward, Julie
    Zhang, Ying
    Sharma, Puneet
    Huberman, Bernardo A.
    IEEE INTERNET COMPUTING, 2016, 20 (06) : 22 - 31
  • [28] An efficient latency aware resource provisioning in cloud assisted mobile edge framework
    Rajasekhar Bandapalle Mulinti
    M. Nagendra
    Peer-to-Peer Networking and Applications, 2021, 14 : 1044 - 1057
  • [29] Latency-Aware Service Function Chain Placement in 5G Mobile Networks
    Harutyunyan, Davit
    Shahriar, Nashid
    Boutaba, Raouf
    Riggio, Roberto
    PROCEEDINGS OF THE 2019 IEEE CONFERENCE ON NETWORK SOFTWARIZATION (NETSOFT 2019), 2019, : 133 - 141
  • [30] An effective approach of latency-aware fog smart gateways deployment for IoT services
    Maiti, Prasenjit
    Apat, Hemant Kumar
    Sahoo, Bibhudatta
    Turuk, Ashok Kumar
    INTERNET OF THINGS, 2019, 8