Graph Neural Network-Based SLO-Aware Proactive Resource Autoscaling Framework for Microservices

被引:1
|
作者
Park, Jinwoo [1 ]
Choi, Byungkwon [1 ]
Lee, Chunghan [2 ]
Han, Dongsu [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
[2] Toyota Motor Co Ltd, Tokyo 1000004, Japan
关键词
Microservice architectures; Resource management; Tail; Surges; Delays; Costs; Cloud computing; Microservices; resources optimization; graph neural networks; applied machine learning; cloud computing; autoscaler;
D O I
10.1109/TNET.2024.3393427
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Microservice is an architectural style widely adopted in various latency-sensitive cloud applications. Similar to the monolith, autoscaling has attracted the attention of operators for managing the resource utilization of microservices. However, it is still challenging to optimize resources in terms of latency service-level-objective (SLO) without human intervention. In this paper, we present GRAF, a graph neural network-based SLO-aware proactive resource autoscaling framework for minimizing total CPU resources while satisfying latency SLO. GRAF leverages front-end workload, distributed tracing data, and machine learning approaches to (a) observe/estimate the impact of traffic change (b) find optimal resource combinations (c) make proactive resource allocation. Experiments using various open-source benchmarks demonstrate that GRAF successfully targets latency SLO while saving up to 19% of total CPU resources compared to the fine-tuned autoscaler. GRAF also handles a traffic surge with 36% fewer resources while achieving up to 2.6x faster tail latency convergence compared to the Kubernetes autoscaler. Moreover, we verify the scalability of GRAF on large-scale deployments, where GRAF saves 21.6% and 25.4% for CPU resources and memory resources, respectively.
引用
收藏
页码:3331 / 3346
页数:16
相关论文
共 50 条
  • [1] GRAF: A Graph Neural Network based Proactive Resource Allocation Framework for SLO-Oriented Microservices
    Park, Jinwoo
    Choi, Byungkwon
    Lee, Chunghan
    Han, Dongsu
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, CONEXT 2021, 2021, : 154 - 167
  • [2] LSRAM: A Lightweight Autoscaling and SLO Resource Allocation Framework for Microservices Based on Gradient Descent
    Hu, Kan
    Xu, Minxian
    Ye, Kejiang
    Xu, Chengzhong
    SOFTWARE-PRACTICE & EXPERIENCE, 2025, 55 (04): : 714 - 730
  • [3] A context-aware attention and graph neural network-based multimodal framework for misogyny detection
    Rehman, Mohammad Zia Ur
    Zahoor, Sufyaan
    Manzoor, Areeb
    Maqbool, Musharaf
    Kumar, Nagendra
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (01)
  • [4] Graph-PHPA: Graph-based Proactive Horizontal Pod Autoscaling for Microservices using LSTM-GNN
    Nguyen, Hoa X.
    Zhu, Shaoshu
    Liu, Mingming
    PROCEEDINGS OF THE 2022 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING (IEEE CLOUDNET 2022), 2022, : 237 - 241
  • [5] A Graph Neural Network-Based Context-Aware Framework for Sentiment Analysis Classification in Chinese Microblogs
    Jin, Zhesheng
    Zhang, Yunhua
    MATHEMATICS, 2025, 13 (06)
  • [6] Graph Neural Network-Based Detailed Placement Optimization Framework
    Lim, Dho Ui
    Park, Heechun
    2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
  • [7] MTBullyGNN: A Graph Neural Network-Based Multitask Framework for Cyberbullying Detection
    Maity, Krishanu
    Sen, Tanmay
    Saha, Sriparna
    Bhattacharyya, Pushpak
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 849 - 858
  • [8] A graph neural network framework based on preference-aware graph diffusion for recommendation
    Shu, Tao
    Shi, Lei
    Zhu, Chuangying
    Liu, Xia
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [9] RESCAPE: A Resource Estimation System for Microservices with Graph Neural Network and Profile Engine
    Wang, Jinghao
    Wang, Guangzu
    Wo, Tianyu
    Wang, Xu
    Yang, Renyu
    2024 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING, JCC, 2024, : 37 - 44
  • [10] A Graph Neural Network and Pointer Network-Based Approach for QoS-Aware Service Composition
    Wang, Xiao
    Xu, Hanchuan
    Wang, Xianzhi
    Xu, Xiaofei
    Wang, Zhongjie
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (03) : 1589 - 1603