Graph Neural Network-Based SLO-Aware Proactive Resource Autoscaling Framework for Microservices

Cited by: 1

Authors:

Park, Jinwoo [1]

Choi, Byungkwon [1]

Lee, Chunghan [2]

Han, Dongsu [1]

Affiliations:

[1] Korea Advanced Institute of Science and Technology (KAIST), School of Electrical Engineering, Daejeon 34141, South Korea

[2] Toyota Motor Co Ltd, Tokyo 1000004, Japan

Source:

IEEE/ACM Transactions on Networking | 2024, Vol. 32, Issue 04

Keywords:

Microservice architectures; Resource management; Tail; Surges; Delays; Costs; Cloud computing; Microservices; resources optimization; graph neural networks; applied machine learning; autoscaler

DOI:

10.1109/TNET.2024.3393427

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Microservices are an architectural style widely adopted in latency-sensitive cloud applications. As with monolithic applications, autoscaling has attracted operators' attention as a way to manage the resource utilization of microservices. However, it remains challenging to optimize resources against a latency service-level objective (SLO) without human intervention. In this paper, we present GRAF, a graph neural network-based, SLO-aware, proactive resource autoscaling framework that minimizes total CPU resources while satisfying the latency SLO. GRAF leverages front-end workload, distributed tracing data, and machine learning approaches to (a) observe and estimate the impact of traffic changes, (b) find optimal resource combinations, and (c) allocate resources proactively. Experiments using various open-source benchmarks demonstrate that GRAF successfully targets the latency SLO while saving up to 19% of total CPU resources compared to a fine-tuned autoscaler. GRAF also handles a traffic surge with 36% fewer resources while achieving up to 2.6x faster tail latency convergence compared to the Kubernetes autoscaler. Moreover, we verify the scalability of GRAF on large-scale deployments, where GRAF saves 21.6% of CPU resources and 25.4% of memory resources.
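The goal stated in the abstract (minimize total CPU resources while satisfying the latency SLO) can be read as a constrained optimization problem. The formulation below is only a sketch of that reading, not the paper's own notation. All symbols are assumptions introduced for illustration: n is the number of microservices, r_i is the CPU allocated to microservice i, w is the front-end workload, \hat{L} is a learned latency estimate, and L_SLO is the latency target.

```latex
\min_{r_1,\dots,r_n} \; \sum_{i=1}^{n} r_i
\quad \text{subject to} \quad
\hat{L}(w, r_1, \dots, r_n) \le L_{\mathrm{SLO}},
\qquad r_i \ge 0 \;\; (i = 1,\dots,n)
```

Under this reading, steps (a) through (c) of the abstract would correspond to estimating \hat{L} for the incoming workload, solving for the r_i, and applying the allocation before the latency target is violated.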

Pages: 3331-3346

Page count: 16
