Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment

被引：0

作者：

Peng, Zijun ^{[1
,2
]}

Tang, Bing ^{[1
,2
]}

Xu, Wei ^{[1
,2
]}

Yang, Qing ^{[3
]}

Hussaini, Ehsanullah ^{[1
,2
]}

Xiao, Yuqiang ^{[1
,2
]}

Li, Haiyan ^{[1
,2
]}

机构：

[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China

[2] Hunan Key Lab Serv Comp & Novel Software Technol, Xiangtan 411201, Peoples R China

[3] Guangzhou Maritime Univ, Ctr Network & Educ Technol, Guangzhou 510725, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Auto-Scaling; Microservice; Workload Prediction; Cloud-Edge Collaboration;

D O I：

10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00112

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Cloud computing centrally consolidates hardware and computing resources, offering efficient and cost-effective services. However, as cloud computing centers are predominantly built and operated in a fully centralized fashion, the increased distance between these centers and users can lead to a decline in service quality. Real-time interaction and high business continuity are crucial in scenarios like traffic monitoring, AR/VR applications, and the Internet of Things (IoT). Edge computing is better suited to meet the demands of such latency-sensitive business needs. By analyzing and processing massive data directly at edge computing nodes, which focus on network edge devices, reliance on transmission resources is reduced, consequently improving the overall quality and performance of services. Nevertheless, resource-constrained edge nodes require efficient utilization of available infrastructure capacity to ensure specific service level objectives (SLO) for applications. Therefore, this paper introduces XScale, a cloud-edge collaborative system that enables microservices to adaptively scale elastically. XScale applies a Bi-LSTM with an attention mechanism to forecast the workload of microservices. When combined with mechanisms designed to handle burst traffic and a cloud-edge collaborative load forwarding strategy, it achieves both adaptive elastic scaling and proactive load forwarding. Experimental results, obtained using real-world microservice workloads, indicate that the XScale system can significantly reduce SLO violations by 88%, increase resource utilization by 15%, and decrease average response time by 21% when compared to existing advanced reactive scaling methods.

引用

页码：608 / 615

页数：8

共 50 条

[21] The Survival Analysis of Big Data Application Over Auto-scaling Cloud Environment
Rajput, R. S.
Goyal, Dinesh
Pant, Anjali
EMERGING TECHNOLOGIES IN COMPUTER ENGINEERING: MICROSERVICES IN BIG DATA ANALYTICS, 2019, 985 : 155 - 166
[22] A cost-aware auto-scaling approach using the workload prediction in service clouds
Yang, Jingqi
Liu, Chuanchang
Shang, Yanlei
Cheng, Bo
Mao, Zexiang
Liu, Chunhong
Niu, Lisha
Chen, Junliang
INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
[23] Horizontal Auto-Scaling in Edge Computing Environment using Online Machine Learning
da Silva, Thiago Pereira
Rocha Neto, Aluizio F.
Batista, Thais Vasconcelos
Lopes, Frederico A. S.
Delicato, Flavia C.
Pires, Paulo F.
2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 161 - 168
[24] Efficient Auto-scaling for Host Load Prediction through VM migration in Cloud
Verma, Shveta
Bala, Anju
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (04)
[25] Auto-scaling techniques for IoT-based cloud applications: a review
Verma, Shveta
Bala, Anju
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2425 - 2459
[26] Auto-Scaling Cloud-Based Memory-Intensive Applications
Novak, Joe
Kasera, Sneha Kumar
Stutsman, Ryan
2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 229 - 237
[27] Auto-scaling techniques for IoT-based cloud applications: a review
Shveta Verma
Anju Bala
Cluster Computing, 2021, 24 : 2425 - 2459
[28] Performance modelling and verification of cloud-based auto-scaling policies
Evangelidis, Alexandros
Parker, David
Bahsoon, Rami
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 629 - 638
[29] CRUPA: A Container Resource Utilization Prediction Algorithm for Auto-Scaling Based on Time Series Analysis
Meng, Yang
Rao, Ruonan
Zhang, Xin
Hong, Pei
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 468 - 472
[30] Efficient Auto-scaling Approach in the Telco Cloud using Self-learning Algorithm
Tang, Pengcheng
Li, Fei
Zhou, Wei
Hu, Weihua
Yang, Li
2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,

← 1 2 3 4 5 →