Microservice Auto-Scaling Algorithm Based on Workload Prediction in Cloud-Edge Collaboration Environment

被引:0
作者
Peng, Zijun [1 ,2 ]
Tang, Bing [1 ,2 ]
Xu, Wei [1 ,2 ]
Yang, Qing [3 ]
Hussaini, Ehsanullah [1 ,2 ]
Xiao, Yuqiang [1 ,2 ]
Li, Haiyan [1 ,2 ]
机构
[1] Hunan Univ Sci & Technol, Sch Comp Sci & Engn, Xiangtan 411201, Peoples R China
[2] Hunan Key Lab Serv Comp & Novel Software Technol, Xiangtan 411201, Peoples R China
[3] Guangzhou Maritime Univ, Ctr Network & Educ Technol, Guangzhou 510725, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Auto-Scaling; Microservice; Workload Prediction; Cloud-Edge Collaboration;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00112
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cloud computing centrally consolidates hardware and computing resources, offering efficient and cost-effective services. However, as cloud computing centers are predominantly built and operated in a fully centralized fashion, the increased distance between these centers and users can lead to a decline in service quality. Real-time interaction and high business continuity are crucial in scenarios like traffic monitoring, AR/VR applications, and the Internet of Things (IoT). Edge computing is better suited to meet the demands of such latency-sensitive business needs. By analyzing and processing massive data directly at edge computing nodes, which focus on network edge devices, reliance on transmission resources is reduced, consequently improving the overall quality and performance of services. Nevertheless, resource-constrained edge nodes require efficient utilization of available infrastructure capacity to ensure specific service level objectives (SLO) for applications. Therefore, this paper introduces XScale, a cloud-edge collaborative system that enables microservices to adaptively scale elastically. XScale applies a Bi-LSTM with an attention mechanism to forecast the workload of microservices. When combined with mechanisms designed to handle burst traffic and a cloud-edge collaborative load forwarding strategy, it achieves both adaptive elastic scaling and proactive load forwarding. Experimental results, obtained using real-world microservice workloads, indicate that the XScale system can significantly reduce SLO violations by 88%, increase resource utilization by 15%, and decrease average response time by 21% when compared to existing advanced reactive scaling methods.
引用
收藏
页码:608 / 615
页数:8
相关论文
共 50 条
  • [21] The Survival Analysis of Big Data Application Over Auto-scaling Cloud Environment
    Rajput, R. S.
    Goyal, Dinesh
    Pant, Anjali
    EMERGING TECHNOLOGIES IN COMPUTER ENGINEERING: MICROSERVICES IN BIG DATA ANALYTICS, 2019, 985 : 155 - 166
  • [22] A cost-aware auto-scaling approach using the workload prediction in service clouds
    Yang, Jingqi
    Liu, Chuanchang
    Shang, Yanlei
    Cheng, Bo
    Mao, Zexiang
    Liu, Chunhong
    Niu, Lisha
    Chen, Junliang
    INFORMATION SYSTEMS FRONTIERS, 2014, 16 (01) : 7 - 18
  • [23] Horizontal Auto-Scaling in Edge Computing Environment using Online Machine Learning
    da Silva, Thiago Pereira
    Rocha Neto, Aluizio F.
    Batista, Thais Vasconcelos
    Lopes, Frederico A. S.
    Delicato, Flavia C.
    Pires, Paulo F.
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 161 - 168
  • [24] Efficient Auto-scaling for Host Load Prediction through VM migration in Cloud
    Verma, Shveta
    Bala, Anju
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (04)
  • [25] Auto-scaling techniques for IoT-based cloud applications: a review
    Verma, Shveta
    Bala, Anju
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2021, 24 (03): : 2425 - 2459
  • [26] Auto-Scaling Cloud-Based Memory-Intensive Applications
    Novak, Joe
    Kasera, Sneha Kumar
    Stutsman, Ryan
    2020 IEEE 13TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2020), 2020, : 229 - 237
  • [27] Auto-scaling techniques for IoT-based cloud applications: a review
    Shveta Verma
    Anju Bala
    Cluster Computing, 2021, 24 : 2425 - 2459
  • [28] Performance modelling and verification of cloud-based auto-scaling policies
    Evangelidis, Alexandros
    Parker, David
    Bahsoon, Rami
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 87 : 629 - 638
  • [29] CRUPA: A Container Resource Utilization Prediction Algorithm for Auto-Scaling Based on Time Series Analysis
    Meng, Yang
    Rao, Ruonan
    Zhang, Xin
    Hong, Pei
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 468 - 472
  • [30] Efficient Auto-scaling Approach in the Telco Cloud using Self-learning Algorithm
    Tang, Pengcheng
    Li, Fei
    Zhou, Wei
    Hu, Weihua
    Yang, Li
    2015 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2015,