Heterogeneity-Aware Proactive Elastic Resource Allocation for Serverless Applications

被引:8
作者
Feng, Binbin [1 ,2 ,3 ]
Ding, Zhijun [2 ,4 ,5 ]
Jiang, Changjun [4 ,5 ]
机构
[1] Tongji Univ, Key Lab Embedded Syst & Serv Comp, Shanghai 201804, Peoples R China
[2] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[3] Univ Macau, IOTSC Lab, Macau, Peoples R China
[4] Tongji Univ, Minist Educ, Key Lab Embedded Syst & Serv Comp, Shanghai 201804, Peoples R China
[5] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
基金
中国国家自然科学基金;
关键词
Servers; Costs; Predictive models; Estimation; Runtime; Quality of service; Media; Instance allocation; NUMA; resource estimation; server scaling; serverless; workflow; workload prediction; MACHINES;
D O I
10.1109/TSC.2024.3350711
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Serverless computing is a popular cloud computing model that offers on-demand resource allocation and pay-as-you-go application execution. However, there are still challenges in allocating resources for workflow applications: inaccurate and inefficient resource estimation, high-latency inter-function communication, and long server readiness time. Therefore, we propose the heterogeneity-aware Proactive serverLess wOrkflow Elastic Allocation method (PLOEA) to address these issues and optimize infrastructure costs for cloud service providers (CSPs) while meeting the diverse needs of developers. Specifically, we propose a resource configuration estimation method for heterogeneous workflow applications that builds an ensemble multi-task expert classifier to analyze individual and common resource usage patterns, ensuring estimation accuracy and efficiency. Further, we propose a group allocation strategy for multiple applications that optimizes the spatiotemporal distribution of instances by considering the allocation urgency, communication affinity between functions, and the multi-core architecture of servers. Furthermore, we present a proactive server elastic scaling method that senses workload features, including workload level, trend, and magnitude changes, and combines them with CSP's attention differences to guide the server scaling size. Finally, experiments based on public datasets prove that PLOEA provides better service quality and cost efficiency than existing methods.
引用
收藏
页码:2473 / 2487
页数:15
相关论文
共 36 条
[1]   A Survey on Scheduling Strategies for Workflows in Cloud Environment and Emerging Trends [J].
Adhikari, Mainak ;
Amgoth, Tarachand ;
Srirama, Satish Narayana .
ACM COMPUTING SURVEYS, 2019, 52 (04)
[2]  
Akhtar N, 2020, IEEE INFOCOM SER, P129, DOI [10.1109/INFOCOM41043.2020.9155363, 10.1109/infocom41043.2020.9155363]
[3]   Triggerflow: Trigger-based orchestration of serverless workflows [J].
Arjona, Aitor ;
Lopez, Pedro Garcia ;
Sampe, Josep ;
Slominski, Aleksander ;
Villard, Lionel .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 124 :215-229
[4]  
Breiman L., 2001, MACH LEARN, V45, P5
[5]   Support Vector Machines for classification and regression [J].
Brereton, Richard G. ;
Lloyd, Gavin R. .
ANALYST, 2010, 135 (02) :230-267
[6]  
Burkat K, 2021, P40
[7]  
Chung JY, 2014, Arxiv, DOI [arXiv:1412.3555, DOI 10.48550/ARXIV.1412.3555]
[8]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[9]   Real-time resource scaling platform for Big Data workloads on serverless environments [J].
Enes, Jonatan ;
Exposito, Roberto R. ;
Tourino, Juan .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 :361-379
[10]   Elastic Resource Provisioning Using Data Clustering in Cloud Service Platform [J].
Fei, Bowen ;
Zhu, Xiaomin ;
Liu, Daqian ;
Chen, Junjie ;
Bao, Weidong ;
Liu, Ling .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (03) :1578-1591