WorkloadDiff: Conditional Denoising Diffusion Probabilistic Models for Cloud Workload Prediction

被引:2
作者
Zheng, Weiping [1 ]
Chen, Zongxiao [1 ]
Zheng, Kaiyuan [1 ]
Zheng, Weijian [1 ]
Chen, Yiqi [1 ]
Fan, Xiaomao [2 ]
机构
[1] South China Normal Univ, Sch Comp Sci, Guangzhou 510630, Peoples R China
[2] Shenzhen Technol Univ, Coll Big Data & Internet, Shenzhen 518122, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Predictive models; Cloud computing; Diffusion models; Time series analysis; Data models; Hidden Markov models; Forecasting; Cloud workload prediction; diffusion models; resource management; resampling; ARIMA;
D O I
10.1109/TCC.2024.3461649
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate workload forecasting plays a crucial role in optimizing resource allocation, enhancing performance, and reducing energy consumption in cloud data centers. Deep learning-based methods have emerged as the dominant approach in this field, exhibiting exceptional performance. However, most existing methods lack the ability to quantify confidence, limiting their practical decision-making utility. To address this limitation, we propose a novel denoising diffusion probabilistic model (DDPM)-based method, termed WorkloadDiff, for multivariate probabilistic workload prediction. WorkloadDiff leverages both original and noisy signals from input conditions using a two-path neural network. Additionally, we introduce a multi-scale feature extraction method and an adaptive fusion approach to capture diverse temporal patterns within the workload. To enhance consistency between conditions and predicted values, we incorporate a resampling strategy into the inference of WorkloadDiff. Extensive experiments conducted on four public datasets demonstrate the superior performance of WorkloadDiff over all baseline models, establishing it as a robust tool for resource management in cloud data centers.
引用
收藏
页码:1291 / 1304
页数:14
相关论文
共 51 条
[1]  
Alcaraz JML, 2022, Arxiv, DOI [arXiv:2208.09399, 10.48550/arXiv.2208.09399]
[2]  
Arbat S, 2022, AAAI CONF ARTIF INTE, P12433
[3]   Adaptive Prediction Models for Data Center Resources Utilization Estimation [J].
Baig, Shuja-ur-Rehman ;
Iqbal, Waheed ;
Berral, Josep Lluis ;
Erradi, Abdelkarim ;
Carrera, David .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (04) :1681-1693
[4]   Multi-step-ahead time series prediction using multiple-output support vector regression [J].
Bao, Yukun ;
Xiong, Tao ;
Hu, Zhongyi .
NEUROCOMPUTING, 2014, 129 :482-493
[5]   Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications' QoS [J].
Calheiros, Rodrigo N. ;
Masoumi, Enayat ;
Ranjan, Rajiv ;
Buyya, Rajkumar .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2015, 3 (04) :449-458
[6]   RPTCN: Resource Prediction for High-dynamic Workloads in Clouds based on Deep Learning [J].
Chen, Wenyan ;
Lu, Chengzhi ;
Ye, Kejiang ;
Wang, Yang ;
Xu, Cheng-Zhong .
2021 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2021), 2021, :59-69
[7]   Towards Accurate Prediction for High-Dimensional and Highly-Variable Cloud Workloads with Deep Learning [J].
Chen, Zheyi ;
Hu, Jia ;
Min, Geyong ;
Zomaya, Albert Y. ;
El-Ghazawi, Tarek .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (04) :923-934
[8]   Diffusion Models in Vision: A Survey [J].
Croitoru, Florinel-Alin ;
Hondru, Vlad ;
Ionescu, Radu Tudor ;
Shah, Mubarak .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) :10850-10869
[9]   Attentional Feature Fusion [J].
Dai, Yimian ;
Gieseke, Fabian ;
Oehmcke, Stefan ;
Wu, Yiquan ;
Barnard, Kobus .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :3559-3568
[10]   Multivariate workload and resource prediction in cloud computing using CNN and GRU by attention mechanism [J].
Dogani, Javad ;
Khunjush, Farshad ;
Mahmoudi, Mohammad Reza ;
Seydali, Mehdi .
JOURNAL OF SUPERCOMPUTING, 2023, 79 (03) :3437-3470