Clustering-Based Numerosity Reduction for Cloud Workload Forecasting

被引:0
作者
Rossi, Andrea [1 ]
Visentin, Andrea [2 ]
Prestwich, Steven [2 ]
Brown, Kenneth N. [2 ]
机构
[1] Univ Coll Cork, SFI CRT Artificial Intelligence, Cork, Ireland
[2] Univ Coll Cork, Sch Comp Sci, Cork, Ireland
来源
ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2023 | 2024年 / 14053卷
关键词
Cloud Computing; Workload Prediction; Clustering; Bayesian Neural Network; Deep Learning;
D O I
10.1007/978-3-031-49361-4_7
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Finding smaller versions of large datasets that preserve the same characteristics as the original ones is becoming a central problem in Machine Learning, especially when computational resources are limited, and there is a need to reduce energy consumption. In this paper, we apply clustering techniques for wisely selecting a subset of datasets for training models for time series prediction of future workload in cloud computing. We train Bayesian Neural Networks (BNNs) and state-of-the-art probabilistic models to predict machine-level future resource demand distribution and evaluate them on unseen data from virtual machines in the Google Cloud data centre. Experiments show that selecting the training data via clustering approaches such as Self Organising Maps allows the model to achieve the same accuracy in less than half the time, requiring less than half the datasets rather than selecting more data at random. Moreover, BNNs can capture uncertainty aspects that can better inform scheduling decisions, which state-of-the-art time series forecasting methods cannot do. All the considered models achieve prediction time performance suitable for real-world scenarios.
引用
收藏
页码:115 / 132
页数:18
相关论文
共 50 条
[1]   Self-adaptive architecture for virtual machines consolidation based on probabilistic model evaluation of data centers in Cloud computing [J].
Abadi, Reza Mohammadi Bahram ;
Rahmani, Amir Masoud ;
Alizadeh, Sasan Hossein .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2018, 21 (03) :1711-1733
[2]  
Achar S., 2022, J. Comput. Hardware Eng. (JCHE), V1, P1
[3]   SeQual: an unsupervised feature selection method for cloud workload traces [J].
Ali, Shallaw Mohammed ;
Kecskemeti, Gabor .
JOURNAL OF SUPERCOMPUTING, 2023, 79 (13) :15079-15097
[4]   A Forecasting Methodology for Workload Forecasting in Cloud Systems [J].
Baldan, Francisco J. ;
Ramirez-Gallego, Sergio ;
Bergmeir, Christoph ;
Herrera, Francisco ;
Benitez, Jose M. .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2018, 6 (04) :929-941
[5]  
Barreto GA, 2007, STUD COMPUT INTELL, V77, P135
[6]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[7]   Integrated deep learning method for workload and resource prediction cloud systems [J].
Bi, Jing ;
Li, Shuang ;
Yuan, Haitao ;
Zhou, MengChu .
NEUROCOMPUTING, 2021, 424 :35-48
[8]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[9]   Workload Prediction Using ARIMA Model and Its Impact on Cloud Applications' QoS [J].
Calheiros, Rodrigo N. ;
Masoumi, Enayat ;
Ranjan, Rajiv ;
Buyya, Rajkumar .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2015, 3 (04) :449-458
[10]   SOM time series clustering and prediction with recurrent neural networks [J].
Cherif, Aymen ;
Cardot, Hubert ;
Bone, Romuald .
NEUROCOMPUTING, 2011, 74 (11) :1936-1944