A hybrid CNN-LSTM model for predicting server load in cloud computing

Cited by: 31
Authors
Patel, Eva [1]
Kushwaha, Dharmender Singh [1]
Affiliations
[1] MNNIT Allahabad, Dept Comp Sci & Engn, Prayagraj 211004, Uttar Pradesh, India
Keywords
Cloud computing; Capacity planning; Long short-term memory network; 1-dimensional Convolution Neural Networks; Dilated convolutions; Receptive fields; Temporal patterns; Convolutional neural networks
DOI
10.1007/s11227-021-04234-0
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Complex resource usage patterns of scaling Cloud workloads and heterogeneous infrastructure remain a challenge for accurate modelling of server load, which is the key to effective capacity sizing and provisioning in data centers. Recently, Long Short-Term Memory (LSTM) networks have been used for host load prediction. However, learning complex, noisy variations in host load is still an open issue. In this work, we propose pCNN-LSTM, a hybrid prediction approach comprising 1-dimensional Convolution Neural Networks (1D CNN) and LSTM, to predict CPU utilization on Cloud servers at multiple consecutive time-steps. It consists of three parallel dilated 1D CNN layers with different dilation rates, which extract patterns from noisy host CPU usage, and an LSTM layer that learns temporal dependencies within the raw usage values as well as within the patterns extracted by the 1D CNN layers. Convolutions with different dilation rates enable the model to learn CPU load variations at different scales. The prediction skill of pCNN-LSTM is demonstrated using the Google cluster trace, the Alibaba trace and Bitbrains data, and performance is measured using Mean Squared Error (MSE) and Root Mean Squared Error (RMSE). pCNN-LSTM achieves up to 15%, 13% and 16% improvements in host load prediction on the Google trace, the Alibaba trace and the Bitbrains data set, respectively, over LSTM, Bidirectional LSTM (BLSTM), CNN-LSTM, CNN-BLSTM and two of its variants. These results show the effectiveness of the multi-scale learning capability of pCNN-LSTM and establish its applicability as an adaptive prediction method for improved capacity planning and provisioning.
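The multi-scale idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the dilation rates (1, 2, 4), the kernel size of 3, the fixed averaging kernel, the causal zero-padding and all function names are assumptions chosen for illustration, and the LSTM stage is omitted. The sketch only shows how parallel dilated 1D convolutions see the same CPU series at different receptive fields, how their outputs could be stacked with the raw values as input features for a recurrent layer, and how RMSE is computed.

```python
import numpy as np

def dilated_conv1d(x, kernel, dilation):
    """Causal dilated 1D convolution with left zero-padding (output length == input length)."""
    k = len(kernel)
    pad = (k - 1) * dilation
    xp = np.concatenate([np.zeros(pad), x])
    out = np.zeros(len(x))
    # out[t] = sum_j kernel[j] * x[t - (k - 1 - j) * dilation]
    for j in range(k):
        out += kernel[j] * xp[j * dilation : j * dilation + len(x)]
    return out

def receptive_field(kernel_size, dilation):
    """Number of time-steps spanned by a single dilated convolution layer."""
    return (kernel_size - 1) * dilation + 1

def parallel_features(x, kernel, dilations=(1, 2, 4)):
    """Stack the raw series with one dilated-conv branch per dilation rate (assumed rates)."""
    branches = [dilated_conv1d(x, kernel, d) for d in dilations]
    return np.stack([x] + branches, axis=-1)  # shape: (time, 1 + number of branches)

def rmse(y_true, y_pred):
    """Root Mean Squared Error, the square root of MSE."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean(diff ** 2)))

# Hypothetical CPU utilization values; a trained model would learn its kernels.
cpu = np.array([0.2, 0.4, 0.6, 0.8, 1.0, 0.8, 0.6, 0.4])
feats = parallel_features(cpu, np.ones(3) / 3)
print(feats.shape)
print([receptive_field(3, d) for d in (1, 2, 4)])
```

With a kernel size of 3, the three assumed branches cover 3, 5 and 9 time-steps respectively, which is the sense in which different dilation rates expose load variations at different scales.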
Pages: 10595 / +
Page count: 30