Combining time series prediction models using genetic algorithm to autoscaling Web applications hosted in the cloud infrastructure

被引:77
作者
Messias, Valter Rogerio [1 ]
Estrella, Julio Cezar [1 ]
Ehlers, Ricardo [1 ]
Santana, Marcos Jose [1 ]
Santana, Regina Carlucci [1 ]
Reiff-Marganiec, Stephan [2 ]
机构
[1] Univ Sao Paolo, Inst Math & Comp Sci, POB 668, Sao Paulo, Brazil
[2] Univ Leicester, Dept Comp Sci, Leicester, Leics, England
关键词
Genetic algorithm; Time series prediction models; Elasticity; Cloud computing;
D O I
10.1007/s00521-015-2133-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a cloud computing environment, companies have the ability to allocate resources according to demand. However, there is a delay that may take minutes between the request for a new resource and it being ready for using. This causes the reactive techniques, which request a new resource only when the system reaches a certain load threshold, to be not suitable for the resource allocation process. To address this problem, it is necessary to predict requests that arrive at the system in the next period of time to allocate the necessary resources, before the system becomes overloaded. There are several time series forecasting models to calculate the workload predictions based on history of monitoring data. However, it is difficult to know which is the best time series forecasting model to be used in each case. The work becomes even more complicated when the user does not have much historical data to be analyzed. Most related work considers only single methods to evaluate the results of the forecast. Other works propose an approach that selects suitable forecasting methods for a given context. But in this case, it is necessary to have a significant amount of data to train the classifier. Moreover, the best solution may not be a specific model, but rather a combination of models. In this paper we propose an adaptive prediction method using genetic algorithms to combine time series forecasting models. Our method does not require a previous phase of training, because it constantly adapts the extent to which the data are coming. To evaluate our proposal, we use three logs extracted from real Web servers. The results show that our proposal often brings the best result and is generic enough to adapt to various types of time series.
引用
收藏
页码:2383 / 2406
页数:24
相关论文
共 29 条
[1]  
Ali-Eldin A, 2012, IEEE IFIP NETW OPER, P204, DOI 10.1109/NOMS.2012.6211900
[2]  
[Anonymous], 2009, CLOUDS BERKELEY VIEW
[3]  
[Anonymous], APACHE COMMONS MATH
[4]  
[Anonymous], 2012, EHUKATIK0912 U BASQ
[5]   A workload characterization study of the 1998 World Cup Web site [J].
Arlitt, M ;
Jin, T .
IEEE NETWORK, 2000, 14 (03) :30-37
[6]   A comparitive study of predictive models for cloud infrastructure management [J].
Balaji, Mahesh ;
Rao, G. Subrahmanya V. R. K. ;
Kumar, Ch. Aswani .
2014 14TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2014, :923-926
[7]  
Beloglazov A., 2010, International Workshop on Middleware for Grids, Clouds and e-Science, P1
[8]  
Box G.E.P., 2013, TIME SERIES ANAL FOR, V4nd, DOI DOI 10.1002/9781118619193
[9]  
Chieu T. C., 2011, 2011 IEEE 8th International Conference on e-Business Engineering, P317, DOI 10.1109/ICEBE.2011.63
[10]   Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment [J].
Chieu, Trieu C. ;
Mohindra, Ajay ;
Karve, Alexei A. ;
Segal, Alla .
ICEBE 2009: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2009, :281-286