Dynamic workload patterns prediction for proactive auto-scaling of web applications

被引:34
作者
Iqbal, Waheed [1 ]
Erradi, Abdelkarim [1 ]
Mahmood, Arif [2 ]
机构
[1] Qatar Univ, Coll Engn, Dept Comp Sci & Engn, Doha, Qatar
[2] Informat Technol Univ ITU, Dept Comp Sci, Lahore, Pakistan
关键词
Workload characterization; Workload patterns; Workload patterns prediction; Proactive auto-scaling; Web applications;
D O I
10.1016/j.jnca.2018.09.023
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Proactive auto-scaling methods dynamically manage the resources for an application according to the current and future load predictions to preserve the desired performance at a reduced cost. However, auto-scaling web applications remain challenging mainly due to dynamic workload intensity and characteristics which are difficult to predict. Most existing methods mainly predict the request arrival rate which only partially captures the workload characteristics and the changing system dynamics that influence the resource needs. This may lead to inappropriate resource provisioning decisions. In this paper, we address these challenges by proposing a framework for prediction of dynamic workload patterns as follows. First, we use an unsupervised learning method to analyze the web application access logs to discover URI (Uniform Resource Identifier) space partitions based on the response time and the document size features. Then for each application URI, we compute its distribution across these partitions based on historical access logs to accurately capture the workload characteristics compared to just representing the workload using the request arrival rate. These URI distributions are then used to compute the Probabilistic Workload Pattern (PWP), which is a probability vector describing the overall distribution of incoming requests across URI partitions. Finally, the identified workload patterns for a specific number of last time intervals are used to predict the workload pattern of the next interval. The latter is used for future resource demand prediction and proactive auto-scaling to dynamically control the provisioning of resources. The framework is implemented and experimentally evaluated using historical access logs of three real web applications, each with increasing, decreasing, periodic, and randomly varying arrival rate behaviors. Results show that the proposed solution yields significantly more accurate predictions of workload patterns and resource demands of web applications compared to existing approaches.
引用
收藏
页码:94 / 107
页数:14
相关论文
共 41 条
[21]   An adaptive prediction approach based on workload pattern discrimination in the cloud [J].
Liu, Chunhong ;
Liu, Chuanchang ;
Shang, Yanlei ;
Chen, Shiping ;
Cheng, Bo ;
Chen, Junliang .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 80 :35-44
[22]  
Liu HA, 2009, LECT NOTES COMPUT SC, V5931, P369
[23]   A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments [J].
Lorido-Botran, Tania ;
Miguel-Alonso, Jose ;
Lozano, Jose A. .
JOURNAL OF GRID COMPUTING, 2014, 12 (04) :559-592
[24]   Workload modeling for resource usage analysis and simulation in cloud computing [J].
Magalhaes, Deborah ;
Calheiros, Rodrigo N. ;
Buyya, Rajkumar ;
Gomes, Danielo G. .
COMPUTERS & ELECTRICAL ENGINEERING, 2015, 47 :69-81
[25]   Using Geodesic Space Density Gradients for Network Community Detection [J].
Mahmood, Arif ;
Small, Michael ;
Al-Maadeed, Somaya Ali ;
Rajpoot, Nasir .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (04) :921-935
[26]   Subspace Based Network Community Detection Using Sparse Linear Coding [J].
Mahmood, Arif ;
Small, Michael .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (03) :801-812
[27]  
Mahmud A. Hasan, 2014, ACM SIGMETRICS Performance Evaluation Review, V42, P563, DOI 10.1145/2591971.2592019
[28]  
Mehrotra Rajat, 2013, 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing (UCC), P269, DOI 10.1109/UCC.2013.54
[29]  
OW2 Consortium, 2008, RUBIS AUCT SIT PROT
[30]  
Papadopoulos A. V., 2016, ACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS), V1, P1