Dynamic workload patterns prediction for proactive auto-scaling of web applications

被引：34

作者：

Iqbal, Waheed ^{[1
]}

Erradi, Abdelkarim ^{[1
]}

Mahmood, Arif ^{[2
]}

机构：

[1] Qatar Univ, Coll Engn, Dept Comp Sci & Engn, Doha, Qatar

[2] Informat Technol Univ ITU, Dept Comp Sci, Lahore, Pakistan

来源：

JOURNAL OF NETWORK AND COMPUTER APPLICATIONS | 2018年 / 124卷

关键词：

Workload characterization; Workload patterns; Workload patterns prediction; Proactive auto-scaling; Web applications;

D O I：

10.1016/j.jnca.2018.09.023

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Proactive auto-scaling methods dynamically manage the resources for an application according to the current and future load predictions to preserve the desired performance at a reduced cost. However, auto-scaling web applications remain challenging mainly due to dynamic workload intensity and characteristics which are difficult to predict. Most existing methods mainly predict the request arrival rate which only partially captures the workload characteristics and the changing system dynamics that influence the resource needs. This may lead to inappropriate resource provisioning decisions. In this paper, we address these challenges by proposing a framework for prediction of dynamic workload patterns as follows. First, we use an unsupervised learning method to analyze the web application access logs to discover URI (Uniform Resource Identifier) space partitions based on the response time and the document size features. Then for each application URI, we compute its distribution across these partitions based on historical access logs to accurately capture the workload characteristics compared to just representing the workload using the request arrival rate. These URI distributions are then used to compute the Probabilistic Workload Pattern (PWP), which is a probability vector describing the overall distribution of incoming requests across URI partitions. Finally, the identified workload patterns for a specific number of last time intervals are used to predict the workload pattern of the next interval. The latter is used for future resource demand prediction and proactive auto-scaling to dynamically control the provisioning of resources. The framework is implemented and experimentally evaluated using historical access logs of three real web applications, each with increasing, decreasing, periodic, and randomly varying arrival rate behaviors. Results show that the proposed solution yields significantly more accurate predictions of workload patterns and resource demands of web applications compared to existing approaches.

引用

页码：94 / 107

页数：14

共 41 条

[21] An adaptive prediction approach based on workload pattern discrimination in the cloud [J].