Auto-scaling of Web Applications in Clouds: A Tail Latency Evaluation

Cited by: 6

Authors:

Aslanpour, Mohammad S. ^{[1,2]}

Toosi, Adel N. ^{[1]}

Gaire, Raj ^{[2]}

Cheema, Muhammad Aamir ^{[1]}

Affiliations:

[1] Monash Univ, Fac Informat Technol, Clayton, Vic, Australia

[2] CSIRO's Data61, Canberra, ACT, Australia

Source:

2020 IEEE/ACM 13TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC 2020) | 2020

Keywords:

cloud computing; auto-scaling; tail latency; resource provisioning; performance evaluation;

DOI:

10.1109/UCC48980.2020.00037

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Auto-scaling mechanisms dynamically add and remove Virtual Machines (VMs) to reduce cost while minimizing latency. Latency improvements are mainly achieved by minimizing the "average" response time, yet unpredictable fluctuations in Web application workloads, also known as flash crowds, can result in very high latencies for users' requests. Requests affected by a flash crowd suffer from long latencies, known as outliers. Such outliers are inevitable to a large extent because auto-scaling solutions continue to improve the average, not the "tail", of latencies. In this paper, we study possible sources of tail latency in auto-scaling mechanisms for Web applications. Based on our extensive evaluations on a real cloud platform, we identified the following sources of tail latency: 1) large requests, i.e., data-intensive ones; 2) long-term scaling intervals; 3) instant analysis of scaling parameters; 4) conservative, i.e., tight, threshold tuning; 5) load-unaware surplus VM selection policies used for executing a scale-down decision; 6) the cooldown feature, although it is cost-effective; and 7) VM start-up delay. We also found that after auto-scaling mechanisms improve the average latency, the tail may behave differently, which demands dedicated tail-aware solutions for auto-scaling mechanisms.
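
The gap between "average" and "tail" latency described in the abstract can be illustrated with a minimal Python sketch. The workload below is hypothetical and not taken from the paper: 980 requests are assumed to finish in 100 ms and 20 flash-crowd requests in 5000 ms. The helper names `percentile` and `summarize` are likewise assumptions made for this illustration, and the nearest-rank percentile definition is one common choice among several.

```python
import math
from statistics import mean


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("samples must be non-empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


def summarize(latencies_ms):
    """Return the mean, median (p50), and tail (p99) of the latencies."""
    return {
        "mean": mean(latencies_ms),
        "p50": percentile(latencies_ms, 50),
        "p99": percentile(latencies_ms, 99),
    }


# Hypothetical workload (an assumption, not data from the paper):
# 980 ordinary requests at 100 ms, 20 flash-crowd outliers at 5000 ms.
latencies = [100] * 980 + [5000] * 20
summary = summarize(latencies)
print(summary)
```

In this assumed workload the mean stays below 200 ms while the 99th percentile equals the outlier latency, so a policy tuned only on the mean would not see the requests that suffer most.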


Pages: 186-195

Page count: 10
