Effective load balancing for cluster-based servers employing job preemption

被引：26

作者：

Ungureanu, Victoria ^{[2
]}

Melamed, Benjamin ^{[1
]}

Katehakis, Michael ^{[3
]}

机构：

[1] Rutgers State Univ, Dept MSIS, Piscataway, NJ 08854 USA

[2] Rutgers State Univ, DIMACS Ctr, Piscataway, NJ 08854 USA

[3] Rutgers State Univ, Dept MSIS, Newark, NJ 07102 USA

来源：

PERFORMANCE EVALUATION | 2008年 / 65卷 / 08期

关键词：

cluster-based servers; back-end server architecture; job preemption; simulation;

D O I：

10.1016/j.peva.2008.01.001

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

A cluster-based server consists of a front-end dispatcher and multiple back-end servers. The dispatcher receives incoming jobs, and then decides how to assign them to back-end servers, which in turn serve the jobs according to some discipline. Cluster-based servers have been widely deployed, as they combine good performance with low costs. Several assignment policies have been proposed for cluster-based servers, most of which aim to balance the load among back-end servers. There are two main strategies for load balancing: The first aims to balance the amount of workload at back-end servers, while the second aims to balance the number of jobs assigned to back-end servers. Examples of policies using these strategies are Dynamic and LC (Least Connected), respectively. In this paper we propose a policy, called LC*, which combines the two aforementioned strategies. The paper shows experimentally that when preemption is admitted (i.e., when jobs execute concurrently on back-end servers), LC* substantially outperforms both Dynamic and LC in terms of response-time metrics. This improved performance is achieved by using only information readily available to the dispatcher, rendering LC* a practical policy to implement. Finally, we study a refinement, called ALC* (Adaptive LC*), which further improves on the response-time performance of LC* by adapting its actions to incoming traffic rates. Published by Elsevier B.V.

引用

页码：606 / 622

页数：17

共 22 条

[1]

[Anonymous], 2001, WEB PROTOCOLS PRACTI

[2] A workload characterization study of the 1998 World Cup Web site [J].

Arlitt, M ;

Jin, T .

IEEE NETWORK, 2000, 14 (03) :30-37

[3]

ARLITT M, 1999, ACM SIGMETRICS PERFO, V27, P25

[4]

BRUCKNER P, 2001, SCHEDULING ALGORITHM

[5] The state of the art in locally distributed Web-server systems [J].

Cardellini, V ;

Casalicchio, E ;

Colajanni, M ;

Yu, PS .

ACM COMPUTING SURVEYS, 2002, 34 (02) :263-311

[6] EQUILOAD: a load balancing policy for clustered web servers [J].

Ciardo, G ;

Riska, A ;

Smirni, E .

PERFORMANCE EVALUATION, 2001, 46 (2-3) :101-124

[7]

Couloris G., 2001, DISTRIBUTED SYSTEMS

[8]

Crovella ME, 1998, PRACTICAL GUIDE TO HEAVY TAILS, P3

[9] Multi-layered round robin routing for parallel servers [J].

Down, Douglas G. ;

Wu, Rong .

QUEUEING SYSTEMS, 2006, 53 (04) :177-188

[10]

Faloutsos M, 1999, COMP COMM R, V29, P251, DOI 10.1145/316194.316229

← 1 2 3 →