Effective load balancing for cluster-based servers employing job preemption

Cited by: 26
Authors
Ungureanu, Victoria [2]
Melamed, Benjamin [1]
Katehakis, Michael [3]
Affiliations
[1] Rutgers State Univ, Dept MSIS, Piscataway, NJ 08854 USA
[2] Rutgers State Univ, DIMACS Ctr, Piscataway, NJ 08854 USA
[3] Rutgers State Univ, Dept MSIS, Newark, NJ 07102 USA
Keywords
cluster-based servers; back-end server architecture; job preemption; simulation
DOI
10.1016/j.peva.2008.01.001
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A cluster-based server consists of a front-end dispatcher and multiple back-end servers. The dispatcher receives incoming jobs, and then decides how to assign them to back-end servers, which in turn serve the jobs according to some discipline. Cluster-based servers have been widely deployed, as they combine good performance with low costs. Several assignment policies have been proposed for cluster-based servers, most of which aim to balance the load among back-end servers. There are two main strategies for load balancing: The first aims to balance the amount of workload at back-end servers, while the second aims to balance the number of jobs assigned to back-end servers. Examples of policies using these strategies are Dynamic and LC (Least Connected), respectively. In this paper we propose a policy, called LC*, which combines the two aforementioned strategies. The paper shows experimentally that when preemption is admitted (i.e., when jobs execute concurrently on back-end servers), LC* substantially outperforms both Dynamic and LC in terms of response-time metrics. This improved performance is achieved by using only information readily available to the dispatcher, rendering LC* a practical policy to implement. Finally, we study a refinement, called ALC* (Adaptive LC*), which further improves on the response-time performance of LC* by adapting its actions to incoming traffic rates. Published by Elsevier B.V.
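The two load-balancing strategies named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the `Backend` class, the `dispatch` helper, and both selection functions are hypothetical names introduced here, and the sketch assumes the dispatcher knows each job's remaining size, which a real dispatcher may not. LC* and ALC* are not sketched because the abstract does not specify their rules.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Backend:
    """Hypothetical back-end server tracking the jobs assigned to it."""
    name: str
    # Remaining size of each job currently at this server (assumed known).
    jobs: List[float] = field(default_factory=list)

    @property
    def job_count(self) -> int:
        return len(self.jobs)

    @property
    def workload(self) -> float:
        return sum(self.jobs)


def pick_least_connected(backends: List[Backend]) -> Backend:
    """Balance the NUMBER of jobs: choose the server with the fewest jobs."""
    return min(backends, key=lambda b: b.job_count)


def pick_least_workload(backends: List[Backend]) -> Backend:
    """Balance the AMOUNT of work: choose the server with the least remaining work."""
    return min(backends, key=lambda b: b.workload)


def dispatch(backends: List[Backend], job_size: float,
             pick: Callable[[List[Backend]], Backend]) -> Backend:
    """Assign one incoming job using the given selection rule."""
    target = pick(backends)
    target.jobs.append(job_size)
    return target


if __name__ == "__main__":
    servers = [Backend("a", [10.0]), Backend("b", [1.0, 1.0])]
    # Server "a" has fewer jobs; server "b" has less remaining work.
    print(pick_least_connected(servers).name)
    print(pick_least_workload(servers).name)
```

With one large job on server "a" and two small jobs on server "b", the two rules disagree about where the next job should go, which is the tension the abstract says LC* is designed to resolve by combining both strategies.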
Pages: 606-622
Page count: 17