Scheduling for efficiency and fairness in systems with redundancy

被引:18
作者
Gardner, Kristen [1 ]
Harchol-Balter, Mor [1 ]
Hyytia, Esa [2 ]
Righter, Rhonda [3 ]
机构
[1] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Iceland, Dept Comp Sci, Reykjavik, Iceland
[3] Univ Calif Berkeley, IEOR Dept, Berkeley, CA USA
基金
美国国家科学基金会; 芬兰科学院;
关键词
Queueing theory; Redundancy; Replication; Scheduling; Stochastic processes; Resource allocation;
D O I
10.1016/j.peva.2017.07.001
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Server-side variability the idea that the same job can take longer to run on one server than another due to server-dependent factors is an increasingly important concern in many queueing systems. One strategy for overcoming server-side variability to achieve low response time is redundancy, under which jobs create copies of themselves and send these copies to multiple different servers, waiting for only one copy to complete service. Most of the existing theoretical work on redundancy has focused on developing bounds, approximations, and exact analysis to study the response time gains offered by redundancy. However, response time is not the only important metric in redundancy systems: in addition to providing low overall response time, the system should also be fair in the sense that no job class should have a worse mean response time in the system with redundancy than it did in the system before redundancy is allowed. In this paper we use scheduling to address the simultaneous goals of (1) achieving low response time and (2) maintaining fairness across job classes. We develop new exact analysis for per-class response time under First-Come First-Served (FCFS) scheduling for a general type of system structure; our analysis shows that FCFS can be unfair in that it can hurt non-redundant jobs. We then introduce the Least Redundant First (LRF) scheduling policy, which we prove is optimal with respect to overall system response time, but which can be unfair in that it can hurt the jobs that become redundant. Finally, we introduce the Primaries First (PF) scheduling policy, which is provably fair and also achieves excellent overall mean response time. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 25
页数:25
相关论文
共 16 条
[1]  
Adan I., 2014, STOCH SYST, V4, P250, DOI DOI 10.1287/13-SSY117
[2]  
Ananthanarayanan G., 2010, OSDI, V10, P24
[3]  
Bonald Thomas, 2016, ARXIV160406763
[4]   The Tail at Scale [J].
Dean, Jeffrey ;
Barroso, Luiz Andre .
COMMUNICATIONS OF THE ACM, 2013, 56 (02) :74-80
[5]  
Friedman E. J., 2003, Performance Evaluation Review, V31, P229, DOI 10.1145/885651.781056
[6]  
Gardner K., 2015, CMUCS15141
[7]  
Gardner K., 2015, SIGMETRICS
[8]   Redundancy-d: The Power of d Choices for Redundancy [J].
Gardner, Kristen ;
Harchol-Balter, Mor ;
Scheller-Wolf, Alan ;
Velednitsky, Mark ;
Zbarsky, Samuel .
OPERATIONS RESEARCH, 2017, 65 (04) :1078-1094
[9]  
Joshi G, 2012, ANN ALLERTON CONF, P326, DOI 10.1109/Allerton.2012.6483236
[10]  
Koole G, 2008, J SCHEDULING, V11, P163, DOI 10.1007/S10951-007-0018-8