Performance Analysis of Work Stealing in Large-scale Multithreaded Computing

被引:4
作者
Sonenberg, Nikki [1 ]
Kielanski, Grzegorz [2 ]
Van Houdt, Benny [2 ]
机构
[1] Alan Turing Inst, British Lib, 96 Euston Rd, London, England
[2] Univ Antwerp, Middelheimlaan 1, B-2020 Antwerp, Belgium
关键词
Mean-field model; matrix analytic methods; performance analysis; distributed computing; PUSH STRATEGIES; PULL;
D O I
10.1145/3470887
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Randomized work stealing is used in distributed systems to increase performance and improve resource utilization. In this article, we consider randomized work stealing in a large system of homogeneous processors where parent jobs spawn child jobs that can feasibly be executed in parallel with the parent job. We analyse the performance of two work stealing strategies: one where only child jobs can be transferred across servers and the other where parent jobs are transferred. We define a mean-field model to derive the response time distribution in a large-scale system with Poisson arrivals and exponential parent and child job durations. We prove that the model has a unique fixed point that corresponds to the steady state of a structured Markov chain, allowing us to use matrix analytic methods to compute the unique fixed point. The accuracy of the mean-field model is validated using simulation. Using numerical examples, we illustrate the effect of different probe rates, load, and different child job size distributions on performance with respect to the two stealing strategies, individually, and compared to each other.
引用
收藏
页数:28
相关论文
共 27 条
[1]  
Benny, 2019, PROC ACM MEAS ANAL C, V3, P1
[2]  
Bini D., 2006, P 2006 WORKSH TOOLS, P1
[3]  
Bladt M, 2017, PROB THEOR STOCH MOD, V81, P1, DOI 10.1007/978-1-4939-7049-0
[4]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748
[5]   Cilk: An efficient multithreaded runtime system [J].
Blumofe, RD ;
Joerg, CF ;
Kuszmaul, BC ;
Leiserson, CE ;
Randall, KH ;
Zhou, YL .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1996, 37 (01) :55-69
[6]   Asymptotic independence of queues under randomized load balancing [J].
Bramson, Maury ;
Lu, Yi ;
Prabhakar, Balaji .
QUEUEING SYSTEMS, 2012, 71 (03) :247-292
[7]   A COMPARISON OF RECEIVER-INITIATED AND SENDER-INITIATED ADAPTIVE LOAD SHARING [J].
EAGER, DL ;
LAZOWSKA, ED ;
ZAHORJAN, J .
PERFORMANCE EVALUATION, 1986, 6 (01) :53-68
[8]   Expected values estimated via mean-field approximation are 1/N-accurate [J].
Gast, Nicolas .
Proceedings of the ACM on Measurement and Analysis of Computing Systems, 2017, 1 (01)
[9]  
Gast N, 2010, PERF E R SI, V38, P13, DOI 10.1145/1811099.1811042
[10]  
Gautier T., 2007, PASCO 07 PROC 2007 I, P15, DOI [10.1145/1278177.1278182, DOI 10.1145/1278177.1278182]