Optimizing Egalitarian Performance when Colocating Tasks with Types for Cloud Data Center Resource Management

被引:0
作者
Pascual, Fanny [1 ]
Rzadca, Krzysztof [2 ]
机构
[1] Sorbonne Univ, CNRS, Lab Informat Paris 6, F-75005 Paris, France
[2] Univ Warsaw, Inst Informat, PL-00927 Warsaw, Poland
关键词
Task analysis; Approximation algorithms; Data centers; Resource management; Clustering algorithms; Computational modeling; Load modeling; Cloud computing; scheduling; complexity; approximation algorithm; heterogeneity; co-tenancy; workload co-location; SCHEDULING PROBLEMS; OPTIMIZATION;
D O I
10.1109/TPDS.2019.2911084
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In data centers, up to dozens of tasks are colocated on a single physical machine. Machines are used more efficiently, but the performance of the tasks deteriorates, as the colocated tasks compete for shared resources. Since the tasks are heterogeneous, the resulting performance dependencies are complex. In our previous work [1] , [2] we proposed a new combinatorial optimization model that uses two parameters of a task - its size and its type - to characterize how a task influences the performance of other tasks allocated to the same machine. In this paper, we study the egalitarian optimization goal: the aim is to optimize the performance of the worst-off task. This problem generalizes the classic makespan minimization on multiple processors ($P||C_{\max }$P||Cmax). We prove that polynomially-solvable variants of $P||C_{\max }$P||Cmax are NP-hard for this generalization, and that the problem is hard to approximate when the number of types is not constant. For a constant number of types, we propose a PTAS, a fast approximation algorithm, and a series of heuristics. We simulate the algorithms on instances derived from a trace of one of Google clusters. Compared with baseline algorithms solving $P||C_{\max }$P||Cmax, our proposed algorithms aware of the types of the jobs lead to significantly better tasks' performance. The notion of type enables us to extend standard combinatorial optimization methods to handle degradation of performance caused by colocation. Types add a layer of additional complexity. However, our results - approximation algorithms and good average-case performance - show that types can be handled efficiently.
引用
收藏
页码:2523 / 2535
页数:13
相关论文
共 37 条
[1]   A survey of scheduling problems with setup times or costs [J].
Allahverdi, Ali ;
Ng, C. T. ;
Cheng, T. C. E. ;
Kovalyov, Mikhail Y. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2008, 187 (03) :985-1032
[2]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theory of NP-Completeness
[3]   Heterogeneous Resource Allocation under Degree Constraints [J].
Beaumont, Olivier ;
Eyraud-Dubois, Lionel ;
Thraves Caro, Christopher ;
Rejeb, Hejer .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2013, 24 (05) :926-937
[4]  
Bobroff N, 2007, 2007 10TH IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2009), VOLS 1 AND 2, P119, DOI 10.1109/INM.2007.374776
[5]  
Bu X., 2013, P 22 INT S HIGH PERF, P227
[6]  
Cheng Y, 2018, IEEE INT CONF BIG DA, P292, DOI 10.1109/BigData.2018.8622518
[7]  
Chiang RonC., 2011, proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, P47
[8]   Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms [J].
Cortez, Eli ;
Bonde, Anand ;
Muzio, Alexandre ;
Russinovich, Mark ;
Fontoura, Marcus ;
Bianchini, Ricardo .
PROCEEDINGS OF THE TWENTY-SIXTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '17), 2017, :153-167
[9]   Optimization of Composite Cloud Service Processing with Virtual Machines [J].
Di, Sheng ;
Kondo, Derrick ;
Wang, Cho-Li .
IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (06) :1755-1768
[10]  
Garefalakis P., 2018, EUROSYS, P4