Data Center Scheduling With Network Tasks

被引:0
作者
Giroire, F. [1 ]
Huin, N. [2 ]
Tomassilli, A. [1 ]
Perennes, S. [1 ]
机构
[1] Univ Cote Azur, CNRS, Inria, I3S, F-06103 Sophia Antipolis, France
[2] IMT Atlantique, IRISA, CNRS, UMR 6074, F-35700 Rennes, France
来源
IEEE TRANSACTIONS ON NETWORKING | 2025年
关键词
Data centers; Delays; Scheduling; Partitioning algorithms; Bandwidth; Schedules; Approximation algorithms; Servers; Data models; Costs; Scheduling algorithms; data centers; network-aware scheduling; communication modeling; approximation algorithms;
D O I
10.1109/TON.2025.3578455
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the placement of jobs inside a data center. Traditionally, this is done by a task orchestrator without taking into account network constraints. According to recent studies, network transfers may account for up to 50% of the completion time of classical jobs. Thus, network resources must be considered when placing jobs in a data center. In this paper, we propose a new scheduling framework, introducing network tasks that need to be executed on network machines alongside traditional (CPU) tasks. The model takes into account the competition between communications for the network resources, which is not considered in the formerly proposed scheduling models with communication. Network transfers inside a data center can be easily modeled in our framework. As we show, classical algorithms do not efficiently handle a limited amount of network bandwidth. We thus propose new provably efficient algorithms with the goal of minimizing the makespan in this framework. We show their efficiency and the importance of taking into consideration network capacity through extensive simulations on workflows built from Google data center traces.
引用
收藏
页数:15
相关论文
共 33 条
[1]  
Ahmad F. S., 2014, P USENIX ANN TECH C, P1
[2]  
Chen Bo, 1998, Handbook of Combinatorial Optimization, V3, P1493, DOI [DOI 10.1007/978-1-4613-0303-9_25, DOI 10.1007/978-1-4613-0303-925]
[3]  
Chen FF, 2012, IEEE INFOCOM SER, P1143, DOI 10.1109/INFCOM.2012.6195473
[4]   The features, hardware, and architectures of data center networks: A survey [J].
Chen, Tao ;
Gao, Xiaofeng ;
Chen, Guihai .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2016, 96 :45-74
[5]   Efficient Coflow Scheduling with Varys [J].
Chowdhury, Mosharaf ;
Zhong, Yuan ;
Stoica, Ion .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2014, 44 (04) :443-454
[6]   Managing Data Transfers in Computer Clusters with Orchestra [J].
Chowdhury, Mosharaf ;
Zaharia, Matei ;
Ma, Justin ;
Jordan, Michael I. ;
Stoica, Ion .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2011, 41 (04) :98-109
[7]  
Chowdhury M, 2012, PROCEEDINGS OF THE 11TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS-XI), P31
[8]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[9]   Decentralized Task-Aware Scheduling for Data Center Networks [J].
Dogar, Fahad R. ;
Karagiannis, Thomas ;
Ballani, Hitesh ;
Rowstron, Antony .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2014, 44 (04) :431-442
[10]   Fast approximate graph partitioning algorithms [J].
Even, G ;
Naor, JS ;
Rao, S ;
Schieber, B .
SIAM JOURNAL ON COMPUTING, 1999, 28 (06) :2187-2214