Towards Practical and Near-Optimal Coflow Scheduling for Data Center Networks

Cited by: 53
Authors
Luo, Shouxi [1]
Yu, Hongfang [1]
Zhao, Yangming [1]
Wang, Sheng [1]
Yu, Shui [2]
Li, Lemin [1]
Affiliations
[1] Univ Elect Sci & Technol China, Minist Educ, Key Lab Opt Fiber Sensing & Commun, Chengdu 611731, Peoples R China
[2] Deakin Univ, Sch IT, Burwood, Vic 3125, Australia
Funding
National Natural Science Foundation of China
Keywords
Coflow; datacenter networks; decentralized; scheduling
DOI
10.1109/TPDS.2016.2525767
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In current data centers, an application (e.g., MapReduce, Dryad, or a search platform) usually generates a group of parallel flows to complete a job. These flows compose a coflow, and the application benefits only when all of them have completed. Accordingly, minimizing the average Coflow Completion Time (CCT) becomes a critical objective of flow scheduling. However, achieving this goal in today's Data Center Networks (DCNs) is quite challenging, not only because the scheduling problem is theoretically NP-hard, but also because practical flow scheduling is hard to perform in large-scale DCNs. In this paper, we find that minimizing the average CCT of a set of coflows is equivalent to the well-known problem of minimizing the sum of completion times in a concurrent open shop. As there are abundant existing solutions for the concurrent open shop problem, this equivalence opens up a variety of techniques for coflow scheduling. Inspired by the best known result, we derive a 2-approximation algorithm for coflow scheduling, and further develop a decentralized coflow scheduling system, D-CAS, which avoids the system problems associated with current centralized proposals while addressing the performance challenges of decentralized proposals. Trace-driven simulations indicate that D-CAS achieves performance close to that of Varys, the state-of-the-art centralized method, and significantly outperforms Baraat, the only existing decentralized method.
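The equivalence stated in the abstract can be illustrated with a small sketch. It assumes a simplified model that is not taken from the paper: each port acts as a machine, each coflow as a job, and `demand[c][p]` is the time coflow `c` needs on port `p`, with every coflow having positive demand on at least one port. All names (`demand`, `completion_times`, `average_cct`, `greedy_order`) are hypothetical. The ordering rule builds the schedule from back to front in the style of the primal-dual greedy known from the concurrent open shop literature, with unit weights; whether it matches the paper's 2-approximation algorithm in detail is an assumption.

```python
from itertools import permutations


def completion_times(order, demand):
    """Completion time of each coflow when coflows are served in `order`.

    Each port serves coflows one after another, so a coflow finishes when
    the most loaded port it uses has worked through it and all predecessors.
    """
    busy = {}
    done = {}
    for c in order:
        for port, t in demand[c].items():
            busy[port] = busy.get(port, 0.0) + t
        done[c] = max(
            (busy[port] for port, t in demand[c].items() if t > 0), default=0.0
        )
    return done


def average_cct(order, demand):
    done = completion_times(order, demand)
    return sum(done.values()) / len(done)


def greedy_order(demand):
    """Build a coflow order from last to first (unit weights assumed)."""
    weight = {c: 1.0 for c in demand}
    remaining = set(demand)
    reversed_order = []
    while remaining:
        # Total load per port over the coflows not yet placed.
        load = {}
        for c in remaining:
            for port, t in demand[c].items():
                load[port] = load.get(port, 0.0) + t
        # Most loaded port; sorting makes tie-breaking deterministic.
        bottleneck = max(sorted(load), key=load.get)
        candidates = [
            c for c in sorted(remaining) if demand[c].get(bottleneck, 0) > 0
        ]
        # Place last the coflow with the smallest weight per unit of
        # bottleneck time, then discount the remaining weights.
        last = min(candidates, key=lambda c: weight[c] / demand[c][bottleneck])
        theta = weight[last] / demand[last][bottleneck]
        for c in remaining:
            weight[c] -= theta * demand[c].get(bottleneck, 0)
        remaining.remove(last)
        reversed_order.append(last)
    return reversed_order[::-1]


# Toy instance (made up for illustration): three coflows, two ports.
demand = {
    "A": {"p1": 3, "p2": 1},
    "B": {"p1": 1, "p2": 1},
    "C": {"p2": 4},
}
order = greedy_order(demand)
best = min(average_cct(list(p), demand) for p in permutations(demand))
print(order, average_cct(order, demand), best)
```

On this toy instance the greedy order coincides with the brute-force optimum. That is a property of the example, not a general guarantee, and the brute-force check is only feasible for a handful of coflows.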
Pages: 3366-3380
Page count: 15