TROD: Evolving From Electrical Data Center to Optical Data Center

被引:8
作者
Cao, Peirui [1 ]
Zhao, Shizhen [1 ]
Teh, Min Yee [2 ]
Liu, Yunzhuo [1 ]
Wang, Xinbing [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Columbia Univ, New York, NY 10027 USA
来源
2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021) | 2021年
关键词
D O I
10.1109/ICNP52444.2021.9651977
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the bandwidth scaling limit of electrical switching and the high cost of building Clos data center networks (DCNs), the adoption of optical DCNs is still limited. There are two reasons. First, existing optical DCN designs usually face tremendous deployment complexity. Second, these designs are not full-optical and the performance benefit against the non-blocking Clos DCN is not clear. After exploring the design tradeoffs of the existing optical DCN designs, we propose TROD (Threshold Routing based Optical Datacenter), a low-complexity optical DCN with superior performance than other optical DCNs. There are two novel designs in TROD that contribute to its success. First, TROD performs robust topology optimization based on the recurring traffic patterns and thus does not need to react to every traffic change, which lowers deployment and management complexity. Second, TROD introduces tVLB (threshold-based VLB), which can avoid network congestion as much as possible even under unexpected traffic bursts. We conduct simulation based on both Facebook's real DCN traces and our synthesized highly bursty DCN traces. TROD reduces flow completion time (FCT) by at least 2x compared with the existing optical DCN designs, and by approximately 2.4-3.2x compared with expander graph DCN. Compared with the non-blocking Clos, TROD reduces the hop count of the majority packets by one, and could even outperform the non-blocking Clos with proper bandwidth over-provision at the optical layer. Note that TROD can be built with commercially available hardware and does not require host modifications.
引用
收藏
页数:11
相关论文
共 38 条
[1]   Data Center TCP (DCTCP) [J].
Alizadeh, Mohammad ;
Greenberg, Albert ;
Maltz, David A. ;
Padhye, Jitendra ;
Patel, Parveen ;
Prabhakar, Balaji ;
Sengupta, Sudipta ;
Sridharan, Murari .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2010, 40 (04) :63-74
[2]  
[Anonymous], 2008, SIGCOMM
[3]   Sirius: A Flat Datacenter Network with Nanosecond Optical Switching [J].
Ballani, Hitesh ;
Costa, Paolo ;
Behrendt, Raphael ;
Cletheroe, Daniel ;
Haller, Istvan ;
Jozwik, Krzysztof ;
Karinou, Fotini ;
Lange, Sophie ;
Shi, Kai ;
Thomsen, Benn ;
Williams, Hugh .
SIGCOMM '20: PROCEEDINGS OF THE 2020 ANNUAL CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION ON THE APPLICATIONS, TECHNOLOGIES, ARCHITECTURES, AND PROTOCOLS FOR COMPUTER COMMUNICATION, 2020, :782-797
[4]  
Benson T, 2010, P 10 ACM SIGCOMM C I, P267, DOI DOI 10.1145/1879141.1879175
[5]  
Benson T., 2011, Proc. ACM CoNEXT, P1
[6]   BBR: Congestion-Based Congestion Control [J].
Cardwell, Neal ;
Cheng, Yuchung ;
Gunn, C. Stephen ;
Yeganeh, Soheil Hassas ;
Jacobson, Van .
COMMUNICATIONS OF THE ACM, 2017, 60 (02) :58-66
[7]  
Chaves LucianoJerez., 2016, P WORKSH NS 3 WNS3 1, P33
[8]  
Chen L, 2017, PROCEEDINGS OF NSDI '17: 14TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION, P577
[9]  
Cheng F, 2016, CALLIGRAMS, P1
[10]  
Delimitrou C., 2012, IISWC