Lightwave Fabrics: At-Scale Optical Circuit Switching for Datacenter and Machine Learning Systems

被引:32
作者
Liu, Hong [1 ]
Urata, Ryohei [1 ]
Yasumura, Kevin [1 ]
Zhou, Xiang [1 ]
Bannon, Roy [1 ]
Berger, Jill [1 ]
Dashti, Pedram [1 ]
Jouppi, Norm [1 ]
Lam, Cedric [1 ]
Li, Sheng [1 ]
Mao, Erji [1 ]
Nelson, Daniel [1 ]
Papen, George [1 ]
Tariq, Mukarram [1 ]
Vahdat, Amin [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
来源
PROCEEDINGS OF THE 2023 ACM SIGCOMM 2023 CONFERENCE, SIGCOMM 2023 | 2023年
关键词
Data center networks; Optical circuit switches; Machine learning;
D O I
10.1145/3603269.3604836
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We describe our experience developing what we believe to be the world's first large-scale production deployments of lightwave fabrics used for both datacenter networking and machine-learning (ML) applications. Using optical circuit switches (OCSes) and optical transceivers developed in-house, we employ hardware and software codesign to integrate the fabrics into our network and computing infrastructure. Key to our design is a high degree of multiplexing enabled by new kinds of wavelength-division-multiplexing (WDM) and optical circulators that support high-bandwidth bidirectional traffic on a single strand of optical fiber. The development of the requisite OCS and optical transceiver technologies leads to a synchronous lightwave fabric that is reconfigurable, low latency, rate agnostic, and highly available. These fabrics have provided substantial benefits for long-lived traffic patterns in our datacenter networks and predictable traffic patterns in tightly-coupled machine learning clusters. We report results for a large-scale ML superpod with 4096 tensor processing unit (TPU) V4 chips that has more than one ExaFLOP of computing power. For this use case, the deployment of a lightwave fabric provides up to 3x better system availability and model-dependent performance improvements of up to 3.3x compared to a static fabric, despite constituting less than 6% of the total system cost.
引用
收藏
页码:499 / 515
页数:17
相关论文
共 66 条
[31]   In-Datacenter Performance Analysis of a Tensor Processing Unit [J].
Jouppi, Norman P. ;
Young, Cliff ;
Patil, Nishant ;
Patterson, David ;
Agrawal, Gaurav ;
Bajwa, Raminder ;
Bates, Sarah ;
Bhatia, Suresh ;
Boden, Nan ;
Borchers, Al ;
Boyle, Rick ;
Cantin, Pierre-luc ;
Chao, Clifford ;
Clark, Chris ;
Coriell, Jeremy ;
Daley, Mike ;
Dau, Matt ;
Dean, Jeffrey ;
Gelb, Ben ;
Ghaemmaghami, Tara Vazir ;
Gottipati, Rajendra ;
Gulland, William ;
Hagmann, Robert ;
Ho, C. Richard ;
Hogberg, Doug ;
Hu, John ;
Hundt, Robert ;
Hurt, Dan ;
Ibarz, Julian ;
Jaffey, Aaron ;
Jaworski, Alek ;
Kaplan, Alexander ;
Khaitan, Harshit ;
Killebrew, Daniel ;
Koch, Andy ;
Kumar, Naveen ;
Lacy, Steve ;
Laudon, James ;
Law, James ;
Le, Diemthu ;
Leary, Chris ;
Liu, Zhuyuan ;
Lucke, Kyle ;
Lundin, Alan ;
MacKean, Gordon ;
Maggiore, Adriana ;
Mahony, Maire ;
Miller, Kieran ;
Nagarajan, Rahul ;
Narayanaswami, Ravi .
44TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2017), 2017, :1-12
[32]  
Kamil S., 2007, Proceedings of the 4th international conference on Computing frontiers, P183
[33]   Evaluation of an InfiniBand Switch: Choose Latency or Bandwidth, but Not Both [J].
Katebzadeh, M. R. Siavash ;
Costa, Paolo ;
Grot, Boris .
2020 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2020, :180-191
[34]   SiP-ML: High-Bandwidth Optical Net work Interconnects for Machine Learning Training [J].
Khani, Mehrdad ;
Ghobadi, Manya ;
Alizadeh, Mohammad ;
Zhu, Ziyi ;
Glick, Madeleine ;
Bergman, Keren ;
Vahdat, Amin ;
Klenk, Benjamin ;
Ebrahimi, Eiman .
SIGCOMM '21: PROCEEDINGS OF THE 2021 ACM SIGCOMM 2021 CONFERENCE, 2021, :657-675
[35]   Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect [J].
Li, Ang ;
Song, Shuaiwen Leon ;
Chen, Jieyang ;
Li, Jiajia ;
Liu, Xu ;
Tallent, Nathan R. ;
Barker, Kevin J. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (01) :94-110
[36]   Hyperscale Hardware Optimized Neural Architecture Search [J].
Li, Sheng ;
Andersen, Garrett ;
Chen, Tao ;
Cheng, Liqun ;
Grady, Julian ;
Da Huang ;
Le, Quoc V. ;
Li, Andrew ;
Li, Xin ;
Li, Yang ;
Liang, Chen ;
Lu, Yifeng ;
Ni, Yun ;
Pang, Ruoming ;
Tan, Mingxing ;
Wicke, Martin ;
Wu, Gang ;
Zhu, Shengqi ;
Ranganathan, Parthasarathy ;
Jouppi, Norman P. .
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, VOL 3, ASPLOS 2023, 2023, :343-358
[37]  
Liu H., 2012, Optical Interconnects for Future Data Center Networks. Optical Networks, P17
[38]  
Liu Hong, 2021, WORKSHOP 2021 OPTICA
[39]   RotorNet: A Scalable, Low-complexity, Optical Datacenter Network [J].
Mellette, William M. ;
McGuinness, Rob ;
Roy, Arjun ;
Forencich, Alex ;
Papen, George ;
Snoeren, Alex C. ;
Porter, George .
SIGCOMM '17: PROCEEDINGS OF THE 2017 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2017, :267-280
[40]  
Minkenberg C, 2016, 2016 OPTICAL FIBER COMMUNICATIONS CONFERENCE AND EXHIBITION (OFC)