Clio: A Hardware-Software Co-Designed Disaggregated Memory System

被引:86
作者
Guo, Zhiyuan [1 ]
Shan, Yizhou [1 ]
Luo, Xuhao [1 ]
Huang, Yutong [1 ]
Zhang, Yiying [1 ]
机构
[1] Univ Calif San Diego, San Diego, CA 92103 USA
来源
ASPLOS '22: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS | 2022年
基金
美国国家科学基金会;
关键词
Resource Disaggregation; FPGA; Virtual Memory; Hardware-Software Co-design;
D O I
10.1145/3503222.3507762
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Memory disaggregation has attracted great attention recently because of its benefits in efficient memory utilization and ease of management. So far, memory disaggregation research has all taken one of two approaches: building/emulating memory nodes using regular servers or building them using raw memory devices with no processing power. The former incurs higher monetary cost and faces tail latency and scalability limitations, while the latter introduces performance, security, and management problems. Server-based memory nodes and memory nodes with no processing power are two extreme approaches. We seek a sweet spot in the middle by proposing a hardware-based memory disaggregation solution that has the right amount of processing power at memory nodes. Furthermore, we take a clean-slate approach by starting from the requirements of memory disaggregation and designing a memory-disaggregation-native system. We built Clio, a disaggregated memory system that virtualizes, protects, and manages disaggregated memory at hardware-based memory nodes. The Clio hardware includes a new virtual memory system, a customized network system, and a framework for computation offloading. In building Clio, we not only co-design OS functionalities, hardware architecture, and the network system, but also co-design compute nodes and memory nodes. Our FPGA prototype of Clio demonstrates that each memory node can achieve 100 Gbps throughput and an end-to-end latency of 2.5 mu s at median and 3.2 mu s at the 99th percentile. Clio also scales much better and has orders of magnitude lower tail latency than RDMA. It has 1.1x to 3.4x energy saving compared to CPU-based and SmartNIC-based disaggregated memory systems and is 2.7x faster than software-based SmartNIC solutions.
引用
收藏
页码:417 / 433
页数:17
相关论文
共 81 条
[41]   HPCC: High Precision Congestion Control [J].
Li, Yuliang ;
Miao, Rui ;
Liu, Hongqiang Harry ;
Zhuang, Yan ;
Feng, Fei ;
Tang, Lingbo ;
Cao, Zheng ;
Zhang, Ming ;
Kelly, Frank ;
Alizadeh, Mohammad ;
Yu, Minlan .
SIGCOMM '19 - PROCEEDINGS OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2019, :44-58
[42]  
Lim K, 2012, INT S HIGH PERF COMP, P189
[43]  
Lim K, 2009, CONF PROC INT SYMP C, P267, DOI 10.1145/1555815.1555789
[44]  
Linux Kernel, RED BLACK TREES RBTR
[45]   Memory Efficient Loss Recovery for Hardware-based Transport in Datacenter [J].
Lu, Yuanwei ;
Chen, Guo ;
Ruan, Zhenyuan ;
Xiao, Wencong ;
Li, Bojie ;
Zhang, Jiansong ;
Xiong, Yongqiang ;
Cheng, Peng ;
Chen, Enhong .
PROCEEDINGS OF THE 2017 ASIA-PACIFIC WORKSHOP ON NETWORKING (APNET '17), 2017, :22-28
[46]  
Mellanox, 2018, BLUEFIELD SMARTNIC
[47]  
Milojicic Dejan., 2015, Working Paper
[48]   Revisiting Network Support for RDMA [J].
Mittal, Radhika ;
Shpiner, Alexander ;
Panda, Aurojit ;
Zahavi, Eitan ;
Krishnamurthy, Arvind ;
Ratnasamy, Sylvia ;
Shenker, Scott .
PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18), 2018, :313-326
[49]   TIMELY: RTT-based Congestion Control for the Datacenter [J].
Mittal, Radhika ;
Vinh The Lam ;
Dukkipati, Nandita ;
Blem, Emily ;
Wassel, Hassan ;
Ghobadi, Monia ;
Vahdat, Amin ;
Wang, Yaogong ;
Wetherall, David ;
Zats, David .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2015, 45 (04) :537-550
[50]   Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities [J].
Montazeri, Behnam ;
Li, Yilong ;
Alizadeh, Mohammad ;
Ousterhout, John .
PROCEEDINGS OF THE 2018 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION (SIGCOMM '18), 2018, :221-235