SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory

Cited by: 0
Authors
Luo, Xuchuan [1]
Zuo, Pengfei [2]
Shen, Jiacheng [3]
Gu, Jiazhen [3]
Wang, Xin [1, 4]
Lyu, Michael R. [3]
Zhou, Yangfan [1, 4]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Huawei Cloud, Shenzhen, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Shanghai Key Lab Intelligent Informat Proc, Shanghai, Peoples R China
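Source
PROCEEDINGS OF THE 17TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, OSDI 2023 | 2023
Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
INDEX
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract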
Disaggregated memory (DM) is an increasingly prevalent architecture in academia and industry with high resource utilization. It separates computing and memory resources into two pools and interconnects them with fast networks. Existing range indexes on DM are based on B+ trees, which suffer from large inherent read and write amplifications. The read and write amplifications rapidly saturate the network bandwidth, resulting in low request throughput and high access latency of B+ trees on DM. In this paper, we propose to use the radix tree, which is more suitable for DM than the B+ tree due to smaller read and write amplifications. However, constructing a radix tree on DM is challenging due to the costly lock-based concurrency control, the bounded memory-side IOPS, and the complicated computing-side cache validation. To address these challenges, we design SMART, the first radix tree for disaggregated memory with high performance. Specifically, we leverage 1) a hybrid concurrency control scheme including lock-free internal nodes and fine-grained lock-based leaf nodes to reduce lock overhead, 2) a computing-side read-delegation and write-combining technique to break through the IOPS upper bound by reducing redundant I/Os, and 3) a simple yet effective reverse check mechanism for computing-side cache validation. Experimental results show that SMART achieves 6.1x higher throughput under typical write-intensive workloads and 2.8x higher throughput under read-only workloads, compared with state-of-the-art B+ trees on DM.
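The read-delegation and write-combining idea in the abstract can be illustrated with a minimal sketch. It is not the SMART implementation: the abstract only states that redundant I/Os are reduced on the computing side, so this sketch assumes that "redundant" means requests for the same key that are pending at the same time on one compute node, and it models concurrency as a simple batch instead of coordinating threads. `RemoteMemory`, `delegated_reads`, and `combined_writes` are hypothetical names introduced for illustration only.

```python
class RemoteMemory:
    """Hypothetical stand-in for the memory pool; each call counts as one network I/O."""

    def __init__(self):
        self.store = {}
        self.io_count = 0

    def read(self, key):
        self.io_count += 1
        return self.store.get(key)

    def write(self, key, value):
        self.io_count += 1
        self.store[key] = value


def delegated_reads(remote, keys):
    """Serve a batch of pending reads with one remote read per distinct key."""
    fetched = {}
    for key in keys:
        if key not in fetched:
            # The first requester of a key acts as the delegate and issues the I/O.
            fetched[key] = remote.read(key)
    # Every requester of the same key shares the delegate's result.
    return [fetched[key] for key in keys]


def combined_writes(remote, writes):
    """Apply a batch of pending (key, value) writes with one remote write per key."""
    latest = {}
    for key, value in writes:
        # Assumption: a later pending write to the same key supersedes earlier ones.
        latest[key] = value
    for key, value in latest.items():
        remote.write(key, value)


remote = RemoteMemory()
combined_writes(remote, [("a", 1), ("b", 2), ("a", 3)])
writes_io = remote.io_count                      # I/Os spent on three logical writes
values = delegated_reads(remote, ["a", "a", "b", "a"])
reads_io = remote.io_count - writes_io           # I/Os spent on four logical reads
print(values, writes_io, reads_io)
```

In this toy run, three logical writes and four logical reads each cost two remote I/Os, which is the kind of saving that matters when the memory-side IOPS is the bottleneck.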
Pages: 553-571
Number of pages: 19