A Scalable and Write-Optimized Disaggregated B+-Tree With Adaptive Cache Assistance

被引:0
作者
An, Hang [1 ]
Wang, Fang [1 ]
Feng, Dan [1 ]
Zou, Xiaomin [1 ]
Liu, Zefeng [1 ]
Zhang, Jianshun [1 ]
机构
[1] Huazhong Univ Sci & Technol, Engn Res Ctr Data Syst & Technol, Key Lab Informat Storage Syst, Wuhan Natl Laborotory Optoelect,Minist Educ, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Indexes; Synchronization; Computer architecture; Throughput; Concurrent computing; Servers; Memory management; Disaggregated memory; B+-tree; concurrency Control; RDMA;
D O I
10.1109/TCC.2024.3437472
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Disaggregated memory (DM) architecture separates CPU and DRAM into computing/memory resource pools and interconnects them with high-speed networks. Storage systems on DM locate data by distributed index. However, existing distributed indexes either suffer from prohibitive synchronization overhead of write operation or sacrifice the performance of read operation, resulting in low throughput, high tail latency, and challenging trade-off. In this paper, we present Marlin+, a scalable and optimized B+-tree on DM. Marlin+ provides atomic granularity synchronization between write operations via three strategies: 1) a concurrent algorithm that is friendly to IDU operations (Insert, Delete, and Update), enabling different clients to concurrently operate on the same leaf node, 2) shared-exclusive leaf node lock, effectively preventing conflicts between index structure modification operation (SMO) and IDU operations, and 3) critical path compression of write to reduce latency of write operation. Moreover, Marlin+ proposes an adaptive remote address cache to accelerate the access of hot data. Compared to the state-of-the-art schemes based on DM, Marlin achieves 2.21x higher throughput and 83.4% lower P99 latency under YCSB hybrid workloads. Compared to Marlin, Marlin+ improves the throughput by up to 1.58x and reduces the P50 latency by up to 50.5% under YCSB read-intensive workloads.
引用
收藏
页码:1074 / 1087
页数:14
相关论文
共 43 条
  • [1] Marlin: A Concurrent andWrite-Optimized B plus -tree Index on Disaggregated Memory
    An, Hang
    Wang, Fang
    Feng, Dan
    Zou, Xiaomin
    Liu, Zefeng
    Zhang, Jianshun
    [J]. PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 695 - 704
  • [2] [Anonymous], 2020, Compute Express Link Specification Revision 2.0
  • [3] Rethinking Software Runtimes for Disaggregated Memory
    Calciu, Irina
    Imran, M. Talha
    Puddu, Ivan
    Kashyap, Sanidhya
    Al Maruf, Hasan
    Mutlu, Onur
    Kolli, Aasheesh
    [J]. ASPLOS XXVI: TWENTY-SIXTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2021, : 79 - 92
  • [4] PolarDB Serverless: A Cloud Native Database for Disaggregated Data Centers
    Cao, Wei
    Zhang, Yingqiang
    Yang, Xinjun
    Li, Feifei
    Wang, Sheng
    Hu, Qingda
    Cheng, Xuntao
    Chen, Zongzhi
    Liu, Zhenjun
    Fang, Jing
    Wang, Bo
    Wang, Yuhui
    Sun, Haiqing
    Yang, Ze
    Cheng, Zhushi
    Chen, Sen
    Wu, Jian
    Hu, Wei
    Zhao, Jianwei
    Gao, Yusong
    Cai, Songlu
    Zhang, Yunyang
    Tong, Jiawang
    [J]. SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2477 - 2489
  • [5] Cha S. K., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P181
  • [6] Chen Y., 2020, Proc. VLDB Endowment, V13, P2634, DOI DOI 10.14778/3407790.3407850
  • [7] Cooper B.F., 2010, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10, ACM, P143, DOI DOI 10.1145/1807128.1807152
  • [8] en.m, 2022, Gen-Z technology
  • [9] Fan B., 2013, 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13), P371
  • [10] GaussDB, 2024, About us