A Scalable and Write-Optimized Disaggregated B+-Tree With Adaptive Cache Assistance

被引：0

作者：

An, Hang ^{[1
]}

Wang, Fang ^{[1
]}

Feng, Dan ^{[1
]}

Zou, Xiaomin ^{[1
]}

Liu, Zefeng ^{[1
]}

Zhang, Jianshun ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Engn Res Ctr Data Syst & Technol, Key Lab Informat Storage Syst, Wuhan Natl Laborotory Optoelect,Minist Educ, Wuhan 430074, Peoples R China

来源：

IEEE TRANSACTIONS ON CLOUD COMPUTING | 2024年 / 12卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Indexes; Synchronization; Computer architecture; Throughput; Concurrent computing; Servers; Memory management; Disaggregated memory; B+-tree; concurrency Control; RDMA;

D O I：

10.1109/TCC.2024.3437472

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Disaggregated memory (DM) architecture separates CPU and DRAM into computing/memory resource pools and interconnects them with high-speed networks. Storage systems on DM locate data by distributed index. However, existing distributed indexes either suffer from prohibitive synchronization overhead of write operation or sacrifice the performance of read operation, resulting in low throughput, high tail latency, and challenging trade-off. In this paper, we present Marlin+, a scalable and optimized B+-tree on DM. Marlin+ provides atomic granularity synchronization between write operations via three strategies: 1) a concurrent algorithm that is friendly to IDU operations (Insert, Delete, and Update), enabling different clients to concurrently operate on the same leaf node, 2) shared-exclusive leaf node lock, effectively preventing conflicts between index structure modification operation (SMO) and IDU operations, and 3) critical path compression of write to reduce latency of write operation. Moreover, Marlin+ proposes an adaptive remote address cache to accelerate the access of hot data. Compared to the state-of-the-art schemes based on DM, Marlin achieves 2.21x higher throughput and 83.4% lower P99 latency under YCSB hybrid workloads. Compared to Marlin, Marlin+ improves the throughput by up to 1.58x and reduces the P50 latency by up to 50.5% under YCSB read-intensive workloads.

引用

页码：1074 / 1087

页数：14

共 43 条

[1] Marlin: A Concurrent andWrite-Optimized B plus -tree Index on Disaggregated Memory
An, Hang
Wang, Fang
Feng, Dan
Zou, Xiaomin
Liu, Zefeng
Zhang, Jianshun
[J]. PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 695 - 704
[2] [Anonymous], 2020, Compute Express Link Specification Revision 2.0
[3] Rethinking Software Runtimes for Disaggregated Memory
Calciu, Irina
Imran, M. Talha
Puddu, Ivan
Kashyap, Sanidhya
Al Maruf, Hasan
Mutlu, Onur
Kolli, Aasheesh
[J]. ASPLOS XXVI: TWENTY-SIXTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2021, : 79 - 92
[4] PolarDB Serverless: A Cloud Native Database for Disaggregated Data Centers
Cao, Wei
Zhang, Yingqiang
Yang, Xinjun
Li, Feifei
Wang, Sheng
Hu, Qingda
Cheng, Xuntao
Chen, Zongzhi
Liu, Zhenjun
Fang, Jing
Wang, Bo
Wang, Yuhui
Sun, Haiqing
Yang, Ze
Cheng, Zhushi
Chen, Sen
Wu, Jian
Hu, Wei
Zhao, Jianwei
Gao, Yusong
Cai, Songlu
Zhang, Yunyang
Tong, Jiawang
[J]. SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 2477 - 2489
[5] Cha S. K., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P181
[6] Chen Y., 2020, Proc. VLDB Endowment, V13, P2634, DOI DOI 10.14778/3407790.3407850
[7] Cooper B.F., 2010, Proceedings of the 1st ACM symposium on Cloud computing, SoCC '10, ACM, P143, DOI DOI 10.1145/1807128.1807152
[8] en.m, 2022, Gen-Z technology
[9] Fan B., 2013, 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI 13), P371
[10] GaussDB, 2024, About us

← 1 2 3 4 5 →