A Fast Learned Key-Value Store for Concurrent and Distributed Systems

被引：0

作者：

Li, Pengfei ^{[1
]}

Hua, Yu ^{[1
]}

Jia, Jingnan ^{[1
]}

Zuo, Pengfei ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Wuhan 430074, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2024年 / 36卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Computers and information processing; computer architecture; data structures; distributed computing; INDEX; TREE;

D O I：

10.1109/TKDE.2023.3327009

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Efficient key-value (KV) store becomes important for concurrent and distributed systems to deliver high performance. The promising learned indexes leverage deep-learning models to complement existing KV stores and obtain significant performance improvements. However, existing schemes show limited scalability in concurrent systems due to containing high dependency among data. The practical system performance decreases when inserting a large amount of new data due to triggering frequent and inefficient retraining operations. Moreover, existing learned indexes become inefficient in distributed systems, since different machines incur high overheads to guarantee the data consistency when the index structures dynamically change. To address these problems in concurrent and distributed systems, we propose a fine-grained learned index scheme with high scalability, called FineStore, which constructs independent models with a flattened data structure under the trained data array to concurrently process the requests with low overheads. FineStore processes the new requests in-place with the support of non-blocking retraining, hence adapting to the new distributions without blocking the systems. In the distributed systems, different machines efficiently leverage the extended RCU barrier to guarantee the data consistency. We evaluate FineStore via YCSB and real-world datasets, and extensive experimental results demonstrate that FineStore improves the performance respectively by up to 1.8x and 2.5x than state-of-the-art XIndex and Masstree. We have released the open-source codes of FineStore for public use in GitHub.

引用

页码：2301 / 2315

页数：15

共 32 条

[31] Fast and accurate variable batch size convolution neural network training on large scale distributed systems
Hu, Zhongzhe
Xiao, Junmin
Sun, Ninghui
Tan, Guangming
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (21)
[32] ABCDEF - The 6 key features behind scalable, multi-tenant web archive processing with ARCH: Archive, Big Data, Concurrent, Distributed, Efficient, Flexible
Holzmann, Helge
Ruest, Nick
Bailey, Jefferson
Dempsey, Alex
Fritz, Samantha
Lee, Peggy
Milligan, Ian
2022 ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES (JCDL), 2022,

← 1 2 3 4 →