TrieKV: A High-Performance Key-Value Store Design With Memory as Its First-Class Citizen

被引:0
作者
Sun, Hui [1 ]
Kong, Deyan [1 ]
Jiang, Song [2 ]
Yue, Yinliang [3 ]
Qin, Xiao [4 ]
机构
[1] Anhui Univ, Sch Sci & Technol, Hefei 230601, Anhui, Peoples R China
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[3] Zhongguancun Lab, Dept 3, Beijing 100094, Peoples R China
[4] Auburn Univ, Dept Comp Sci & Software Engn, Auburn, AL 36849 USA
基金
美国国家航空航天局; 美国国家科学基金会; 中国国家自然科学基金;
关键词
Compaction; Random access memory; Indexes; Memory management; Resource management; Costs; Throughput; Solid state drives; Optimization; Bandwidth; Hybrid memory; key-value (KV) store; log-structured merge tree (LSM-tree); persistent memory;
D O I
10.1109/TPDS.2024.3473013
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Key-value (KV) stores based on log-structured merge tree (LSM-tree) have been extensively studied and deployed in major information technology infrastructures. Because this type of systems is catered for KV store accessing disks, a limited disk bandwidth increases the difficulty of serving online data requests. One solution involves using a large DRAM such that frequent KV pairs are buffered and accessed from the main memory - and this solution exposes a major design drawback of the KV store: its lack of support for integrated data management in memory and on disks. For example, data in the most popular LSM-tree implementation - RocksDB - may reside in a small write buffer (MemTable) that organizes KV pairs for disk writes, a buffer cache for disk blocks, a write-ahead log on the disk for data persistence, and in various LSM levels on the disk. Without the integrated management of indexes, data, and their persistence in a hierarchical memory/disk architecture, memory is under-utilized along with missed performance optimization opportunities. We propose a KV store, TrieKV, which holistically incorporates DRAM, persistent memory (PMem), and disk with certain desired features: (1) fast in-memory access, (2) accurate identification of hot/cold data at an adaptable granularity, (3) customized memory space allocation for minimized fragmentation, (4) hotness-aware data placement across the storage hierarchy, (5) in-place data persistence in the PMem, and (6) hotness-aware LSM-tree compaction. TrieKV employs a single, integrated trie-structured index for all KV pairs in memory, where access hotness can be consistently discovered. Accordingly, the KV placement is dynamically determined according to the hotness and persistence needs of the storage hierarchy spanning the DRAM, PMem, and solid-state drive. In the experiment, we demonstrate that the 99th latency of RocksDB and NoveLSM is 38x and 6x higher than that of TrieKV, respectively. In addition, TrieKV outperforms RocksDB and NoveLSM by a factor of 5.6 and 1.7in terms of throughput, respectively.
引用
收藏
页码:2479 / 2496
页数:18
相关论文
共 9 条
  • [1] HyperKV: A High Performance Concurrent Key-Value Store for Persistent Memory
    Sun, Penghao
    Xue, Dongliang
    You, Litong
    Yan, Yan
    Huang, Linpeng
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 125 - 134
  • [2] NStore: A High-Performance NUMA-Aware Key-Value Store for Hybrid Memory
    Wang, Zhonghua
    Lu, Kai
    Wan, Jiguang
    Jiang, Hong
    Zhao, Zeyang
    Xu, Peng
    Lai, Biliang
    Li, Guokuan
    Xie, Changsheng
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (03) : 929 - 943
  • [3] FastStore: A High-Performance RDMA-enabled Distributed Key-Value Store with Persistent Memory
    Xiong, Ziwei
    Jiang, Dejun
    Xiong, Jin
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 406 - 417
  • [4] GHStore: A High Performance Global Hash Based Key-Value Store
    Li, Jiaoyang
    Yue, Yinliang
    Wang, Weiping
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 493 - 508
  • [5] PHash: A memory-efficient, high-performance key-value store for large-scale data-intensive applications
    Shim, Hyotaek
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 123 : 33 - 44
  • [6] Design of a High-Performance, High-Endurance Key-Value SSD for Large-Key Workloads
    Park, Chanyoung
    Liu, Chun-Yi
    Kang, Kyungtae
    Kandemir, Mahmut
    Choi, Wonil
    IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 22 (02) : 149 - 152
  • [7] Improving Performance of Key-Value Stores for High-Performance Storage Devices
    Kim, Sunggon
    Kim, Hwajung
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [8] High-Performance Remote Data Persisting for Key-Value Stores via Persistent Memory Region
    Luo, Yongping
    Jin, Peiquan
    Wang, Xiaoliang
    Chu, Zhaole
    Guo, Kuankuan
    Guo, Jinhui
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (11) : 3828 - 3839
  • [9] FULL-KV: Flexible and Ultra-Low-Latency In-Memory Key-Value Store System Design on CPU-FPGA
    Qiu, Yunhui
    Xie, Jinyu
    Lv, Hankun
    Yin, Wenbo
    Luk, Wai-Shing
    Wang, Lingli
    Yu, Bowei
    Chen, Hua
    Ge, Xianjun
    Liao, Zhijian
    Shi, Xiaozhong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1828 - 1844