Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging

被引:119
作者
Dayan, Niv [1 ]
Idreos, Stratos [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
来源
SIGMOD'18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2018年
基金
美国国家科学基金会;
关键词
D O I
10.1145/3183713.3196927
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We slim that all mainstream ISM-tree based key-value stores in the literature and in industry suboptimally trade between the I/O cost of updates on one hand and the I/O cost of lookups and storage space on the other. The reason is that they perform equally expensive merge operations across all levels of ISM-tree to bound the number of runs that a lookup has to probe and to remove obsolete entries to reclaim storage space. With state-of-the-art designs, however, merge operations from all levels of LSM-tree but the largest (i.e., most merge operations) reduce point lookup cost, long range lookup cost, and storage space by a negligible amount while significantly adding to the amortized cost of updates. To address this problem, we introduce Lazy Leveling, a new design that removes merge operations from all levels of LSM-tree but the largest. Lazy Leveling improves the worst-case complexity of update cost while maintaining the same bounds on point lookup cost, long range lookup cost, and storage space. We further introduce, Fluid ISM-tree, a generalization of the entire LSM-tree design space that can be parameterized to assume any existing design. Relative to Lazy Leveling, Fluid LSM-tree can optimize more for updates by merging less at the largest level, or it can optimize more for short range lookups by merging more at all other levels. We put everything together to design Dostoevsky, a key-value store that adaptively removes superfluous merging by navigating the Fluid LSM-tree design space based on the application workload and hardware. We implemented Dostoevsky on top of RocksDB, and we show that it strictly dominates state-of-the-art designs in terms of performance and storage space.
引用
收藏
页码:505 / 520
页数:16
相关论文
共 46 条
[21]   Monkey: Optimal Navigable Key-Value Store [J].
Dayan, Niv ;
Athanassoulis, Manos ;
Idreos, Stratos .
SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, :79-94
[22]   FlashStore: High Throughput Persistent Key-Value Store [J].
Debnath, Biplob ;
Sengupta, Sudipta ;
Li, Jin .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (02) :1414-1425
[23]  
DeCandia Giuseppe, 2007, Operating Systems Review, V41, P205, DOI 10.1145/1323293.1294281
[24]  
Dong S., 2017, P 8 BIENN C INN DAT
[25]  
Facebook, RocksDB
[26]  
Facebook, MyRocks
[27]  
Fitzpatrick B., 2011, CISC VIS NETW IND GL
[28]  
Golan-Gueta Guy, 2015, P 10 EUR C COMP SYST
[29]  
Jagadish HV, 1997, PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, P16
[30]  
Lakshman Avinash, 2010, Operating Systems Review, V44, P35, DOI 10.1145/1773912.1773922