On Log-Structured Merge for Solid-State Drives

被引:19
作者
Thonangi, Risi [1 ]
Yang, Jun [2 ]
机构
[1] VMware Inc, Palo Alto, CA 94304 USA
[2] Duke Univ, Durham, NC 27706 USA
来源
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017) | 2017年
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICDE.2017.121
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Log-structure merge (LSM) is an increasingly prevalent approach to indexing, especially for modern write-heavy workloads. LSM organizes data in levels with geometrically increasing sizes. Records enter the top level; whenever a level fills up, it is merged down into the next level. Hence, the index is updated only through merges and records are never updated in-place. While originally conceived to avoid slow random accesses of hard drives, LSM also turns out to be especially suited to solid-state drives, or any block-based storage with expensive writes. We study how to further reduce writes in LSM. Traditionally, LSM always merges an overflowing level fully into the next. We investigate in depth how partial merges save writes and prove bounds on their effectiveness. We propose new algorithms that make provably good decisions on whether to perform a partial merge, and if yes, which part of a level to merge. We also show how to further reduce writes by reusing data blocks during merges. Overall, our approach offers better worst-case guarantees and better practical performance than existing LSM variants.
引用
收藏
页码:683 / 694
页数:12
相关论文
共 19 条
[1]  
Agrawal Ganesan, 2009, PVLDB, V2, P361
[2]  
Awasthi, 2012, COMAD
[3]  
Bentley, 1980, J ALGORITHMS, V1, P301
[4]   Fractional Cascading: I. A Data Structuring Technique [J].
Chazelle, Bernard ;
Guibas, Leonidas J. .
ALGORITHMICA, 1986, 1 (1-4) :133-162
[5]  
Debnath, 2011, SIGMOD
[6]  
Ghemawat, 2014, LEVELDB 1 18
[7]  
Jagadish Narayan, 1997, VLDB
[8]   The partitioned exponential file for database storage management [J].
Jermaine, Christopher ;
Omiecinski, Edward ;
Yee, Wai Gen .
VLDB JOURNAL, 2007, 16 (04) :417-437
[9]  
Li He, 2010, PVLDB, V3, P1195
[10]  
Lim Fan, 2011, SOSP