Building New Key-value Store with High Performance and High Availability

被引:0
作者
Zhu Y.-A. [1 ,2 ]
Jian H.-B. [2 ]
Long Y.-C. [2 ]
Li B. [2 ]
Wang S. [2 ]
Wu X.-L. [2 ]
Zhong Z.-C. [2 ]
Zhang Y.-S. [3 ]
机构
[1] School of Computer Science, Jiaying University, Meizhou
[2] Guangzhou YY Inc., Guangzhou
[3] National Survey Research Center at Renmin University of China, Beijing
来源
Zhu, Yue-An (iwillgoon@126.com); Zhu, Yue-An (iwillgoon@126.com) | 1600年 / Chinese Academy of Sciences卷 / 32期
基金
中国国家自然科学基金;
关键词
Fine-grain buffer management; Recovery; Replication protocol; Single-copy system; Write optimized;
D O I
10.13328/j.cnki.jos.006023
中图分类号
学科分类号
摘要
In recent year, the write-heavy applications are more and more prevalent. How to efficiently handle this sort of workload is one of intensive research direction in the field of database system. The overhead caused by write operation is mainly issued by two factors. One is the hardware level, i.e., the IO cost caused by write operation. This cost cannot be removed in short period. The other is dual-copy software architecture, i.e., multiple writes caused by modifying in-memory data copy and formulating log records. The log-as-database architecture (the following refers it as single-copy system) can reduce the IOs and software cost caused by write as well. But existing systems treating log-as-database either are built on top of special infrastructure such as infiniband or NVRam (non-volatile random access memory) which is far from widely available or is constructed with the help of other system such as Dynamo, which is lack of flexibility and generality. This study builds from scratch a single copy system called LogStore oriented for commodity environment, which adopts log-as-database design philosophy to fully utilize its advantages that can boost the write performance and minimize the gap between primary and secondary. Embedding consensus module into system other than dependent on auxiliary systems makes it more flexible and controllable. The novel execution model binding thread to certain partition plus multi-version concurrency control technique eliminates read-write, write-write conflict, and context switch overhead in lock-free style. The YCSB benchmark is used to assess system performance thoroughly. Compared to prevalent key-value store HBase and its single-copy implementation LogBase, the proposed system can achieve about 4x better. In term of crash recovery, LogStore can finish recovery within one minute for TB scale data volume, which is one order of magnitude recovery time less than LogBase. © Copyright 2021, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:3203 / 3218
页数:15
相关论文
共 23 条
[1]  
Sears R, Ramakrishnan R., BLSM: A general purpose log structured merge tree, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data (SIGMOD 2012), pp. 217-228, (2012)
[2]  
Mohan C, Haderle D, Lindsay B, Pirahesh H, Schwarz P., Aries: A transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging, ACM Trans. on Database System, 17, 1, pp. 94-162, (1992)
[3]  
Stonebraker M., The land sharks are on the squawk box, ACM Communication, 59, 2, pp. 74-83, (2016)
[4]  
Open source implementation of HBase
[5]  
Chang F, Dean J, Ghemawat S, Hsieh WC, Wallach DA, Burrows M, Chandra T, Fikes A, Gruber RE., Bigtable: A distributed storage system for structured data, ACM Trans. on Computer Systems, 26, 2, (2008)
[6]  
RocksDB, (2016)
[7]  
Chandramouli B, Prasaad G, Kossmann D, Levandoski JJ, Hunter J, Barnett M., FASTER: A concurrent key-value store with in-place updates, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data, pp. 275-290, (2018)
[8]  
Sheehy J, Smith D., Bitcask: A log-structured hash table for fast key/value data, (2010)
[9]  
Verbitski A, Gupta A, Saha D, Brahmadesam M, Gupta K, Mittal R, Krishnamurthy S, Maurice S, Kharatishvili T, Bao XF., Amazon aurora: Design considerations for high throughput cloud-native relational databases, Proc. of the ACM SIGMOD Int'l Conf. on Management of Data, pp. 1041-1052, (2017)
[10]  
Vo HT, Wang S, Agrawal D, Chen G, Ooi BC., LogBase: A scalable log-structured database system in the cloud, PVLDB, 5, 10, pp. 1004-1015, (2012)