Improvement of Rock PR Performance via Large-Scale Parameter Analysis and Optimization

被引:2
作者
Jin, Huijun [1 ]
Choi, Won Gi [2 ]
Choi, Jonghwan [1 ]
Sung, Hanseung [3 ]
Park, Sanghyun [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea
[2] Korea Elect Technol Inst KETI, Seoul, South Korea
[3] Tmax Tibero R&D Ctr, Seoul, South Korea
来源
JOURNAL OF INFORMATION PROCESSING SYSTEMS | 2022年 / 18卷 / 03期
关键词
Database; Genetic Algorithm; Log-Structured Merge-Tree; Optimization; Random Forest; Space Amplification; Write Amplification;
D O I
10.3745/JIPS.04.0244
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Database systems usually have many parameters that must be configured by database administrators and users. RocksDB achieves fast data writing performance using a log-structured merged tree. This database has many parameters associated with write and space amplifications. Write amplification degrades the database performance, and space amplification leads to an increased storage space owing to the storage of unwanted data. Previously, it was proven that significant performance improvements can be achieved by tuning the database parameters. However, tuning the multiple parameters of a database is a laborious task owing to the large number of potential configuration combinations. To address this problem, we selected the important parameters that affect the performance of RocksDB using random forest. We then analyzed the effects of the selected parameters on write and space amplifications using analysis of variance. We used a genetic algorithm to obtain optimized values of the major parameters. The experimental results indicate an insignificant reduction (-5.64%) in the execution time when using these optimized values; however, write amplification, space amplification, and data processing rates improved considerably by 20.65%, 54.50%, and 89.68%, respectively, as compared to the performance when using the default settings.
引用
收藏
页码:374 / 388
页数:15
相关论文
共 21 条
[1]  
[Anonymous], 2021, IEEE Trans. Broadcast.
[2]  
Cao ZC, 2020, PROCEEDINGS OF THE 18TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, P209
[3]  
Dong Siying, 2017, P BIENN C INN DAT SY
[4]  
GitHub, 2022, BENCHM TOOLS
[5]  
Howell D.C., 2006, STAT METHODS PSYCHOL, V6th
[6]  
Hu Xiao-Yu, 2009, SYSTOR, DOI [DOI 10.1145/1534530.1534544, 10.1145/1534530.1534544]
[7]  
Hyojin Kim, 2019, 2019 23rd International Computer Science and Engineering Conference (ICSEC), P305, DOI 10.1109/ICSEC47112.2019.8974829
[8]  
Jena Sampreeti., 2013, International Journal of Current Engineering and Technology, V3, P1379
[9]  
Kanellis Konstantinos, 2020, 12 USENIX WORKSHOP H
[10]  
Lu Youyou., 2013, FAST, P257