Benchmarking data management systems: from traditional database to emergent big data

被引:0
作者
Jin, Che-Qing [1 ]
Qian, Wei-Ning [1 ]
Zhou, Min-Qi [1 ]
Zhou, Ao-Ying [1 ]
机构
[1] Institute for Data Science and Engineering, Software Engineering Institute, East China Normal University, Shanghai
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2015年 / 38卷 / 01期
关键词
Benchmark; Big data; Data generator; Metric; Workload;
D O I
10.3724/SP.J.1016.2015.00018
中图分类号
学科分类号
摘要
The arrival of big data era means the emergence of novel techniques, systems and products. How to compare and evaluate different database systems objectively becomes a hot research area, which is similar to the age when database systems were just flourishing thirty years ago. As well as we know, database benchmarking plays an important role in the development of database systems, and greatly promotes the development of database technology and systems. The database benchmark refers to a set of specifications to evaluate and compare different database systems, which is capable of reflecting the performance gap between various database systems objectively and comprehensively, so as to promote technological progress and guide the positive development of the industry. Database benchmark is closely related to the application developments: it describes new data management needs, sparks innovative data management theory, gives birth to new data management systems, and ultimately needs to develop appropriate benchmarks for evaluation. There exist various kinds of database benchmarks, including that for relational databases, for non-relational databases (semi-structured data, object-oriented data, streaming data, and spatial data), and for big data most recently. Nowadays, the tide of the research on big data benchmarking is also coming. The research on big data is strongly related to application requirements. So far, existing work cannot fully reflects the distinctive characteristics of big data applications. From a technical point of view, the developments of database benchmarks in the past thirty years are of great help to develop big data benchmarks, which is the main motivation of this paper. This paper reviews the progress of database benchmarks systematically, and points out future directions. ©, 2014, Science Press. All right reserved.
引用
收藏
页码:18 / 34
页数:16
相关论文
共 74 条
[1]  
Gray J., Benchmark Handbook for Database and Transaction System, (1993)
[2]  
Codd E.F., A relational model of data for large shared data banks, Communications of the ACM, 13, 6, (1970)
[3]  
Bitton D., de Witt D., Turbyfil C., Benchmarking database systems: A systematic approach, Proceedings of the 9th International Conference on Very Large Data Bases, pp. 8-19, (1983)
[4]  
Dewitt D., The Wisconsin benchmark: past, present, and future, Chapter 5, Benchmark Handbook for Database and Transaction System, (1993)
[5]  
Big data: Science in the petabyte era, Nature, 455, pp. 1-136, (2008)
[6]  
Turbyfill C., Orji C., Bitton D., AS<sup>3</sup>AP-An ANSI SQL standard scalable and portable benchmark for relational database systems, Chapter 5. Benchmark Handbook for Database and Transaction System, (1993)
[7]  
Cooper B., Silberstein A., Tam E., Et al., Beanchmarking cloud serving systems with YCSB, Proceedings of the 1st ACM Symposium on Cloud Computing, pp. 143-154, (2010)
[8]  
Ghazal A., Rabl T., Hu M., Et al., BigBench: Towards an industry standard benchmark for big data analytics, Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 1197-1208, (2013)
[9]  
Rabl T., Poess M., Parallel data generation for performance analysis of large, complex RDBMS, Proceedings of the 4th International Workshop on Testing Database Systems, (2011)
[10]  
Nambiar R., Poess M., Masland A., Et al., TPC state of the council 2013, Proceedings of the 5th TPC Technology Conference on Performance Characterization and Benchmarking, pp. 1-15, (2013)