Prototyping and In-Depth Analysis of Big Data Benchmarking

被引:1
作者
Pandove, Divya [1 ]
Goel, Shivani [1 ]
机构
[1] Thapar Univ, CSED, Patiala, Punjab, India
来源
CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING | 2015年
关键词
Big Data Benchmarking; Big Data Systems; Performance Measures; 4-V Data Properties; Prototype; SUITE;
D O I
10.1109/CIT/IUCC/DASC/PICOM.2015.182
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today's digital age has witnessed an explosion of data and information. This has resulted into changing the nature of data from being a medium of supporting transactions to becoming a transactional commodity itself. The consequential increase in the value of data has led to many innovations in both academic and industrial circles. The main focus remains on finding efficient ways to analyse data and derive meaningful results out of it. An efficient way of doing so is constructing benchmarks in order to effectively evaluate the performances of existing and upcoming data systems. A successful benchmark should cover all the major big data system application domains and there workloads. A prototype, outlining a small and diverse benchmark, which takes minimum time to cover a wide range of applications needs to be developed. In designing this prototype the four cornerstones of big data namely volume, veracity, velocity and variety should also be maintained. In addition to this the workload of a benchmark set should be carefully selected. It should represent a wide spectrum of application domains; have diversity of data characteristics and should not have any redundancy. Lastly, there should be a metric to evaluate the benchmarks so as to give them validity.
引用
收藏
页码:1223 / 1230
页数:8
相关论文
共 50 条
[1]   On big data benchmarking [J].
Han, Rui (r.han10@imperial.ac.uk), 1600, Springer Verlag (8807) :3-18
[2]   Benchmarking Big Data Systems: A Review [J].
Han, Rui ;
John, Lizy Kurian ;
Zhan, Jianfeng .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2018, 11 (03) :580-597
[3]   From use cases to a big data benchmarking framework in clearing houses and exchanges [J].
Lewandowska, Olga ;
Mai, Edgar .
JOURNAL OF FINANCIAL MARKET INFRASTRUCTURES, 2020, 9 (01) :125-139
[4]   Benchmarking with data envelopment analysis: a return on asset perspective [J].
Joo, Seong-Jong ;
Nixon, Don ;
Stoeberl, Philipp A. .
BENCHMARKING-AN INTERNATIONAL JOURNAL, 2011, 18 (04) :529-+
[5]   Benchmarking patient improvement in physical therapy with data envelopment analysis [J].
Friesner, Daniel ;
Neufelder, Donna ;
Raisor, Janet ;
Khayum, Mohammed .
INTERNATIONAL JOURNAL OF HEALTH CARE QUALITY ASSURANCE, 2005, 18 (06) :441-+
[6]   In-Depth Appraisal of Bus Transport Services for Sustainability Performance: A Cost-Benefit Analysis Approach [J].
Alogdianakis, Filippos ;
Dimitriou, Loukas .
TRANSPORTATION RESEARCH RECORD, 2024, 2678 (05) :574-587
[7]   The enterprise data warehouse prototyping experiences [J].
Burita, L ;
Ondryhal, V ;
Trunda, M .
8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATIONS: I, 2004, :338-342
[8]   System-Auditing, Data Analysis and Characteristics of Cyber Attacks for Big Data Systems [J].
Huang, Liangyi ;
Hall, Sophia ;
Shao, Fei ;
Nihar, Arafath ;
Chaudhary, Vipin ;
Wu, Yinghui ;
French, Roger ;
Xiao, Xusheng .
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, :4872-4876
[9]   An Emulation Environment for Prototyping PMU Data Errors [J].
Idehen, Ikponmwosa ;
Mao, Zeyu ;
Overbye, Thomas .
2016 NORTH AMERICAN POWER SYMPOSIUM (NAPS), 2016,
[10]   Benchmarking Distributed Stream Data Processing Systems [J].
Karimov, Jeyhun ;
Rabl, Tilmann ;
Katsifodimos, Asterios ;
Samarev, Roman ;
Heiskanen, Henri ;
Markl, Volker .
2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, :1507-1518