Prototyping and In-Depth Analysis of Big Data Benchmarking

被引:1
作者
Pandove, Divya [1 ]
Goel, Shivani [1 ]
机构
[1] Thapar Univ, CSED, Patiala, Punjab, India
来源
CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING | 2015年
关键词
Big Data Benchmarking; Big Data Systems; Performance Measures; 4-V Data Properties; Prototype; SUITE;
D O I
10.1109/CIT/IUCC/DASC/PICOM.2015.182
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today's digital age has witnessed an explosion of data and information. This has resulted into changing the nature of data from being a medium of supporting transactions to becoming a transactional commodity itself. The consequential increase in the value of data has led to many innovations in both academic and industrial circles. The main focus remains on finding efficient ways to analyse data and derive meaningful results out of it. An efficient way of doing so is constructing benchmarks in order to effectively evaluate the performances of existing and upcoming data systems. A successful benchmark should cover all the major big data system application domains and there workloads. A prototype, outlining a small and diverse benchmark, which takes minimum time to cover a wide range of applications needs to be developed. In designing this prototype the four cornerstones of big data namely volume, veracity, velocity and variety should also be maintained. In addition to this the workload of a benchmark set should be carefully selected. It should represent a wide spectrum of application domains; have diversity of data characteristics and should not have any redundancy. Lastly, there should be a metric to evaluate the benchmarks so as to give them validity.
引用
收藏
页码:1223 / 1230
页数:8
相关论文
共 50 条
[41]   BiDaML in Practice: Collaborative Modeling of Big Data Analytics Application Requirements [J].
Khalajzadeh, Hourieh ;
Simmons, Andrew J. ;
Verma, Tarun ;
Abdelrazek, Mohamed ;
Grundy, John ;
Hosking, John ;
He, Qiang ;
Ratnakanthan, Prasanna ;
Zia, Adil ;
Law, Meng .
EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING, ENASE 2020, 2021, 1375 :106-129
[42]   Business Process Analytics and Big Data Systems: A Roadmap to Bridge the Gap [J].
Sakr, Sherif ;
Maamar, Zakaria ;
Awad, Ahmed ;
Benatallah, Boualem ;
Van Der Aalst, Wil M. P. .
IEEE ACCESS, 2018, 6 :77308-77320
[43]   Big Data Architecture for Forest Fire Management Support in the Region of Araucania [J].
Vasquez-Morales, Felipe ;
Cravero-Leal, Ania .
REVISTA CIENTIFICA, 2021, 42 (03) :304-314
[44]   Diagnostic performance dashboards: tracking diagnostic errors using big data [J].
Mane, Ketan K. ;
Rubenstein, Kevin B. ;
Nassery, Najlla ;
Sharp, Adam L. ;
Shamim, Ejaz A. ;
Sangha, Navdeep S. ;
Hassoon, Ahmed ;
Fanai, Mehdi ;
Wang, Zheyu ;
Newman-Toker, David E. .
BMJ QUALITY & SAFETY, 2018, 27 (07) :567-570
[45]   Libra and the Art of Task Sizing in Big-Data Analytic Systems [J].
Li, Rui ;
Guo, Peizhen ;
Hu, Bo ;
Hu, Wenjun .
PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, :364-376
[46]   Increasing the Accessibility to Big Data Systems via a Common Services API [J].
Malcolm, Rohan ;
Morrison, Cherrelle ;
Grandison, Tyrone ;
Thorpe, Sean ;
Christie, Kimron ;
Wallace, Akim ;
Green, Damian ;
Jarrett, Julian ;
Campbell, Arnett .
2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2014, :883-892
[47]   A Systematic Mapping of Software Engineering Approaches to Develop Big Data Systems [J].
Laigner, Rodrigo Nunes ;
Kalinowski, Marcos ;
Lifschitz, Sergio ;
Monteiro, Rodrigo Salvador ;
de Oliveira, Daniel .
44TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2018), 2018, :446-453
[48]   Design, Prototyping, and Analysis of a Novel Modular Permanent-Magnet Transverse Flux Disk Generator [J].
Hosseini, Seyedmohsen ;
Moghani, Javad Shokrollahi ;
Ershad, Nima Farrokhzad ;
Jensen, Bogi Bech .
IEEE TRANSACTIONS ON MAGNETICS, 2011, 47 (04) :772-780
[49]   Cybermycelium: a reference architecture for domain-driven distributed big data systems [J].
Ataei, Pouya .
FRONTIERS IN BIG DATA, 2024, 7
[50]   Panthera: Holistic Memory Management for Big Data Processing over Hybrid Memories [J].
Wang, Chenxi ;
Cui, Huimin ;
Cao, Ting ;
Zigman, John ;
Volos, Haris ;
Mutlu, Onur ;
Lv, Fang ;
Feng, Xiaobing ;
Xu, Guoqing Harry .
PROCEEDINGS OF THE 40TH ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '19), 2019, :347-362