Prototyping and In-Depth Analysis of Big Data Benchmarking

Cited by: 1
Authors
Pandove, Divya [1]
Goel, Shivani [1]
Affiliations
[1] Thapar Univ, CSED, Patiala, Punjab, India
Source
CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING | 2015
Keywords
Big Data Benchmarking; Big Data Systems; Performance Measures; 4-V Data Properties; Prototype; SUITE
DOI
10.1109/CIT/IUCC/DASC/PICOM.2015.182
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Today's digital age has witnessed an explosion of data and information. This has changed the nature of data from a medium for supporting transactions into a transactional commodity in its own right. The consequent increase in the value of data has led to many innovations in both academic and industrial circles. The main focus remains on finding efficient ways to analyse data and derive meaningful results from it. One efficient way of doing so is to construct benchmarks that effectively evaluate the performance of existing and upcoming data systems. A successful benchmark should cover all the major big data system application domains and their workloads. A prototype needs to be developed that outlines a small, diverse benchmark covering a wide range of applications in minimum time. In designing this prototype, the four cornerstones of big data, namely volume, veracity, velocity and variety, should also be maintained. In addition, the workload of a benchmark set should be carefully selected: it should represent a wide spectrum of application domains, have diverse data characteristics, and contain no redundancy. Lastly, there should be a metric for evaluating the benchmarks so as to give them validity.
Pages: 1223-1230
Page count: 8