Big data analytics in Cloud computing: an overview

被引:47
作者
Berisha, Blend [1 ]
Meziu, Endrit [1 ]
Shabani, Isak [1 ]
机构
[1] Univ Prishtina, Fac Elect & Comp Engn, Dept Comp Engn, Prishtina 10000, Kosovo
来源
JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS | 2022年 / 11卷 / 01期
关键词
Big data; Analytics; BigQuery; Cloud computing;
D O I
10.1186/s13677-022-00301-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big Data and Cloud Computing as two mainstream technologies, are at the center of concern in the IT field. Every day a huge amount of data is produced from different sources. This data is so big in size that traditional processing tools are unable to deal with them. Besides being big, this data moves fast and has a lot of variety. Big Data is a concept that deals with storing, processing and analyzing large amounts of data. Cloud computing on the other hand is about offering the infrastructure to enable such processes in a cost-effective and efficient manner. Many sectors, including among others businesses (small or large), healthcare, education, etc. are trying to leverage the power of Big Data. In healthcare, for example, Big Data is being used to reduce costs of treatment, predict outbreaks of pandemics, prevent diseases etc. This paper, presents an overview of Big Data Analytics as a crucial process in many fields and sectors. We start by a brief introduction to the concept of Big Data, the amount of data that is generated on a daily bases, features and characteristics of Big Data. We then delve into Big Data Analytics were we discuss issues such as analytics cycle, analytics benefits and the movement from ETL to ELT paradigm as a result of Big Data analytics in Cloud. As a case study we analyze Google's BigQuery which is a fully-managed, serverless data warehouse that enables scalable analysis over petabytes of data. As a Platform as a Service (PaaS) supports querying using ANSI SQL. We use the tool to perform different experiments such as average read, average compute, average write, on different sizes of datasets.
引用
收藏
页数:10
相关论文
共 19 条
[1]  
[Anonymous], 2018, BIG DATA ARCHITECTS
[2]  
[Anonymous], 2019, WHISHWORKS
[3]  
[Anonymous], 2018, LAPRINTHX
[4]  
[Anonymous], 2020, Forbes
[5]  
[Anonymous], 2018, PC Magazine
[6]  
[Anonymous], 2018, forbes
[7]  
[Anonymous], 2019, XPLENTY
[8]  
EDHEC-Risk Institute, 2010, AD GREEN INV I INV E
[9]  
Gewirtz David., 2018, ZDNet
[10]  
Google Cloud, 2020, BIGQUERY