Scalability and Realtime on Big Data, MapReduce, NoSQL and Spark

被引:1
作者
Furtado, Pedro [1 ,2 ]
机构
[1] Univ Coimbra, Dept Engn Informat, Polo 2, P-3030290 Coimbra, Portugal
[2] Univ Coimbra, Ctr Informat & Sistemas, Polo 2, P-3030290 Coimbra, Portugal
来源
BUSINESS INTELLIGENCE (EBISS 2016) | 2017年 / 280卷
关键词
Big data; Realtime; Scalability; NoSQL; MapReduce; DBMS; Hadoop; Spark; Spark-SQL; PARALLEL;
D O I
10.1007/978-3-319-61164-8_4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data platforms strive to achieve scalability and realtime for query processing and complex analytics over "big" and/or "fast" data. In this context, big data warehouses are huge repositories of data to be used in analytics and machine learning. This work discusses models, concepts and approaches to reach scalability and realtime in big data processing and big data warehouses. The main concepts of NoSQL, Parallel Data Management Systems (PDBMS), MapReduce and Spark are reviewed in the context of scalability. The first two offering data management, the last two adding flexible and scalable processing capacities. We also turn our attention to realtime data processing, lambda architecture and its relation with scalability, and we revisit our own recent research on the issue. Three approaches are included that are directly related to realtime and scalability: the use of a realtime component in a data warehouse, parallelized de-normalization for scalability and execution tree sharing for scaling to simultaneous sessions. With these models and technologies we revisit some of the major current solutions for data management and data processing with scalability and realtime capacities.
引用
收藏
页码:79 / 104
页数:26
相关论文
共 49 条
[1]   Survey of graph database models [J].
Angles, Renzo ;
Gutierrez, Claudio .
ACM COMPUTING SURVEYS, 2008, 40 (01)
[2]  
[Anonymous], 2010, ECONOMIST
[3]  
[Anonymous], 2011 6 INT C PERV CO
[4]  
[Anonymous], 2015, GRAPH DATABASES
[5]  
[Anonymous], 2015, THESIS
[6]  
[Anonymous], SUP CLIM NASAS BIG D
[7]  
Armbrust M., 2015, P 2015 ACM SIGMOD IN
[8]   CAP Twelve Years Later: How the "Rules" Have Changed [J].
Brewer, Eric .
COMPUTER, 2012, 45 (02) :23-29
[9]   Scalable SQL and NoSQL Data Stores [J].
Cattell, Rick .
SIGMOD RECORD, 2010, 39 (04) :12-27
[10]  
Chang F, 2006, OSDI 2006 7 S OP SYS