Big Data Processing Stacks

被引:10
作者
Sakr S. [1 ]
机构
[1] Sakr, Sherif
关键词
big data; data analysis; Flink; Hadoop; Spark;
D O I
10.1109/MITP.2017.6
中图分类号
学科分类号
摘要
After roughly a decade of dominance by the Hadoop framework in the big data processing world, we are witnessing the emergence of various stacks that have been enhanced with domain-specific, optimized, and vertically focused big data processing features. The author analyzes in detail the capabilities of various big data processing stacks and provides insights and guidelines about the latest ongoing developments in this domain. © 2017 IEEE.
引用
收藏
页码:34 / 41
页数:7
相关论文
共 12 条
[1]  
Sakr S., Liu A., Fayoumi A.G., The family of MapReduce and large-scale data processing systems, ACM Computing Surveys, 46, 1, (2013)
[2]  
Sakr S., Et al., A survey of large scale data management approaches in cloud environments, IEEE Comm. Surveys & Tutorials, 13, 3, pp. 311-336, (2011)
[3]  
Dean J., Ghemawa S., MapReduce: Simplified data processing on large clusters, Proc. 6th Conf. Symp. Operating Systems Design & Implementation (OSDI), (2004)
[4]  
Pavlo A., Et al., A comparison of approaches to large-scale data analysis, Proc. 2009 ACM SIGMOD Int'l Conf. Management of Data, pp. 165-178, (2009)
[5]  
Huai Y., Et al., Major technical advancements in apache hive, Proc. 2014 ACM SIGMOD Int'l Conf. Management of Data, pp. 1235-1246, (2014)
[6]  
Malewicz G., Et al., Pregel: A system for large-scale graph processing, Proc. 2010 ACM SIGMOD Int'l Conf. Management of Data, pp. 135-146, (2010)
[7]  
Armbrust M., Et al., Spark SQL: Relational data processing in spark, Proc. 2015 ACM SIGMOD Int'l Conf. Management of Data, pp. 1383-1394, (2015)
[8]  
Sparks E.R., Et al., MLI: An API for distributed machine learning, Proc. IEEE Int'l Conf. Data Mining, (2013)
[9]  
Gonzalez J.E., Et al., GraphX: Graph processing in a distributed dataflow framework, Proc. 11th Usenix Symp. Operating Systems Design and Implementation (OSDI), pp. 599-613, (2014)
[10]  
Alexandrov A., Et al., The stratosphere platform for big data analytics, Int'l J. Very Large Databases, 23, 6, pp. 939-964, (2014)