Spark: A Big Data Processing Platform Based on Memory Computing

Cited by: 25
Authors
Han, Zhijie [1]
Zhang, Yujie [2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Inst Data & Knowledge Engn, Nanjing, Jiangsu, Peoples R China
[2] Henan Univ, Inst Data & Knowledge Engn, Kaifeng, Henan, Peoples R China
Source
2015 SEVENTH INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP) | 2015
Keywords
Spark; Memory Computing; Spark SQL; MLlib; GraphX; Spark Streaming
DOI
10.1109/PAAP.2015.41
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Spark is a memory-based computing framework with strong computing performance and fault tolerance that supports batch, interactive, iterative, and streaming computation. In this paper, we analyze Spark's primary framework and core technologies and point out its advantages and disadvantages. Finally, we discuss future trends in Spark technologies.
Pages: 172-176
Number of pages: 5