Evaluating the Scaling of Graph-Algorithms for Big Data using GraphX

被引:44
作者
Andersen, Jakob Smedegaard [1 ]
Zukunft, Olaf [1 ]
机构
[1] HAW Hamburg, Dept Comp Sci, Hamburg, Germany
来源
PROCEEDINGS 2016 2ND INTERNATIONAL CONFERENCE ON OPEN AND BIG DATA - OBD 2016 | 2016年
关键词
GraphX; Graph Processing; Semi-Clustering; Collaborative Filtering; Parallel Computing;
D O I
10.1109/OBD.2016.8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Graph processing has achieved a lot of attention in different big data scenarios. In this paper, we present the design, implementation, and experimental evaluation of graph processing algorithms in two different application areas. First, we use semi-clustering as an example of an algorithm typically used social network analysis. Then, we examine an algorithm for collaborative filtering as typically used in E-Commerce scenarios. For both algorithms, we make use of Apache GraphX as an existing distributed graph processing framework based on Apache Spark. As GraphX does not include these two algorithms, we describe how to implement them using a combination of GraphX and the underlying Spark Core. Based on our implementation, we perform experiments to test the scalability of both the algorithms and the GraphX processing framework. The experiments show that different kinds of graph algorithms can be supported within the Spark framework. Furthermore, we show that for our test data the algorithms scale almost linearly when properly designed.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 19 条
[11]  
Malewicz G., 2010, P 2010 ACM SIGMOD IN, P135, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
[12]   Techniques for Graph Analytics on Big Data [J].
Nisar, M. Usman ;
Fard, Arash ;
Miller, John A. .
2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, :255-262
[13]  
Sharma S., 2014, Data Science Journal, V13, P138, DOI [DOI 10.2481/DSJ.14-041, DOI 10.2481/dsj.14-041]
[14]  
Silberstein A., 2011, DBSOCIAL, P1
[15]   A BRIDGING MODEL FOR PARALLEL COMPUTATION [J].
VALIANT, LG .
COMMUNICATIONS OF THE ACM, 1990, 33 (08) :103-111
[16]  
Xin R. S., 2014, ARXIV E PRINTS
[17]  
Xin Reynold S, 2013, 1 INT WORKSH GRAPH D, P1
[18]  
Zaharia M., 2010, 2 USENIX WORKSHOP HO, V10, P95
[19]  
Zaharia Matei., 2012, NSDI 12