MapReduce for Graphs Processing: New Big Data Algorithm for 2-Edge Connected Components and Future Ideas

被引:3
作者
Dahiphale, Devendra [1 ]
机构
[1] Univ Maryland, Dept Comp Sci & Elect Engn, Baltimore, MD 21250 USA
关键词
Graph; distributed; algorithm; connected component; big data; MapReduce; undirected; small star; large star; analysis; serial algorithms; parallel; cascaded jobs; Hadoop; streaming; connectivity; CLOUD MAPREDUCE;
D O I
10.1109/ACCESS.2023.3281266
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding connectivity in graphs has numerous applications, such as social network analysis, data mining, intra-city or inter-cities connectivity, neural network, and many more. The deluge of graph applications makes graph connectivity problems extremely important and worthwhile to explore. Currently, there are many single-node algorithms for graph mining and analysis; however, those algorithms primarily apply to small graphs and are implemented on a single machine node. Finding 2-edge connected components (2-ECCs) in massive graphs (billions of edges and vertices) is impractical and time-consuming, even with the best-known single-node algorithms. Processing a big graph in a parallel and distributed fashion saves considerable time to finish processing. Moreover, it enables stream data processing by allowing quick results for vast and continuous nature data sets. This research proposes a distributed and parallel algorithm for finding 2-ECCs in big undirected graphs (subsequently called ``BiECCA'') and presents its time complexity analysis. The proposed algorithm is implemented on a MapReduce framework and uses an existing algorithm to find connected components (CCs) in a graph as a sub-step. Finally, we suggest a few novel ideas and approaches as extensions to our work.
引用
收藏
页码:54986 / 55001
页数:16
相关论文
共 42 条
[11]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[12]  
Erusalimskii Ya. M., 1980, Cybernetics, V16, P41, DOI 10.1007/BF01099359
[13]  
Fleischer LK, 2000, LECT NOTES COMPUT SC, V1800, P505
[14]  
Gentilini R, 2003, SIAM PROC S, P573
[15]  
Georgiadis L, 2013, Arxiv, DOI arXiv:1210.8303
[16]  
Hadoop, About us
[17]   ON DEGREES OF VERTICES OF DIRECTED GRAPH [J].
HAKIMI, SL .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1965, 279 (04) :290-&
[18]  
Henzinger M, 2015, Arxiv, DOI arXiv:1412.6466
[19]   Visual Adjacency Lists for Dynamic Graphs [J].
Hlawatsch, Marcel ;
Burch, Michael ;
Weiskopf, Daniel .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (11) :1590-1603
[20]   On computing the 2-vertex-connected components of directed graphs [J].
Jaberi, Raed .
DISCRETE APPLIED MATHEMATICS, 2016, 204 :164-172