An Efficient Parallel Triangle Enumeration on the MapReduce Framework

被引:0
作者
Kim, Hongyeon [1 ]
Min, Jun-Ki [1 ]
机构
[1] Korea Univ Tech & Edu, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
triangle enumeration; vertex partition; graph data; MapReduce;
D O I
10.1587/transinf.2018EDP7421
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A triangle enumerating problem is one of fundamental problems of graph data. Although several triangle enumerating algorithms based on MapReduce have been proposed, they still suffer from generating a lot of intermediate data. In this paper, we propose the efficient MapReduce algorithms to enumerate every triangle in the massive graph based on a vertex partition. Since a triangle is composed of an edge and a wedge, our algorithms check the existence of an edge connecting the end-nodes of each wedge. To generate every triangle from a graph in parallel, we first split a graph into several vertex partitions and group the edges and wedges in the graph for each pair of vertex partitions. Then, we form the triangles appearing in each group. Furthermore, to enhance the performance of our algorithm, we remove the duplicated wedges existing in several groups. Our experimental evaluation shows the performance of our proposed algorithm is better than that of the state-of-the-art algorithm in diverse environments.
引用
收藏
页码:1902 / 1915
页数:14
相关论文
共 20 条
[1]   Parallel Triangle Counting and Enumeration using Matrix Algebra [J].
Azad, Ariful ;
Buluc, Aydin ;
Gilbert, John .
2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, :804-811
[2]   Graph structure in the Web [J].
Broder, A ;
Kumar, R ;
Maghoul, F ;
Raghavan, P ;
Rajagopalan, S ;
Stata, R ;
Tomkins, A ;
Wiener, J .
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 33 (1-6) :309-320
[3]   Graph Twiddling in a MapReduce World [J].
Cohen, Jonathan .
COMPUTING IN SCIENCE & ENGINEERING, 2009, 11 (04) :29-41
[4]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[5]   Community structure in social and biological networks [J].
Girvan, M ;
Newman, MEJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (12) :7821-7826
[6]  
Hu X., 2013, P SIGMOD, P325
[7]  
Kwak Haewoon, 2010, INT WORLD WIDE WEB C, P591, DOI DOI 10.1145/1772690.1772751
[8]   JOIN PROCESSING IN RELATIONAL DATABASES [J].
MISHRA, P ;
EICH, MH .
COMPUTING SURVEYS, 1992, 24 (01) :63-113
[9]  
Park H.-M., 2014, ACM C INF KNOWL MAN, P1739
[10]   PTE: Enumerating Trillion Triangles On Distributed Systems [J].
Park, Ha-Myung ;
Myaeng, Sung-Hyon ;
Kang, U. .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1115-1124