R3MAT: A Rapid and Robust Graph Generator

被引:3
作者
Angles, Renzo [1 ]
Paredes, Rodrigo [1 ]
Garcia, Roberto [2 ]
机构
[1] Univ Talca, Fac Engn, Dept Comp Sci, Curico 3340000, Chile
[2] Univ Talca, Fac Engn, Engn Syst Doctoral Program, Curico 3340000, Chile
关键词
Generators; Partitioning algorithms; !text type='Java']Java[!/text; Tools; Robustness; Medical services; Network theory (graphs); data engineering; performance analysis; NETWORKS;
D O I
10.1109/ACCESS.2020.3009577
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the main problems when developing graph-based applications is the availability of large and representative datasets. The lack of real graphs has motivated the development of software tools for generating synthetic graphs. R-MAT is a data generation method that was designed to produce synthetic graphs whose characteristics resemble those occurring in real networks. Although the generation model defined by R-MAT is easy to understand, its implementation is not trivial and it has intrinsic memory restrictions that makes the generation of very large graphs difficult. This paper studies the practical implementation of R-MAT. We discuss the issues of the original implementation which works with the adjacency matrix of the graph and analyze the size of the resulting graph obtained with the R-MAT model. Then, we introduce and experimentally evaluate R(3)MAT, an alternative implementation for R-MAT based on an array of degrees. These experiments show that (i) our R(3)MAT is able to generate graphs of hundred million nodes and billion edges in a single machine, (ii) our method preserves the characteristic power-law distribution of the edge degrees present in real-world graphs, and (iii) R(3)MAT has the best performance in the current state of the art, when considering a single modest computer in a sequential fashion.
引用
收藏
页码:130048 / 130065
页数:18
相关论文
共 32 条
[1]  
Angles R., 2013, 1 INT WORKSHOP GRAPH
[2]  
Bader DavidA., 2009, Hpc scalable graph analysis benchmark v1.0
[3]   gMark: Schema-Driven Generation of Graphs and Queries [J].
Bagan, Guillaume ;
Bonifati, Angela ;
Ciucanu, Radu ;
Fletcher, George H. L. ;
Lemay, Aurelien ;
Advokaat, Nicky .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (04) :856-869
[4]  
Chakrabarti D, 2004, SIAM PROC S, P442
[5]  
Chakrabarti D., 2012, Graph Mining: Laws Tools and Case studies
[6]   Graph mining: Laws, generators, and algorithms [J].
Chakrabarti, Deepayan ;
Faloutsos, Christos .
ACM COMPUTING SURVEYS, 2006, 38 (01) :A1-A69
[7]   Visualization of large networks with min-cut plots, A-plots and R-MAT [J].
Chakrabarti, Deepayan ;
Faloutsos, Christos ;
Zhan, Yiping .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2007, 65 (05) :434-445
[8]   Community Detection in Networks Based on Modified PageRank and Stochastic Block Model [J].
Chen, Jing ;
Xu, Guangluan ;
Wang, Yang ;
Zhang, Yuanben ;
Wang, Lei ;
Sun, Xian .
IEEE ACCESS, 2018, 6 :77133-77144
[9]   Power-Law Distributions in Empirical Data [J].
Clauset, Aaron ;
Shalizi, Cosma Rohilla ;
Newman, M. E. J. .
SIAM REVIEW, 2009, 51 (04) :661-703
[10]   Characterization of complex networks: A survey of measurements [J].
Costa, L. Da F. ;
Rodrigues, F. A. ;
Travieso, G. ;
Boas, P. R. Villas .
ADVANCES IN PHYSICS, 2007, 56 (01) :167-242