Techniques for Graph Analytics on Big Data

被引:19
作者
Nisar, M. Usman [1 ]
Fard, Arash [1 ]
Miller, John A. [1 ]
机构
[1] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
来源
2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA | 2013年
关键词
graph analytics; big data; graph simulation; parallel and distributed algorithms; ALGORITHMS; SCHEME;
D O I
10.1109/BigData.Congress.2013.78
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Graphs enjoy profound importance because of their versatility and expressivity. They can be effectively used to represent social networks, web search engines and genome sequencing. The field of graph pattern matching has been of significant importance and has wide-spread applications. Conceptually, we want to find subgraphs that match a pattern in a given graph. Much work has been done in this field with solutions like Subgraph Isomorphism and Regular Expression matching. With Big Data, scientists are frequently running into massive graphs that have amplified the challenge that this area poses. We study the speedup and communication behavior of three distributed algorithms for inexact graph pattern matching. We also study the impact of different graph partitionings on runtime and network I/O. Our extensive results show that the algorithms exhibit excellent scalable behavior and min-cut partitioning can lead to improved performance under some circumstances, and can drastically reduce the network traffic as well.
引用
收藏
页码:255 / 262
页数:8
相关论文
共 29 条
[1]  
[Anonymous], P 2010 ACM SIGMOD IN, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
[2]  
[Anonymous], 2010, P 26 C UNC ART INT
[3]  
[Anonymous], 1990, COMPUT INTRACTABILIT
[4]  
[Anonymous], 2005, The Parallel BGL: A generic library for distributed graph computations
[5]   Detecting Social Positions using Simulation [J].
Brynielsson, Joel ;
Hogberg, Johanna ;
Kaati, Lisa ;
Martenson, Christian ;
Svenson, Pontus .
2010 INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2010), 2010, :48-55
[6]   CGMgraph/CGMlib: Implementing and testing cgm graph algorithms on PC clusters and shared memory machines [J].
Chan, A ;
Dehne, F ;
Taylor, R .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2005, 19 (01) :81-97
[7]   Thirty years of graph matching in pattern recognition [J].
Conte, D ;
Foggia, P ;
Sansone, C ;
Vento, M .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2004, 18 (03) :265-298
[8]   A (sub)graph isomorphism algorithm for matching large graphs [J].
Cordella, LP ;
Foggia, P ;
Sansone, C ;
Vento, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (10) :1367-1372
[9]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[10]   Graph Pattern Matching: From Intractable to Polynomial Time [J].
Fan, Wenfei ;
Li, Jianzhong ;
Ma, Shuai ;
Tang, Nan ;
Wu, Yinghui ;
Wu, Yunpeng .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :264-275