Distributed Graph Simulation: Impossibility and Possibility

被引:32
作者
Fan, Wenfei [1 ,2 ]
Wang, Xin [4 ]
Wu, Yinghui [1 ,3 ]
Deng, Dong [5 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[2] Beihang Univ, RCBD & SKLSDE Lab, Beijing, Peoples R China
[3] Washington State Univ, Pullman, WA USA
[4] Southwest Jiaotong Univ, Chengdu, Sichuan, Peoples R China
[5] Tsinghua Univ, Beijing, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2014年 / 7卷 / 12期
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.14778/2732977.2732983
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies fundamental problems for distributed graph simulation. Given a pattern query Q and a graph G that is fragmented and distributed, a graph simulation algorithm A is to compute the matches Q(G) of Q in G. kVe say that A is parallel scalable in (a) response time if its parallel computational cost is determined by the largest fragment F-m of G and the size vertical bar Q vertical bar of query Q, and (b) data shipment if its total amount of data shipped is determined by vertical bar Q vertical bar and the number of fragments of C. independent of the size of graph G. (1) We prove an impossibility theorem: there exists no distributed graph simulation algorithm that is parallel scalable in either response time or data shipment. (2) However, we show that distributed graph simulation is par ion bounded, i.e., its response time depends only on vertical bar Q vertical bar, vertical bar F-m vertical bar and the number vertical bar V-f vertical bar of nodes in G with edges across different fragments; and its data shipment depends on vertical bar Q vertical bar and the number vertical bar E-f vertical bar of crossing edges only. We provide the first algorithms with these performance guarantees. (3) We also identify special cases of patterns and graphs when parallel scalability is possible. (4) We experimentally verify the scalability and efficiency of our algorithms.
引用
收藏
页码:1083 / 1094
页数:12
相关论文
共 32 条
[1]  
Anh V. L., 2007, ADRIS RES COMMUNIC
[2]  
[Anonymous], PVLDB
[3]  
Blom S., 2005, STTT, V7
[4]  
Brynielsson J., 2010, ASONAM
[5]  
Buluc A., 2012, GRAPH PARTITIONING G, P83
[6]   Maximizing data locality in distributed systems [J].
Chung, Fan ;
Graharn, Ronald ;
Bhagwan, Ranjita ;
Savage, Stefan .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2006, 72 (08) :1309-1316
[7]  
Cong G., 2007, SIGMOD
[8]  
Fan W., 2012, PVLDB, V5
[9]  
Fan W., 2013, TODS, V38
[10]  
Fard A., 2013, BIG DATA