Giraph Unchained: Barrier less Asynchronous Parallel Execution in Pregel-like Graph Processing Systems

被引:103
作者
Han, Minyang [1 ]
Daudjee, Khuzaima [1 ]
机构
[1] Univ Waterloo, David R Cheriton Sch Comp Sci, Waterloo, ON N2L 3G1, Canada
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2015年 / 8卷 / 09期
关键词
D O I
10.14778/2777598.2777604
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The bulk synchronous parallel (BSP) model used by synchronous graph processing systems allows algorithms to be easily implemented and reasoned about. However, BSP can suffer from poor performance due to stale messages and frequent global synchronization barriers. Asynchronous computation models have been proposed to alleviate these overheads but existing asynchronous systems that implement such models have limited scalability or retain frequent global barriers, and do not always support graph mutations or algorithms with multiple computation phases. We propose barrierless asynchronous parallel (BAP), a new computation model that reduces both message staleness and global synchronization. This enables BAP to overcome the limitations of existing asynchronous models while retaining support for graph mutations and algorithms with multiple computation phases. We present GiraphUC, which implements our BAP model in the open source distributed graph processing system Giraph, and evaluate our system at scale with large real-world graphs on 64 EC2 machines. We show that GiraphUC provides across-the-board performance improvements of up to 5x faster over synchronous systems and up to an order of magnitude faster than asynchronous systems. Our results demonstrate that the BAP model provides efficient and transparent asynchronous execution of algorithms that are programmed synchronously.
引用
收藏
页码:950 / 961
页数:12
相关论文
共 33 条
  • [1] [Anonymous], [No title captured]
  • [2] Backstrom L., 2012, WEBSCI 12
  • [3] UbiCrawler: a scalable fully distributed Web crawler
    Boldi, P
    Codenotti, B
    Santini, M
    Vigna, S
    [J]. SOFTWARE-PRACTICE & EXPERIENCE, 2004, 34 (08) : 711 - 726
  • [4] BOLDI P., 2011, WWW 11
  • [5] Boldi P., 2012, 4 DEGREES SEPARATION
  • [6] Boldi P., 2004, WWW 04
  • [7] Borkar V. R., 2011, ICDE 11
  • [8] Bu Y., 2015, P VLDB END, P161
  • [9] Ching Avery, 2013, SCALING APACHE GIRAP
  • [10] Chung S., IPPS 96