FrogWild! - Fast Page Rank Approximations on Graph Engines

被引:16
作者
Mitliagkas, Ioannis [1 ]
Borokhovich, Michael [1 ]
Dimakis, Alexandros G. [1 ]
Caramanis, Constantine [1 ]
机构
[1] UT Austin, ECE, Austin, TX 78712 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2015年 / 8卷 / 08期
关键词
D O I
10.14778/2757807.2757812
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose FROGWILD, a novel algorithm for fast approximation of high Page Rank vertices, geared towards reducing network costs of running traditional Page Rank algorithms. Our algorithm can be seen as a quantized version of power iteration that performs multiple parallel random walks over a directed graph. One important innovation is that we introduce a modification to the Graph Lab framework that only partially synchronizes mirror vertices. This partial synchronization vastly reduces the network traffic generated by traditional Page Rank algorithms, thus greatly reducing the per-iteration cost of Page Rank. On the other hand, this partial synchronization also creates dependencies between the random walks used to estimate Page Rank. Our main theoretical innovation is the analysis of the correlations introduced by this partial synchronization process and a bound establishing that our approximation is close to the true Page Rank vector. We implement our algorithm in Graph Lab and compare it against the default Page Rank implementation. We show that our algorithm is very fast, performing each iteration in less than one second on the Twitter graph and can be up to 7x faster compared to the standard Graph Lab Page Rank implementation.
引用
收藏
页码:874 / 885
页数:12
相关论文
共 30 条
[1]  
Agarwal Alekh, 2007, P 24 INT C MACH LEAR, P9, DOI [DOI 10.1145/1273496.1273498, 10.1145/1273496.1273498]
[2]  
Andersen R, 2007, LECT NOTES COMPUT SC, V4863, P150
[3]  
[Anonymous], 1999, TEXT APPL M
[4]   Monte Carlo methods in pagerank computation: When one iteration is sufficient [J].
Avrachenkov, K. ;
Litvak, N. ;
Nemirovsky, D. ;
Osipova, N. .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 2007, 45 (02) :890-904
[5]  
Avrachenkov K., 2010, CORR
[6]   Spectral Sparsification of Graphs: Theory and Algorithms [J].
Batson, Joshua ;
Spielman, Daniel A. ;
Srivastava, Nikhil ;
Teng, Shang-Hua .
COMMUNICATIONS OF THE ACM, 2013, 56 (08) :87-94
[7]  
BECCHETTI L., 2006, P 15 INT C WORLD WID, P941
[8]  
Benczur A. A., 1996, Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, P47, DOI 10.1145/237814.237827
[9]   A Survey on PageRank Computing [J].
Berkhin, Pavel .
INTERNET MATHEMATICS, 2005, 2 (01) :73-120
[10]  
Borokhovich M., 2014, FROGWILD CODE REPOSI