Incremental and Accuracy-Aware Personalized PageRank through Scheduled Approximation

被引:34
作者
Zhu, Fanwei [1 ]
Fang, Yuan [2 ,3 ]
Chang, Kevin Chen-Chuan [2 ,3 ]
Ying, Jing [1 ]
机构
[1] Zhejiang Univ City Coll, Hangzhou, Zhejiang, Peoples R China
[2] Univ Illinois, Urbana, IL 61801 USA
[3] Adv Digital Sci Ctr, Singapore, Singapore
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 6卷 / 06期
基金
美国国家科学基金会;
关键词
D O I
10.14778/2536336.2536348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As Personalized PageRank has been widely leveraged for ranking on a graph, the efficient computation of Personalized PageRank Vector (PPV) becomes a prominent issue. In this paper, we propose FastPPV, an approximate PPV computation algorithm that is incremental and accuracy-aware. Our approach hinges on a novel paradigm of scheduled approximation: the computation is partitioned and scheduled for processing in an "organized" way, such that we can gradually improve our PPV estimation in an incremental manner, and quantify the accuracy of our approximation at query time. Guided by this principle, we develop an efficient hub based realization, where we adopt the metric of hub-length to partition and schedule random walk tours so that the approximation error reduces exponentially over iterations. Furthermore, as tours are segmented by hubs, the shared substructures between different tours (around the same hub) can be reused to speed up query processing both within and across iterations. Finally, we evaluate FastPPV over two real-world graphs, and show that it not only significantly outperforms two state-of-the-art baselines in both online and offline phrases, but also scale well on larger graphs. In particular, we are able to achieve near-constant time online query processing irrespective of graph size.
引用
收藏
页码:481 / 492
页数:12
相关论文
共 18 条
[1]  
Andersen R, 2006, ANN IEEE SYMP FOUND, P475
[2]  
Bahmani B, 2010, PROC VLDB ENDOW, V4, P173
[3]  
Bahmani Bahman, 2011, P 2011 ACM SIGMOD IN, P973
[4]  
Balmin A., 2004, P 30 INT C VER LARG, V30, P564
[5]   Bookmark-Coloring Algorithm for Personalized PageRank Computing [J].
Berkhin, Pavel .
INTERNET MATHEMATICS, 2006, 3 (01) :41-62
[6]   Index design and query processing for graph conductance search [J].
Chakrabarti, Soumen ;
Pathak, Amit ;
Gupta, Manish .
VLDB JOURNAL, 2011, 20 (03) :445-470
[7]  
Chakrabarti Soumen, 2007, PROC 16 INT C WORLD, P571, DOI DOI 10.1145/1242572.1242650
[8]   Towards Scaling Fully Personalized PageRank: Algorithms, Lower Bounds, and Experiments [J].
Fogaras, Daniel ;
Racz, Balazs ;
Csalogany, Karoly ;
Sarlos, Tamas .
INTERNET MATHEMATICS, 2005, 2 (03) :333-358
[9]  
Fujisawa Yusuke, 2012, Sayabane, V8, P15
[10]  
Gupta Manish S., 2008, WWW, P1225