Fast and Exact Top-k Search for Random Walk with Restart

Cited by: 86
Authors
Fujiwara, Yasuhiro [1]
Nakatsuji, Makoto [2]
Onizuka, Makoto [1]
Kitsuregawa, Masaru [3]
Affiliations
[1] NTT Cyber Space Labs, Kanazawa, Ishikawa, Japan
[2] NTT Cyber Sol Labs, Kanazawa, Ishikawa, Japan
[3] Univ Tokyo, Tokyo, Japan
Source
PROCEEDINGS OF THE VLDB ENDOWMENT | 2012, Vol. 5, No. 5
DOI
10.14778/2140436.2140441
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Graphs are fundamental data structures and have been employed for centuries to model real-world systems and phenomena. Random walk with restart (RWR) provides a good proximity score between two nodes in a graph, and it has been successfully used in many applications such as automatic image captioning, recommender systems, and link prediction. The goal of this work is to find the k nodes that have the highest proximities to a given node. Previous approaches to this problem find nodes efficiently at the expense of exactness. The main motivation of this paper is to answer, in the affirmative, the question: "Is it possible to improve the search time without sacrificing exactness?" Our solution, K-dash, is based on two ideas: (1) it computes the proximity of a selected node efficiently by using sparse matrices, and (2) it skips unnecessary proximity computations when searching for the top-k nodes. Theoretical analyses show that K-dash guarantees exact results. We perform comprehensive experiments to verify the efficiency of K-dash. The results show that K-dash can find the top-k nodes significantly faster than the previous approaches while guaranteeing exactness.
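To make the problem statement concrete, the following is a minimal sketch of top-k search with RWR using plain power iteration. It is a baseline for illustration only and is not K-dash: it computes the proximity of every node and skips nothing. The function names `rwr_scores` and `top_k`, the restart probability of 0.15, the convergence tolerance, and the handling of nodes without out-edges are all assumptions that do not come from the record above.

```python
def rwr_scores(edges, num_nodes, query, restart=0.15, tol=1e-12, max_iter=10000):
    """Baseline iterative RWR (not K-dash).

    Iterates p = (1 - c) * A p + c * q, where A is the column-normalized
    adjacency matrix, c is the restart probability (0.15 is an assumed
    default) and q is the indicator vector of the query node.
    """
    # Out-neighbor lists: an edge (u, v) lets the walker step from u to v.
    out = [[] for _ in range(num_nodes)]
    for u, v in edges:
        out[u].append(v)

    # Start with all probability mass on the query node.
    p = [0.0] * num_nodes
    p[query] = 1.0

    for _ in range(max_iter):
        nxt = [0.0] * num_nodes
        nxt[query] = restart  # the c * q term
        for u in range(num_nodes):
            if not out[u]:
                # Assumption: a walker stuck at a node with no out-edges
                # jumps back to the query node, so total mass stays 1.
                nxt[query] += (1.0 - restart) * p[u]
                continue
            share = (1.0 - restart) * p[u] / len(out[u])
            for v in out[u]:
                nxt[v] += share
        diff = sum(abs(a - b) for a, b in zip(nxt, p))
        p = nxt
        if diff < tol:
            break
    return p


def top_k(scores, k):
    """Return the indices of the k highest scores (ties broken by index)."""
    return sorted(range(len(scores)), key=lambda i: (-scores[i], i))[:k]


if __name__ == "__main__":
    # Undirected path graph 0 - 1 - 2 - 3, stored as directed edge pairs.
    path = [(0, 1), (1, 0), (1, 2), (2, 1), (2, 3), (3, 2)]
    scores = rwr_scores(path, num_nodes=4, query=0)
    print(top_k(scores, 2))
```

Each iteration touches every edge once, so the cost per query grows with graph size and with the number of iterations needed to converge. The abstract describes K-dash as avoiding this by computing individual proximities through sparse matrices and by skipping nodes that cannot be in the top k.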
Pages: 442-453
Number of pages: 12