Fast and Unified Local Search for Random Walk Based K-Nearest-Neighbor Query in Large Graphs

被引:43
作者
Wu, Yubao [1 ]
Jin, Ruoming [2 ]
Zhang, Xiang [1 ]
机构
[1] Case Western Reserve Univ, Dept Elect Engn & Comp Sci, Cleveland, OH 44106 USA
[2] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA
来源
SIGMOD'14: PROCEEDINGS OF THE 2014 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2014年
基金
美国国家科学基金会;
关键词
Local search; nearest neighbors; top-k search; random walk; proximity search;
D O I
10.1145/2588555.2610500
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a large graph and a query node, finding its k-nearest-neighbor (kNN) is a fundamental problem. Various random walk based measures have been developed to measure the proximity (similarity) between nodes. Existing algorithms for the random walk based top-k proximity search can be categorized as global and local methods based on their search strategies. Global methods usually require an expensive pre-computing step. By only searching the nodes near the query node, local methods have the potential to support more efficient query. However, most existing local search methods cannot guarantee the exactness of the solution. Moreover, they are usually designed for specific proximity measures. Can we devise an efficient local search method that applies to different measures and also guarantees result exactness? In this paper, we present FLoS (Fast Local Search), a unified local search method for efficient and exact top-k proximity query in large graphs. FLoS is based on the no local optimum property of proximity measures. We show that many measures have no local optimum. Utilizing this property, we introduce several simple operations on transition probabilities, which allow developing lower and upper bounds on the proximity. The bounds monotonically converge to the exact proximity when more nodes are visited. We further show that FLoS can also be applied to measures having local optimum by utilizing relationship among different measures. We perform comprehensive experiments to evaluate the efficiency and applicability of the proposed method.
引用
收藏
页码:1139 / 1150
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 2008, P 17 ACM C INF KNOWL
[2]  
[Anonymous], 1995, Reversible markov chains and random walks on graphs
[3]  
[Anonymous], 2011, P ACM SIGMOD INT C M
[4]  
[Anonymous], 2008, P 25 INT C MACH LEAR
[5]  
[Anonymous], 2010, 16 ACM SIGKDD INT C
[6]  
[Anonymous], 2001, Matrix Analysis and Applied Linear Algebra
[7]  
[Anonymous], 1953, Introductory circuit theory
[8]  
[Anonymous], 2003, ITERATIVE METHODS SP, DOI DOI 10.1137/1.9780898718003
[9]   Bookmark-Coloring Algorithm for Personalized PageRank Computing [J].
Berkhin, Pavel .
INTERNET MATHEMATICS, 2006, 3 (01) :41-62
[10]  
Bogdanov P., 2013, CIKM, P523