A Novel and Fast SimRank Algorithm

被引:17
作者
Lu, Juan [1 ]
Gong, Zhiguo [1 ]
Lin, Xuemin [2 ,3 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Taipa, Macau, Peoples R China
[2] East China Normal Univ, Huashida 200062, Putuo, Peoples R China
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
关键词
SimRank; linear system; conjugate gradient;
D O I
10.1109/TKDE.2016.2626282
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
SimRank is a widely adopted similarity measure for objects modeled as nodes in a graph, based on the intuition that two objects are similar if they are referenced by similar objects. The recursive nature of SimRank definition makes it expensive to compute the similarity score even for a single pair of nodes. This defect limits the applications of SimRank. To speed up the computation, some existing works replace the original model with an approximate model to seek only rough solution of SimRank scores. In this work, we propose a novel solution for computing all-pair SimRank scores. In particular, we propose to convert SimRank to the problem of solving a linear system in matrix form, and further prove that the system is non-singular, diagonally dominate, and symmetric definite positive (for undirected graphs). Those features immediately lead to the adoption of Conjugate Gradient (CG) and Bi-Conjugate Gradient (BiCG) techniques for efficiently computing SimRank scores. As a result, a significant improvement on the convergence rate can be achieved; meanwhile, the sparsity of the adjacency matrix is not damaged all the time. Inspired by the existing common neighbor sharing strategy, we further reduce the computational complexity of the matrix multiplication and resolve the scalable issues. The experimental results show our proposed algorithms significantly outperform the state-of-the-art algorithms.
引用
收藏
页码:572 / 585
页数:14
相关论文
共 31 条
[1]  
Abbassi Z., 2007, WEBKDDSNA KDD LECT N, V5439, P139
[2]  
[Anonymous], 2010, KDD
[3]  
[Anonymous], 2006, P 32 INT C VER LARG
[4]  
[Anonymous], 1952, METHODS CONJUGATE GR
[5]  
[Anonymous], 2003, P 12 INT C WORLD WID
[6]  
[Anonymous], 2005, P 14 INT C WORLD WID, DOI DOI 10.1145/1060745.1060839
[7]   Simrank++: Query Rewriting through Link Analysis of the Click Graph [J].
Antonellis, Ioannis ;
Molina, Hector Garcia ;
Chang, Chi Chao .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01) :408-421
[8]  
Axelsson O., 1977, Sparse matrix techniques, V572, P1
[9]  
Bellman R., 1970, Introduction To Matrix Analysis, V960