Fast Matrix Computations for Pairwise and Columnwise Commute Times and Katz Scores

被引:26
作者
Bonchi, Francesco [1 ]
Esfandiar, Pooya [2 ]
Gleich, David F. [3 ]
Greif, Chen [4 ]
Lakshmanan, Laks V. S. [4 ]
机构
[1] Yahoo Res, Avinguda Diagonal 177,8th Floor, Barcelona 08018, Spain
[2] Ayogo Games, Vancouver, BC V6B 1E8, Canada
[3] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[4] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
关键词
D O I
10.1080/15427951.2012.625256
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We explore methods for approximating the commute time and Katz score between a pair of nodes. These methods are based on the approach of matrices, moments, and quadrature developed in the numerical linear algebra community. They rely on the Lanczos process and provide upper and lower bounds on an estimate of the pairwise scores. We also explore methods to approximate the commute times and Katz scores from a node to all other nodes in the graph. Here, our approach for the commute times is based on a variation of the conjugate gradient algorithm, and it provides an estimate of all the diagonals of the inverse of a matrix. Our technique for the Katz scores is based on exploiting an empirical localization property of the Katz matrix. We adapt algorithms used for personalized PageRank computing to these Katz scores and theoretically show that this approach is convergent. We evaluate these methods on 17 real-world graphs ranging in size from 1000 to 1,000,000 nodes. Our results show that our pairwise commute-time method and columnwise Katz algorithm both have attractive theoretical properties and empirical performance.
引用
收藏
页码:73 / 112
页数:40
相关论文
共 45 条
  • [1] Link Prediction on Evolving Data using Matrix and Tensor Factorizations
    Acar, Evrim
    Dunlavy, Daniel M.
    Kolda, Tamara G.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 262 - +
  • [2] Andersen Reid, 2006, P 47 ANN IEEE S FOUN
  • [3] Quadrature rule-based bounds for functions of adjacency matrices
    Benzi, Michele
    Boito, Paola
    [J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 2010, 433 (03) : 637 - 652
  • [4] Bookmark-Coloring Algorithm for Personalized PageRank Computing
    Berkhin, Pavel
    [J]. INTERNET MATHEMATICS, 2006, 3 (01) : 41 - 62
  • [5] Blackford LS, 1996, P 1996 ACM IEEE C SU
  • [6] Boldi Paolo, 2004, P 13 INT C ONWORLD W, P595
  • [7] Boldi Paolo, 2011, P 20 INT C WORLD WID, P587, DOI DOI 10.1145/1963405.1963488
  • [8] Variational Bayesian image restoration based on a product of t-distributions image prior
    Chantas, Glannis
    Galatsanos, Nikolaos
    Likas, Aristidis
    Saunders, Michael
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (10) : 1795 - 1805
  • [9] Chung F., 2003, ANN COMB, V7, P21, DOI DOI 10.1007/S000260300002
  • [10] Davis PJ., 1984, METHODS NUMERICAL IN