A heuristic search algorithm based on subspaces for PageRank computation

被引:2
作者
Miyata, Takafumi [1 ]
机构
[1] Fukuoka Inst Technol, Dept Comp Sci & Engn, Higashi Ku, 3-30-1 Wajiro Higashi, Fukuoka, Fukuoka 8110295, Japan
关键词
PageRank; Google matrix; Power iteration; Krylov subspace; Residual minimization; Parallel computing;
D O I
10.1007/s11227-018-2383-9
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We studied a fast algorithm for the large-scale computation of PageRank. PageRank is what the Google search engine uses to simulate the importance of web pages. It is defined by the eigenvector of a particular stochastic matrix related to the graphs of web pages. The power method is the typical means to compute the eigenvector, while the Krylov subspace method shows faster convergence, which can be regarded as a two-step algorithm. The first step predicts the eigenvector, and the second step corrects the predicted result. More precisely, the power method is first iterated to compute the eigenvector approximately. Secondly, a Krylov subspace spanned by the approximations is searched for a better approximate eigenvector in terms of minimizing a residual. To get a better approximation efficiently, we consider using subspaces not only at the second step but also at the first step. Specifically, a Krylov subspace is first used to compute an approximate eigenvector, by which another subspace is expanded. Secondly, this non-Krylov subspace is searched for a better approximate eigenvector that minimizes its residual over the subspace. This paper describes a heuristic search algorithm iterating the two steps alternately and presents its efficient implementation. Experimental results with huge Google matrices illustrate improvements in performance of the algorithm.
引用
收藏
页码:3278 / 3294
页数:17
相关论文
共 50 条
  • [31] IDL: Evaluating software quality based on pagerank algorithm
    Guoqiang Z.
    Yi F.
    Shuai Z.
    Yilun W.
    Peng L.
    Guilan D.
    Journal of China Universities of Posts and Telecommunications, 2020, 27 (01): : 10 - 25
  • [32] A note on the PageRank algorithm
    Sun, Huan
    Wei, Yimin
    APPLIED MATHEMATICS AND COMPUTATION, 2006, 179 (02) : 799 - 806
  • [33] Improved PageRank Algorithm Based on the Residence Time of the Website
    Liu, Dian-Xing
    Yan, Xia
    Xie, Wei
    INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2012, 2012, 7390 : 601 - 607
  • [34] An Arnoldi-Extrapolation algorithm for computing PageRank
    Wu, Gang
    Wei, Yimin
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2010, 234 (11) : 3196 - 3212
  • [35] Topic-sensitive PageRank: A context-sensitive ranking algorithm for Web search
    Haveliwala, TH
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) : 784 - 796
  • [36] PageRank Computation for Higher-Order Networks
    Coquide, Celestin
    Queiros, Julie
    Queyroi, Francois
    COMPLEX NETWORKS & THEIR APPLICATIONS X, VOL 1, 2022, 1015 : 183 - 193
  • [37] Revisiting Local Computation of PageRank: Simple and Optimal
    Wang, Hanzhi
    Wei, Zhewei
    Wen, Ji-Rong
    Yang, Mingji
    PROCEEDINGS OF THE 56TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2024, 2024, : 911 - 922
  • [38] Keyword Extraction from News Articles Based on PageRank Algorithm
    Gu Y.-R.
    Xu M.-X.
    Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2017, 46 (05): : 777 - 783
  • [39] The PageRank vector: Properties, computation, approximation, and acceleration
    Brezinski, Claude
    Redivo-Zaglia, Michela
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2006, 28 (02) : 551 - 575
  • [40] RDF Data Assessment Based on Metrics and Improved PageRank Algorithm
    Wei, Kai
    Tian, Pingfang
    Gu, Jinguang
    Huang, Li
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), 2017, 10179 : 204 - 212