Accelerating pairwise SimRank estimation over static and dynamic graphs

被引:1
作者
Yue Wang
Lei Chen
Yulin Che
Qiong Luo
机构
[1] Hong Kong University of Science and Technology,Department of Computer Science and Engineering
来源
The VLDB Journal | 2019年 / 28卷
关键词
SimRank; Dynamic graph; Graph theory; Similarity measure;
D O I
暂无
中图分类号
学科分类号
摘要
Measuring similarities among different vertices is a fundamental problem in graph analysis. Among different similarity measurements, SimRank is one of the most promising and popular. In reality, instead of computing the whole similarity matrix, people often issue SimRank queries in a pairwise manner, each of which needs to estimate an approximate SimRank value within a specified accuracy for a given pair of nodes. These pairwise SimRank queries are often processed on real-life graphs, which typically evolve over time, requiring efficient algorithms that can query pairwise SimRank under dynamic graph updates. However, current single-pair SimRank solutions are either static or inefficient in handling dynamic cases with good-quality results. Observing that the sample size is the major factor that determines the efficiency and the accuracy in Monte Carlo methods to estimate pairwise SimRank, in this paper, we propose three algorithms to query pairwise SimRank over static and dynamic graphs efficiently, by using different sample reduction strategies. The accuracy of our algorithms is guaranteed by the different invariants we propose for pairwise SimRank. We show that our algorithms outperform the state-of-the-art static and dynamic solutions for pairwise SimRank estimation.
引用
收藏
页码:99 / 122
页数:23
相关论文
共 47 条
[1]  
He J(2014)Assessing single-pair similarity over graphs by aggregating first-meeting probabilities Inf. Syst. 42 107-122
[2]  
Liu H(2017)READS: a random walk approach for efficient and accurate dynamic SimRank PVLDB 10 937-948
[3]  
Yu JX(2015)Walking in the cloud: parallel simrank at scale PVLDB 9 24-35
[4]  
Li P(2007)The link-prediction problem for social networks J. Assoc. Inf. Sci. Technol. 58 1019-1031
[5]  
He W(2008)Accuracy estimate and optimization techniques for SimRank computation VLDB 1 422-433
[6]  
Du X(2017)A novel and fast SimRank algorithm IEEE Trans. Knowl. Data Eng. 8 838-849
[7]  
Jiang M(2015)An efficient similarity search framework for SimRank over large dynamic graphs PVLDB 13 50-64
[8]  
Fu AW(2012)Survey on web spam detection: principles and algorithms SIGKDD Explor. Newsl. 8 569-580
[9]  
Wong RC(2015)Efficient partial-pairs SimRank search for large networks PVLDB 7 13-24
[10]  
Wang K(2013)More is simpler: effectively and efficiently assessing node-pair similarities based on hyperlinks PVLDB 27 79-104