JacSim*: An Effective and Efficient Solution to the Pairwise Normalization Problem in SimRank

被引:2
作者
Hamedani, Masoud Reyhani [1 ]
Kim, Sang-Wook [2 ]
机构
[1] Hanyang Univ, Dept Comp Sci, BK21PLUS Program Adv AI Res & Educ, Seoul 04763, South Korea
[2] Hanyang Univ, Dept Comp Sci, Seoul 04763, South Korea
基金
新加坡国家研究基金会;
关键词
Sparse matrices; Time complexity; Social networking (online); Philosophical considerations; Licenses; Directed graphs; Web pages; Link-based similarity; pairwise normalization problem; similarity computation; SimRank; SIMILARITY MEASURE; LINK;
D O I
10.1109/ACCESS.2021.3123114
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the fact that SimRank has been successfully applied to various applications as a link-based similarity measure, it suffers from a counter-intuitive property called a pairwise normalization problem; JacSim is a powerful variant of SimRank that alleviates this problem. In this paper, we first point out three existing drawbacks of JacSim and then propose JacSim* to effectively solve them; JacSim* exploits those paths neglected by JacSim in similarity computation, its matrix form provides the exact similarity scores while not being sensitive to the number of node-pairs with common neighbors, and it has simpler, easier to understand, and easier to implement formulas in both iterative and matrix forms than those of JacSim. We conduct extensive experiments with eight real-world datasets to evaluate both the accuracy and performance of JacSim* in comparison with those of JacSim. Our experimental results demonstrate that JacSim* shows better accuracy than JacSim and the JacSim* matrix form is dramatically faster than its own iterative form and also than the two forms of JacSim with all datasets.
引用
收藏
页码:146038 / 146049
页数:12
相关论文
共 35 条
[1]  
[Anonymous], 2010, KDD
[2]  
[Anonymous], 2008, Introduction to information retrieval
[3]  
[Anonymous], 2005, P 14 INT C WORLD WID, DOI DOI 10.1145/1060745.1060839
[4]   Simrank++: Query Rewriting through Link Analysis of the Click Graph [J].
Antonellis, Ioannis ;
Molina, Hector Garcia ;
Chang, Chi Chao .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01) :408-421
[5]  
Bai YS, 2020, AAAI CONF ARTIF INTE, V34, P3219
[6]  
Cai YZ, 2008, LECT NOTES ARTIF INT, V5139, P317
[7]   Efficient structural node similarity computation on billion-scale graphs [J].
Chen, Xiaoshuang ;
Lai, Longbin ;
Qin, Lu ;
Lin, Xuemin .
VLDB JOURNAL, 2021, 30 (03) :471-493
[8]  
Fujiwara Y, 2013, PROC INT CONF DATA, P589, DOI 10.1109/ICDE.2013.6544858
[9]   JacSim: An accurate and efficient link-based similarity measure in graphs [J].
Hamedani, Masoud Reyhani ;
Kim, Sang-Wook .
INFORMATION SCIENCES, 2017, 414 :203-224
[10]   SimCC: A novel method to consider both content and citations for computing similarity of scientific papers [J].
Hamedani, Masoud Reyhani ;
Kim, Sang-Wook ;
Kim, Dong-Jin .
INFORMATION SCIENCES, 2016, 334 :273-292