A novel way of computing similarities between nodes of a graph, with application to collaborative recommendation

被引:40
作者
Fouss, F [1 ]
Pirotte, A [1 ]
Saerens, M [1 ]
机构
[1] Univ Catholique Louvain, Informat Syst Res Unit, ISYS IAG, B-1348 Louvain, Belgium
来源
2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings | 2005年
关键词
D O I
10.1109/WI.2005.9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work presents a new perspective on characterizing the similarity between elements of a database or more generally, nodes of a weighted, undirected, graph. It is based on a Markov-chain model of random walk through the database. The suggested quantities, representing dissimilarities (or similarities) between any two elements, have the nice property of decreasing (increasing) when the number of paths connecting those elements increases and when the "length" of any path decreases. The model is evaluated on a collaborative recommendation task where suggestions are made about which movies people should watch based upon what they watched in the past. The model, which nicely fits into the so-called "statistical relational learning" framework as well as the "link analysis" paradigm, could also be used to compute document or word similarities, and, more generally, could be applied to other database or web mining tasks.
引用
收藏
页码:550 / 556
页数:7
相关论文
共 8 条
[1]  
[Anonymous], STOCH PROC APPL E
[2]  
DZEROSKI S, 2003, ACM SIGKDD EXPLORATI, V5, P1
[3]   Applying associative retrieval techniques to alleviate the sparsity problem in collaborative filtering [J].
Huang, Z ;
Chen, H ;
Zeng, D .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (01) :116-142
[4]  
Katz L., 1953, PSYCHOMETRIKA, V18, P39
[5]   RESISTANCE DISTANCE [J].
KLEIN, DJ ;
RANDIC, M .
JOURNAL OF MATHEMATICAL CHEMISTRY, 1993, 12 (1-4) :81-95
[6]   A survey of eigenvector methods for Web information retrieval [J].
Langville, AN ;
Meyer, CD .
SIAM REVIEW, 2005, 47 (01) :135-161
[7]  
Saerens M, 2004, LECT NOTES COMPUT SC, V3201, P371
[8]  
White S., 2003, P 9 ACM SIGKDD INT C, P266, DOI DOI 10.1145/956750.956782