A measure of similarity between graph vertices: Applications to synonym extraction and web searching

被引:230
作者
Blondel, VD
Gajardo, A
Heymans, M
Senellart, P
Van Dooren, P
机构
[1] Catholic Univ Louvain, Div Appl Math, B-1348 Louvain, Belgium
[2] Univ Concepcion, Dept Ingn Matemat, Concepcion, Chile
[3] Google Inc, Mountain View, CA 94043 USA
[4] Ecole Normale Super, Dept Comp Sci, F-75230 Paris 05, France
关键词
algorithms; graph algorithms; graph theory; eigenvalues of graphs;
D O I
10.1137/S0036144502415960
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We introduce a concept of similarity between vertices of directed graphs. Let CA and G(B) be two directed graphs with, respectively, n(A) and n(B) vertices. We define an n(B) x n(A) similarity matrix S whose real entry s(ij) expresses how similar vertex j (in G(A)) is to vertex i (in G(B)): we say that s(ij) is their similarity score. The similarity matrix can be obtained as the limit of the normalized even iterates of Sk+1 = BS(k)A(T) + B(T)S(k)A, where A and B are adjacency matrices of the graphs and So is a matrix whose entries are all equal to 1. In the special case where G(A) = G(B) = G, the matrix S is square and the score s(ij) is the similarity score between the vertices i and j of G. We point out that Klemberg's "hub and authority" method to identify web-pages relevant to a given query can be viewed as a special case of our definition in the case where one of the graphs has two vertices and a unique directed edge between them. In analogy to Kleinberg, we show that our similarity scores are given by the components of a dominant eigenvector of a nonnegative matrix. Potential applications of our similarity concept are numerous. We illustrate an application for the automatic extraction of synonyms in a monolingual dictionary.
引用
收藏
页码:647 / 666
页数:20
相关论文
共 21 条
  • [1] Bermudez A. J., 1994, SAVMA Symposium 1994 Proceedings., P1
  • [2] Biggs N., 1993, ALGEBRAIC GRAPH THEO
  • [3] Blondel VD, 2003, LECT NOTES COMPUT SC, V2719, P739
  • [4] BLONDEL VD, 2002, P SIAM TEXT MIN WORK
  • [5] Bollobas B., 2001, CAMBRIDGE STUDIES AD, V73
  • [6] Chung FR., 1997, Spectral graph theory
  • [7] Cvetkovic D., 1995, Spectra of Graphs-Theory and Application, V3rd ed.
  • [8] CVETKOVIC D, 1997, EIGENSPACES GRAPHS
  • [9] CONNECTEDNESS OF PRODUCTS OF 2 DIRECTED GRAPHS
    HARARY, F
    TRAUTH, CA
    [J]. SIAM JOURNAL ON APPLIED MATHEMATICS, 1966, 14 (02) : 250 - &
  • [10] CHAINABLE MATRIX, A SPECIAL COMBINATORIAL MATRIX
    HARTFIEL, DJ
    MAXSON, CJ
    [J]. DISCRETE MATHEMATICS, 1975, 12 (03) : 245 - 256