Simulation relations for pattern matching in directed graphs

被引:0
作者
Bjorklund, Johanna [1 ]
Ohman, Lars-Daniel [2 ]
机构
[1] Umea Univ, Dept Comp Sci, S-90187 Umea, Sweden
[2] Umea Univ, Dept Math & Math Stat, S-90187 Umea, Sweden
关键词
Pattern matching; Simulation; Graphs; Treebanks; Complexity; TREE; ALGORITHM;
D O I
10.1016/j.tcs.2013.03.021
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider the problem of finding the occurrences of a pattern tree t in a directed graph g, and propose two algorithms, one for preprocessing and one for searching for t in g. It is assumed that the object graph itself is large and static, and that the pattern tree is small and frequently updated. To model varying abstraction levels in the data, we work with partially ordered alphabets and compute simulation relations rather than equivalence relations. In particular, vertices and edges are labelled with elements from a pair of preorders instead of unstructured alphabets. Under the above assumptions, we obtain a search algorithm that runs in time O(height (t) . vertical bar t vertical bar . vertical bar(V-g(+/-)t/R-g(+/-)t vertical bar(2)) where vertical bar (V-g(+/-)t/R-g(+/-)t)vertical bar is the number of equivalence classes in the coarsest simulation relation R-g(+/-)t on the graph g((+/-))t, the disjoint union of g and t. This means that the size of the object graph only affects the running time of the search algorithm indirectly, because of the groundwork done by the preprocessing routine in time O(k . vertical bar g vertical bar . vertical bar(V-g/R-g)vertical bar(2)), where vertical bar(V-g/R-g) is the number of equivalence classes in the coarsest simulation relation R-g on g, taking k = vertical bar V-g vertical bar(2) in the general case and k = height (g) if g is acyclic. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 32 条
[1]  
Abdulla PA, 2008, LECT NOTES COMPUT SC, V4963, P93, DOI 10.1007/978-3-540-78800-3_8
[2]  
Aceto L., 2002, Algebraic Methodology and Software Technology. 9th International Conference, AMAST 2002. Proceedings (Lecture Notes in Computer Science Vol.2422), P239
[3]   Structural joins: A primitive for efficient XML query pattern matching [J].
Al-Khalifa, S ;
Jagadish, HV ;
Koudas, N ;
Patel, JM ;
Srivastava, D ;
Wu, YQ .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :141-152
[4]  
[Anonymous], 2010, METHODS BIOCH ANAL
[5]  
Baader F., 1998, Term rewriting and all that
[6]   A survey on tree edit distance and related problems [J].
Bille, P .
THEORETICAL COMPUTER SCIENCE, 2005, 337 (1-3) :217-239
[7]  
Bruno N., 2002, P 2002 ACM SIGMOD IN, P310, DOI DOI 10.1145/564691.564727
[8]   Bisimulation relations for weighted automata [J].
Buchholz, Peter .
THEORETICAL COMPUTER SCIENCE, 2008, 393 (1-3) :109-123
[9]  
Chen S., 2006, P VERY LARGE DATA BA, P283
[10]   More efficient algorithm for ordered tree inclusion [J].
Chen, WM .
JOURNAL OF ALGORITHMS, 1998, 26 (02) :370-385