APPROXIMATE TREE MATCHING IN THE PRESENCE OF VARIABLE-LENGTH DONT CARES

被引:55
作者
ZHANG, KZ
SHASHA, D
WANG, JTL
机构
[1] NYU,COURANT INST MATH SCI,NEW YORK,NY 10012
[2] NEW JERSEY INST TECHNOL,DEPT COMP & INFORMAT SCI,NEWARK,NJ 07102
关键词
D O I
10.1006/jagm.1994.1003
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Ordered labeled trees are trees in which the sibling order matters. This paper presents algorithms for three problems having to do with approximate matching for such trees with variable length don′t cares (VLDCs). In strings, a VLDC symbol in the pattern may substitute for zero or more symbols in the data string. For example, if “com*er” is the pattern, then the “*” would substitute for the substring “put” when matching the data string “computer.” Approximate VLDC matching in strings means that after the best possible substitution, the pattern still need not be the same as the data string for a match to be allowed. For example, “com*er” matches “counter” within distance one (representing the cost of removing the “m” from “com*er” and having the “*” substitute for “unt”). We generalize approximate VLDC string matching to three algorithms for approximate VLDC matching on trees. The time complexity of our algorithms is O(|P| × |D| × min(depth(P), leaves(P)) × min(depth(D), leaves(D))) (where |P| and |D| are the number of nodes respectively of the pattern P and the data tree D), the same as for the best approximate tree matching algorithm without VLDCs previously reported. © 1994 Academic Press, Inc.
引用
收藏
页码:33 / 66
页数:34
相关论文
共 27 条
[1]   CODE GENERATION USING TREE MATCHING AND DYNAMIC-PROGRAMMING [J].
AHO, AV ;
GANAPATHI, M ;
TJIANG, SWK .
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS, 1989, 11 (04) :491-516
[2]  
BYRD R, 1990, USER NOTES INFORMATL
[3]  
Chodorow M. S., 1985, 23rd Annual Meeting of the Association for Computational Linguistics. Proceedings of the Conference, P299
[4]  
CHODOROW MS, 1990, LOCATING SYNTACTIC P
[5]  
DUBINER M, 1990, 31ST P IEEE S F COMP, P145
[6]   AN IMPROVED ALGORITHM FOR APPROXIMATE STRING MATCHING [J].
GALIL, Z ;
PARK, K .
SIAM JOURNAL ON COMPUTING, 1990, 19 (06) :989-999
[7]   PATTERN-MATCHING IN TREES [J].
HOFFMANN, CM ;
ODONNELL, MJ .
JOURNAL OF THE ACM, 1982, 29 (01) :68-95
[8]  
KOSARAJU SR, 1989, 30TH P IEEE S F COMP, P178
[9]   FAST PARALLEL AND SERIAL APPROXIMATE STRING MATCHING [J].
LANDAU, GM ;
VISHKIN, U .
JOURNAL OF ALGORITHMS, 1989, 10 (02) :157-169
[10]  
Markowitz J., 1986, 24th Annual Meeting of the Association for Computational Linguistics. Proceedings of the Conference, P112