Rewriting of regular expressions and regular path queries

被引:51
作者
Calvanese, D
De Giacomo, G
Lenzerini, M
Vardi, MY
机构
[1] Univ Roma La Sapienza, Dipartimento Informat & Sistemist, I-00198 Rome, Italy
[2] Rice Univ, Dept Comp Sci, Houston, TX 77251 USA
基金
美国国家科学基金会;
关键词
semistructured data; query rewriting; regular path queries; regular expressions; computational complexity;
D O I
10.1006/jcss.2001.1805
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Recent work on semi-structured data has revitalized the interest in path queries, i.e., queries that ask for all pairs of objects in the database that are connected by a path conforming to a certain specification, in particular to a regular expression. Also, in semi-structured data, as well as in data integration, data warehousing, and query optimization, the problem of view-based query rewriting is receiving much attention: Given a query and a collection of views, generate a new query which uses the views and provides the answer to the original one. In this paper we address the problem of view-based query rewriting in the context of semi-structured data. We present a method for computing the rewriting of a regular expression E in terms of other regular expressions. The method computes the exact rewriting (the one that defines the same regular language as E) if it exists, or the rewriting that defines the maximal language contained in the one defined by E, otherwise. We present a complexity analysis of both the problem and the method, showing that the latter is essentially optimal. Finally, we illustrate how to exploit the method for view-based rewriting of regular path queries in semi-structured data. The complexity results established for the rewriting of regular expressions apply also to the case of regular path queries. (C) 2002 Elsevier Science (USA).
引用
收藏
页码:443 / 465
页数:23
相关论文
共 44 条
  • [1] Querying documents in object databases
    Abiteboul S.
    Cluet S.
    Christophides V.
    Milo T.
    Moerkotte G.
    Siméon J.
    [J]. International Journal on Digital Libraries, 1997, 1 (1) : 5 - 19
  • [2] Abiteboul S., 1998, Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. PODS 1998, P254, DOI 10.1145/275487.275516
  • [3] Abiteboul S., 1997, Proceedings of the Sixteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS 1997, P122, DOI 10.1145/263661.263676
  • [4] ADALI S, 1996, P ACM SIGMOD INT C M, P137
  • [5] Afrati FN, 1999, LECT NOTES COMPUT SC, V1540, P435
  • [6] Beeri C., 1997, Proceedings of the Sixteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS 1997, P99, DOI 10.1145/263661.263673
  • [7] BOAS PV, 1982, P 1 GTI WORKSH, P75
  • [8] BOAS PV, 1997, LECT NOTES PURE APPL, V187, P331
  • [9] Buneman P., 1997, Proceedings of the Sixteenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, PODS 1997, P117, DOI 10.1145/263661.263675
  • [10] Buneman P, 1996, P ACM SIGMOD INT C M, P505