Optimizing regular path expressions using graph schemas

被引:79
作者
Fernandez, M [1 ]
Suciu, D [1 ]
机构
[1] AT&T Bell Labs, Florham Park, NJ 07932 USA
来源
14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 1998年
关键词
D O I
10.1109/ICDE.1998.655753
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query languages for data with irregular structure use regular path expressions for navigation. This feature is useful for querying data where parts of the structure is either unknown, unavailable to the user, or changes frequently. Naive execution of regular path expressions is inefficient, however, because it ignores any structure in the data. We describe two optimization techniques for queries with regular path expressions. Both rely on graph schemas for specifying partial knowledge about the data's structure. Query pruning uses this structure to restrict navigation to only a fragment of the data; we give an efficient algorithm for rewriting any regular path expression query into a pruned one. Query rewriting using state extents can eliminate or reduce navigation altogether; it is reminiscent of optimizing relational queries using indices. There may be several ways to optimize a query using state extents; we give a polynomial-space algorithm that finds all such optimizations. For restricted forms of regular path expressions, the algorithm is provably efficient. We also give an efficient approximation algorithm that works on all regular path expressions.
引用
收藏
页码:14 / 23
页数:10
相关论文
empty
未找到相关数据