On the memory requirements of XPath evaluation over XML streams

被引:16
作者
Bar-Yossef, Ziv [1 ]
Fontoura, Marcus
Josifovski, Vanja
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
[2] IBM Corp, Almaden Res Ctr, San Jose, CA 95120 USA
[3] Yahoo Res Labs, Sunnyvale, CA 94089 USA
关键词
XPath; XML; data stream processing; space lower bounds; communication complexity;
D O I
10.1016/j.jcss.2006.10.002
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The important challenge of evaluating XPath queries over XML streams has sparked much interest in the past few years. A number of algorithms have been proposed, supporting wider fragments of the query language, and exhibiting better performance and memory utilization. Nevertheless, all the algorithms known to date use a prohibitively large amount of memory for certain types of queries. A natural question then is whether this memory bottleneck is inherent or just an artifact of the proposed algorithms. In this paper we initiate the first systematic and theoretical study of lower bounds on the amount of memory required to evaluate XPath queries over XML streams. We present a general lower bound technique, which given a query, specifies the minimum amount of memory that any algorithm evaluating the query on a stream would need to incur. The lower bounds are stated in terms of new graph-theoretic properties of queries. The proofs are based on tools from communication complexity. We then exploit insights learned from the lower bounds to obtain a new algorithm for XPath evaluation on streams. The algorithm uses space close to the optimum. Our algorithm deviates from the standard paradigm of using automata or transducers, thereby avoiding the need to store large transition tables. (c) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:391 / 441
页数:51
相关论文
共 29 条
[1]  
ALTINEL M., 2000, VLDB 2000 P 26 INT C, P53
[2]  
[Anonymous], 1997, COMMUNICATION COMPLE
[3]  
[Anonymous], P 21 ACM SIGMOD SIGA
[4]  
AVILACAMPILLO I, 2002, P 1 WORKSH PROGR LAN
[5]  
Bar-Yossef Z., 2004, P 23 ACM S PRINC DAT, P177
[6]  
Bar-Yossef Ziv, 2005, P 24 ACM S PRINC DAT, P216
[7]  
BERGLUND A, 2004, XML PATH LANGUAGE XP
[8]  
BOAG S, 2003, XQUERY 1 0 XML QUERY
[9]  
Bray Tim, 1998, Extensible markup language
[10]  
CHAMBERLIN D, 2005, XML QUERY USE CASES