Configuring bitmap materialized views for optimizing XML queries

被引:6
作者
Wu, Xiaoying [1 ]
Theodoratos, Dimitri [2 ]
Kementsietsidis, Anastasios [3 ]
机构
[1] Wuhan Univ, State Key Lab Software Engn, Wuhan 430072, Peoples R China
[2] New Jersey Inst Technol, Newark, NJ 07102 USA
[3] IBM Res, Yorktown Hts, NY USA
来源
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2015年 / 18卷 / 03期
基金
中国国家自然科学基金;
关键词
XPath query evaluation; XML; Materialized views; View configuration; SELECTION;
D O I
10.1007/s11280-013-0272-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years the inverted lists evaluation model along with holistic stack-based algorithms have been established as the most prominent techniques for evaluating XML queries on large persistent XML data. In this framework, we are using materialized views for optimizing XML queries. We consider a novel approach which instead of materializing the answer of a view materializes exactly the inverted sublists that are necessary for computing the answer of the view. This originality allows storing view materializations as compressed bitmaps, a solution that minimizes the materialization space and empowers performing optimization operations as CPU-efficient bitwise operations. To realize the potential of bitmap materialized views in optimizing query performance, we define and address the following problem (view configuration problem): given an XML tree and its schema find a template of tree-pattern views (view configuration) such that: (a) the views of this configuration can answer all the queries that can be issued against the schema, (b) their materialization fits in the space provided, and (c) evaluating the queries using these views minimizes the overall query evaluation cost. We consider an instance of this problem for tree pattern queries. Our intension is to find view configurations whose materializations are small enough to be stored in main memory. We find two candidate solution configurations and we identify cases where views can be excluded from materialization in a configuration without affecting query performance. In order to compare our approach with an approach which also can support the optimization of every query on the schema, we implemented an improvement of a state-of-the-art approach which is based on structural indexes. Our experimental results show that our approach is stable, greatly improves evaluating queries without materialized views, outperforms the structural index approach on all test cases and is very close to the optimal. These results characterize our approach as the best candidate for supporting the optimization of queries in the framework of the inverted lists model.
引用
收藏
页码:607 / 632
页数:26
相关论文
共 39 条
[1]  
Agrawal Sanjay, 2000, VLDB
[2]  
Arion A., 2007, VLDB, P87
[3]  
Balmin A., 2004, VLDB, P60
[4]  
Barta A., 2005, VLDB, P133
[5]  
Bello R. G., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P659
[6]  
Bruno Nicolas., 2002, P 2002 ACM SIGMOD IN, P310, DOI DOI 10.1145/564691.564727
[7]  
Chaudhuri S., 1996, Advances in Database Technology - EDBT '96. 5th International Conference on Extending Database Technology. Proceedings, P167, DOI 10.1007/BFb0014151
[8]  
CHAUDHURI S, 1995, PROC INT CONF DATA, P190, DOI 10.1109/ICDE.1995.380392
[9]   ViewJoin: Efficient View-based Evaluation of Tree Pattern Queries [J].
Chen, Ding ;
Chan, Chee-Yong .
26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, :816-827
[10]   A formal perspective on the view selection problem [J].
Chirkova, R ;
Halevy, AY ;
Suciu, D .
VLDB JOURNAL, 2002, 11 (03) :216-237