Selecting materialized views using random algorithm

被引:1
作者
Zhou, Lijuan [1 ,3 ]
Hao, Zhongxiao [1 ,2 ,3 ,4 ]
Liu, Chi [2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Capital Normal univ, Beijing 100037, Peoples R China
[3] Harbin Univ Sci & Technol, Harbin 150080, Peoples R China
[4] Qiqihar Univ, Qiqihar, Heilongjiang, Peoples R China
来源
DATA MINING, INTRUSION DETECTION, INFORMATION ASSURANCE, AND DATA NETWORKS SECURITY 2007 | 2007年 / 6570卷
关键词
data warehouse; materialized view; OLAP;
D O I
10.1117/12.716612
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The data warehouse is a repository of information collected from multiple, possibly heterogeneous, autonomous, distributed databases. The information stored at the data warehouse is in form of views, referred to as materialized views. The selection of the materialized views is one of the most important decisions in designing a data warehouse. Materialized views are stored in the data warehouse for the purpose of efficiently implementing on-line analytical processing queries. The first issue for the user to consider is query response time. So, in this paper, we develop algorithms to select a set of views to materialize in data warehouse in order to minimize the total view maintenance cost under the constraint of a given query response time. We call it query_cost view_selection problem. First, cost graph and cost model of query_cost view_selection problem are presented. Second, the methods for selecting materialized views by using random algorithms are presented. The genetic algorithm is applied to the materialized views selection problem. But with the development of genetic process, the legal solution produced become more and more difficult, so a lot of solutions are eliminated and producing time of the solutions is lengthened in genetic algorithm. Therefore, improved algorithm has been presented in this paper, which is the combination of simulated annealing algorithm and genetic algorithm for the purpose of solving the query cost view selection problem. Finally, in order to test the function and efficiency of our algorithms, experiment simulation is adopted. The experiments show that the given methods can provide near-optimal solutions in limited time and woks better in practical cases. Randomized algorithms will become invaluable tools for data warehouse evolution.
引用
收藏
页数:8
相关论文
共 10 条
  • [1] AGRAWAL S, 2000, P 26 INT C VER LARG, P496
  • [2] A formal perspective on the view selection problem
    Chirkova, R
    Halevy, AY
    Suciu, D
    [J]. VLDB JOURNAL, 2002, 11 (03) : 216 - 237
  • [3] CHIRKOVA R, 2003, PODS, P38
  • [4] CZEJDO B, 2001, P SO C COMP OCT
  • [5] GRUMBACH S, 2000, P 19 ACM SIGACT SIGM, P45
  • [6] GUPTA A, 2001, ADAPTING MAT VIEWS R
  • [7] THEODIRATOS D, 2000, P ACM 3 INT WORKSH D
  • [8] VALLURI SR, 2002, 13 ADC 2002 MELB AUS, V5
  • [9] ZHANG C, 2001, IEEE T SYSTEMS MAN C
  • [10] ZHOU L, 2003, IS T SPIE 15 ANN S S, V5021