Join synopses for approximate query answering

被引:0
作者
Acharya, S [1 ]
Gibbons, PB [1 ]
Poosala, V [1 ]
Ramaswamy, S [1 ]
机构
[1] AT&T Bell Labs, Informat Sci Res Ctr, Murray Hill, NJ 07974 USA
来源
SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 1999年
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In large data warehousing environments, it is often advantageous to provide fast, approximate answers to complex aggregate queries based on statistical summaries of the full data. In this paper, we demonstrate the difficulty of providing good approximate answers for join-queries using only statistics (in particular, samples) from the base relations. We propose join synopses as an effective solution for this problem and show how precomputing just one join synopsis for each relation suffices to significantly improve the quality of approximate answers for arbitrary queries with foreign key joins. We present optimal strategies for allocating the available space among the various join synopses when the query work load is known and identify heuristics for the common case when the work load is not known. We also present efficient algorithms for incrementally maintaining join synopses in the presence of updates to the base relations. Our extensive set of experiments on the TPC-D benchmark database show the effectiveness of join synopses and various other techniques proposed in this paper.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 18 条
  • [1] ACHARYA S, 1999, P ACM SIGMOD INT C M
  • [2] Acharya S., 1999, SIGMOD 99
  • [3] Alon N., 1996, Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, P20, DOI 10.1145/237814.237823
  • [4] [Anonymous], P ACM SIGMOD INT C M
  • [5] [Anonymous], B TECHNICAL COMMITTE
  • [6] CHEN CM, 1994, P ACM SIGMOD C MAY, P161
  • [7] GANGULY S, 1996, P ACM SIGMOD INT C M, P271
  • [8] GIBBONS P, 1997, AQUA PROJECT WHITE P
  • [9] Gibbons PB, 1997, PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, P466
  • [10] GIBBONS PB, 1999, SELECTING ESTIMATION