SUM-optimal histograms for approximate query processing

被引:1
作者
Zhang, Meifan [1 ]
Wang, Hongzhi [1 ]
Li, Jianzhong [1 ]
Gao, Hong [1 ]
机构
[1] Harbin Inst Technol, Dept Comp Sci & Technol, Harbin, Peoples R China
关键词
Approximate query processing; Histogram; Big data; ALGORITHMS;
D O I
10.1007/s10115-020-01450-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the problem of the SUM query approximation with histograms. We define a new kind of histogram called the SUM-optimal histogram which can provide better estimation result for the SUM queries than the traditional equi-depth and V-optimal histograms. We propose three methods for the histogram construction. The first one is a dynamic programming method, and the other two are approximate methods. We use a greedy strategy to insert separators into a histogram and use the stochastic gradient descent method to improve the accuracy of separators. The experimental results indicate that our method can provide better estimations for the SUM queries than the equi-depth and V-optimal histograms.
引用
收藏
页码:3155 / 3180
页数:26
相关论文
共 42 条
  • [1] Fast and Near-Optimal Algorithms for Approximating Distributions by Histograms
    Acharya, Jayadev
    Diakonikolas, Ilias
    Hegde, Chinmay
    Li, Jerry
    Schmidt, Ludwig
    [J]. PODS'15: PROCEEDINGS OF THE 33RD ACM SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2015, : 249 - 263
  • [2] Acharya S, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P574, DOI 10.1145/304181.304581
  • [3] Acharya S, 2000, SIGMOD REC, V29, P487
  • [4] Agarwal S., 2013, P 8 ACM EUR C COMP S, P29
  • [5] Agrawal R, 1995, COMAD
  • [6] [Anonymous], 2008, Proceedings of the 2008 ACM SIGMOD international conference on Management of data
  • [7] [Anonymous], 1996, ACM SIGMOD RECORD
  • [8] [Anonymous], 2012, P 31 ACM SIGMOD SIGA, DOI DOI 10.1145/2213556.2213561
  • [9] A quad-tree based multiresolution approach for two-dimensional summary data
    Buccafurri, F.
    Furfaro, F.
    Mazzeo, G. M.
    Sacca, D.
    [J]. INFORMATION SYSTEMS, 2011, 36 (07) : 1082 - 1103
  • [10] Enhancing histograms by tree-like bucket indices
    Buccafurri, Francesco
    Lax, Gianluca
    Sacca, Domenico
    Pontieri, Luigi
    Rosaci, Domenico
    [J]. VLDB JOURNAL, 2008, 17 (05) : 1041 - 1061