Approximating multi-dimensional aggregate range queries over real attributes

被引:1
|
作者
Gunopulos, D [1 ]
Kollios, G [1 ]
Domeniconi, C [1 ]
Tsotras, VJ [1 ]
机构
[1] Univ Calif Riverside, Riverside, CA 92521 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finding approximate answers to multi-dimensional range queries over real valued attributes has significant applications in data exploration and database query optimization. In this paper we consider the following problem: given a table of d attributes whose domain is the real numbers, and a query that specifies a range in each dimension, find a good approximation of the number of records in the table that satisfy the query. We present a new histogram technique that is designed to approximate the density of multi-dimensional datasets with real attributes. Our technique finds buckets of variable size, and allows the buckets to overlap. Overlapping buckets allow more efficient approximation of the density. The size of the cells is based on the local density of the data. This technique leads to a faster and more compact approximation of the data distribution. We also show how to generalize kernel density estimators, and how to apply them on the multi-dimensional query approximation problem. Finally, we compare the accuracy of the proposed techniques with existing techniques using real and synthetic datasets.
引用
收藏
页码:463 / 474
页数:12
相关论文
共 50 条
  • [1] Aggregate aware caching for multi-dimensional queries
    Deshpande, PM
    Naughton, JF
    ADVANCES IN DATABASE TECHNOLOGY-DEBT 2000, PROCEEDINGS, 2000, 1777 : 167 - 182
  • [2] ADenTS: An Adaptive Density-based Tree Structure for approximating aggregate queries over real attributes
    Wu, TY
    Xu, J
    Wang, C
    Wang, W
    Shi, BL
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 529 - 538
  • [3] A structured overlay for multi-dimensional range queries
    Schuett, Thorsten
    Schintke, Florian
    Reinefeld, Alexander
    EURO-PAR 2007 PARALLEL PROCESSING, PROCEEDINGS, 2007, 4641 : 503 - +
  • [4] Box queries over multi-dimensional streams
    Friedman, Roy
    Shahout, Rana
    INFORMATION SYSTEMS, 2022, 109
  • [5] Selectivity estimators for multidimensional range queries over real attributes
    Dimitrios Gunopulos
    George Kollios
    Vassilis J. Tsotras
    Carlotta Domeniconi
    The VLDB Journal, 2005, 14 : 137 - 154
  • [6] Selectivity estimators for multidimensional range queries over real attributes
    Gunopulos, D
    Kollios, G
    Tsotras, VJ
    Domeniconi, C
    VLDB JOURNAL, 2005, 14 (02): : 137 - 154
  • [7] Data storage in sensor networks for multi-dimensional range queries
    Lee, JY
    Lim, YH
    Chung, YD
    Kim, MH
    EMBEDDED SOFTWARE AND SYSTEMS, PROCEEDINGS, 2005, 3820 : 420 - 429
  • [8] Fast Multi-dimensional Range Queries on Encrypted Cloud Databases
    Chi, Jialin
    Hong, Cheng
    Zhang, Min
    Zhang, Zhenfeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), PT I, 2017, 10177 : 559 - 575
  • [9] Predicate Encryption for Multi-dimensional Range Queries from Lattices
    Gay, Romain
    Meaux, Pierrick
    Wee, Hoeteck
    PUBLIC-KEY CRYPTOGRAPHY - PKC 2015, 2015, 9020 : 752 - 776
  • [10] Precisely answering multi-dimensional range queries without privacy breaches
    Wang, LY
    Li, YJ
    Wijesekera, D
    Jajodia, S
    COMPUTER SECURITY - ESORICS 2003, PROCEEDINGS, 2003, 2808 : 100 - 115