Indexing Uncertain Data

被引:37
作者
Agarwal, Pankaj K. [1 ]
Cheng, Siu-Wing [1 ]
Tao, Yufei [1 ]
Yi, Ke [1 ]
机构
[1] Duke Univ, Durham, NC 27710 USA
来源
PODS'09: PROCEEDINGS OF THE TWENTY-EIGHTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS | 2009年
关键词
Indexing; uncertain data; range query;
D O I
10.1145/1559795.1559816
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Querying uncertain data has emerged as an important problem in data management due to the imprecise nature of many measurement data. In this paper we study answering range queries over uncertain data. Specifically, we are given a collection P of n points in R, each represented by its one-dimensional probability density function (pdf). The goal is to build an index on P such that given a query interval I and a probability threshold tau, we can quickly report all points of P that lie in I with probability at least tau. We present various indexing schemes with linear or near-linear space and logarithmic query time. Our schemes support; pdf's that are either histograms or more complex ones such as Gaussian or piecewise algebraic. They also extend to the external memory model in which the goal is to minimize the number of disk accesses when querying the index.
引用
收藏
页码:137 / 146
页数:10
相关论文
共 25 条
[1]  
Afshani P, 2009, PROCEEDINGS OF THE TWENTIETH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P180
[2]  
Agarwal P.K., 1999, ADV DISCRETE COMPUTA, V223, P1, DOI [10.1090/conm/223/03131, DOI 10.1090/conm/223/03131]
[3]   DYNAMIC HALF-SPACE RANGE REPORTING AND ITS APPLICATIONS [J].
AGARWAL, PK ;
MATOUSEK, J .
ALGORITHMICA, 1995, 13 (04) :325-345
[4]  
Agarwal PK, 2000, HANDBOOK OF COMPUTATIONAL GEOMETRY, P1, DOI 10.1016/B978-044482537-7/50002-4
[5]   Efficient searching with linear constraints [J].
Agarwal, PK ;
Arge, L ;
Erickson, J ;
Franciosa, PG ;
Vitter, JS .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2000, 61 (02) :194-216
[6]   THE INPUT OUTPUT COMPLEXITY OF SORTING AND RELATED PROBLEMS [J].
AGGARWAL, A ;
VITTER, JS .
COMMUNICATIONS OF THE ACM, 1988, 31 (09) :1116-1127
[7]  
Agrawal P., 2006, VLDB, P1151
[8]  
[Anonymous], 2005, P 31 INT C VERY LARG
[9]  
[Anonymous], 2004, Proceedings of the Thirtieth international conference on Very large data bases-Volume
[10]  
Aronov B., 2007, SCG '07, P327