A dynamic data structure for top-k queries on uncertain data

被引:3
作者
Chen, Jiang [2 ]
Yi, Ke [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Kowloon, Hong Kong, Peoples R China
[2] Yahoo Inc, Sunnyvale, CA 94089 USA
关键词
Uncertain data; Dynamic data structures;
D O I
10.1016/j.tcs.2008.06.016
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In an uncertain data set s = (S, p,f) where S is the ground set consisting of n elements, p : S -> [0, 1] a probability function, and f : S -> R a score function, each element i epsilon S with score f(i) appears independently with probability p(i). The top-k query on s asks for the set of k elements that has the maximum probability of appearing to be the k elements with the highest scores in a random instance of s. Computing the top-k answer on a fixed S is known to be easy. In this paper, we consider the dynamic problem, that is, how to maintain the top-k query answer when 8 changes, including element insertions and deletions in the ground set S, changes in the probability function p and in the score function f. We present a fully dynamic data structure that handles an update in O(k log n) time, and answers a top-j query in O(log n + j) time for any j <= k. The structure has O(n) size and can be constructed in O(n log k) time. As a building block of our dynamic structure, we present an algorithm for the all-top-k problem, that is, computing the top-j answers for all j = 1, ..., k, which may be of independent interest. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:310 / 317
页数:8
相关论文
共 15 条
[1]  
AGGARWAL A, 1987, GEOMETRIC APPL MATRI
[2]  
Agrawal P., 2006, P VLDB 2006 DEM DESC
[3]  
ANANTHAKRISHNA R, 2002, P INT C VER LARG DAT
[4]  
[Anonymous], P INT C DAT ENG
[5]  
Benjelloun O, 2006, P VER LARG DAT BAS
[6]  
CHAUCLHURI S, 2003, P ACM SIGMOD INT C M
[7]  
CHEN J, 2007, P INT S ALG COMP
[8]  
Dalvi Nilesh N., 2004, P INT C VER LARG DAT
[9]  
DESHPANDE A, 2004, P INT C VER LARG DAT
[10]   MAKING DATA-STRUCTURES PERSISTENT [J].
DRISCOLL, JR ;
SARNAK, N ;
SLEATOR, DD ;
TARJAN, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1989, 38 (01) :86-124