Asymptotically Efficient Algorithms for Skyline Probabilities of Uncertain Data

被引:20
作者
Atallah, Mikhail J. [1 ]
Qi, Yinian [1 ]
Yuan, Hao [2 ]
机构
[1] Purdue Univ, Dept Comp Sci, W Lafayette, IN 47907 USA
[2] City Univ Hong Kong, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
来源
ACM TRANSACTIONS ON DATABASE SYSTEMS | 2011年 / 36卷 / 02期
基金
美国国家科学基金会;
关键词
Algorithms; Uncertain data; probabilistic skyline;
D O I
10.1145/1966385.1966390
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Skyline computation is widely used in multicriteria decision making. As research in uncertain databases draws increasing attention, skyline queries with uncertain data have also been studied. Some earlier work focused on probabilistic skylines with a given threshold; Atallah and Qi [2009] studied the problem to compute skyline probabilities for all instances of uncertain objects without the use of thresholds, and proposed an algorithm with subquadratic time complexity. In this work, we propose a new algorithm for computing all skyline probabilities that is asymptotically faster: worst-case O(n root nlog n) time and O(n) space for 2D data; O(n(2-1/d) log(d-1) n) time and O(nlog(d-2) n) space for d-dimensional data. Furthermore, we study the online version of the problem: Given any query point p (unknown until the query time), return the probability that no instance in the given data set dominates p. We propose an algorithm for answering such an online query for d-dimensional data in O(n(1-1/d) log(d-1) n) time after preprocessing the data in O(n(2-1/d) log(d-1)) time and space.
引用
收藏
页数:28
相关论文
共 38 条
  • [1] GENERALIZED STRING MATCHING
    ABRAHAMSON, K
    [J]. SIAM JOURNAL ON COMPUTING, 1987, 16 (06) : 1039 - 1051
  • [2] AFSHANI P, 2011, P 14 INT C DAT THEOR
  • [3] [Anonymous], 2005, P 31 INT C VERY LARG
  • [4] [Anonymous], 2004, Proceedings of the Thirtieth international conference on Very large data bases-Volume
  • [5] [Anonymous], 2003, Proc. of the ACM SIGMOD International Conference on Management of Data, DOI DOI 10.1145/872757
  • [6] Fast and simple relational processing of uncertain data
    Antova, Lyublena
    Jansen, Thomas
    Koch, Christoph
    Olteanu, Dan
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 983 - 992
  • [7] Computing All Skyline Probabilities for Uncertain Data
    Atallah, Mikhail J.
    Qi, Yinian
    [J]. PODS'09: PROCEEDINGS OF THE TWENTY-EIGHTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2009, : 279 - 287
  • [8] Benjelloun Omar., 2006, VLDB
  • [9] MULTIDIMENSIONAL DIVIDE-AND-CONQUER
    BENTLEY, JL
    [J]. COMMUNICATIONS OF THE ACM, 1980, 23 (04) : 214 - 229
  • [10] Efficient Search for the Top-k Probable Nearest Neighbors in Uncertain Databases
    Beskales, George
    Soliman, Mohamed A.
    Ilyas, Ihab F.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 326 - 339