Probabilistic n-of-N skyline computation over uncertain data streams

被引:0
作者
Wenjie Zhang
Aiping Li
Muhammad Aamir Cheema
Ying Zhang
Lijun Chang
机构
[1] University of New South Wales,
[2] National University of Defense Technology,undefined
来源
World Wide Web | 2015年 / 18卷
关键词
Skyline; Stream; Query processing; Uncertain;
D O I
暂无
中图分类号
学科分类号
摘要
Skyline operator is a useful tool in multi-criteria decision making in various applications. Uncertainty is inherent in real applications due to various reasons. In this paper, we consider the problem of efficiently computing probabilistic skylines against the most recent N uncertain elements in a data stream seen so far. Specifically, we study the problem in the n-of-N model; that is, computing the probabilistic skyline for the most recent n (∀ n ≤ N) elements, where an element is a probabilistic skyline element if its skyline probability is not below a given probability threshold q. Firstly, an effective pruning technique to minimize the number of uncertain elements to be kept is developed. It can be shown that on average storing only O(logdN) uncertain elements from the most recent N elements is sufficient to support the precise computation of all probabilistic n-of-N skyline queries in a d-dimension space if the data distribution on each dimension is independent. A novel encoding scheme is then proposed together with efficient update techniques so that computing a probabilistic n-of-N skyline query in a d-dimension space is reduced to O(dloglogN + s) if the data distribution is independent, where s is the number of skyline points. A trigger based technique is provided to process continuous n-of-N skyline queries. Extensive experiments demonstrate that the new techniques on uncertain data streams can support on-line probabilistic skyline query computation over rapid data streams.
引用
收藏
页码:1331 / 1350
页数:19
相关论文
empty
未找到相关数据