Probabilistic Aggregate Skyline Join Queries: Skylines with Aggregate Operations over Existentially Uncertain Relations

Cited by: 0
Authors
Bhattacharya, Arnab [1]
Awate, Shrikant [1]
Affiliations
[1] Indian Inst Technol, Dept Comp Sci & Engn, Kanpur, Uttar Pradesh, India
Source
PROCEEDINGS OF THE 27TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT | 2015
Keywords
Skyline Query; Uncertain Database; Existential Uncertainty; Multiple Preferences; Join; Aggregation;
DOI
10.1145/2791347.2791350
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Multi-criteria decision making, made possible by the advent of skyline queries, has been successfully applied in many areas. Although most earlier work concerns only a single relation, several real-world applications require finding the skyline set over multiple relations. Consequently, the join operation over skylines, where the preferences are local to each relation and/or on aggregated values of attributes from different relations, has been proposed. Meanwhile, uncertain datasets are finding increasing application in many scientific and real-life situations. Skyline computation for such datasets becomes even more challenging because every object can be classified as a skyline object with some probability. In this paper, we introduce probabilistic aggregate skyline join queries (PASJQ), which ask for objects whose probability of being a skyline object in the join of two uncertain relations exceeds a query probability threshold. The skyline preferences are on both local and aggregate attributes. Since the naive algorithm can be impractical, we propose three algorithms to process such queries efficiently. The algorithms process the skylines locally as far as possible before computing the join, which reduces the computational burden of finding skylines in the larger joined relation. Experiments with real and synthetic data demonstrate the practicality and scalability of these algorithms with respect to the query probability threshold, cardinality, dimensionality, and other parameters of the uncertain relations.
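To make the query semantics concrete, the following is a minimal brute-force sketch of a PASJQ, not one of the three algorithms proposed in the paper. It assumes (none of this is specified in the abstract) that tuples exist independently, that lower attribute values are preferred, that the join is an equality join on a key, and that aggregate attributes are combined by summation. The function names and the dictionary layout (`key`, `local`, `agg`, `p`) are hypothetical. The sketch enumerates every possible world, so it is exponential in the number of base tuples and only usable on toy inputs.

```python
from itertools import product

def dominates(a, b):
    """True if a is no worse than b in every dimension and strictly better in one (lower is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def joined_vector(u, v):
    """Preference vector of a joined tuple: both sets of local attributes plus summed aggregates."""
    return u["local"] + v["local"] + tuple(x + y for x, y in zip(u["agg"], v["agg"]))

def pasjq_bruteforce(R1, R2, threshold):
    """Return {(i, j): skyline probability} for joined tuples whose probability exceeds threshold."""
    # Candidate joined tuples: index pairs whose join keys match (assumed equality join).
    pairs = [(i, j) for i, u in enumerate(R1) for j, v in enumerate(R2) if u["key"] == v["key"]]
    vec = {p: joined_vector(R1[p[0]], R2[p[1]]) for p in pairs}
    sky_prob = {p: 0.0 for p in pairs}
    base = R1 + R2
    n1 = len(R1)
    # Enumerate all possible worlds: each base tuple is either present or absent.
    for world in product([False, True], repeat=len(base)):
        w_prob = 1.0
        for present, t in zip(world, base):
            w_prob *= t["p"] if present else 1.0 - t["p"]
        if w_prob == 0.0:
            continue
        # A joined tuple exists in this world only if both of its base tuples exist.
        alive = [p for p in pairs if world[p[0]] and world[n1 + p[1]]]
        for p in alive:
            if not any(dominates(vec[q], vec[p]) for q in alive if q != p):
                sky_prob[p] += w_prob
    return {p: pr for p, pr in sky_prob.items() if pr > threshold}

# Toy data (made up for illustration): two uncertain tuples in R1, one certain tuple in R2.
R1 = [
    {"key": "X", "local": (1,), "agg": (10,), "p": 0.5},
    {"key": "X", "local": (2,), "agg": (20,), "p": 0.5},
]
R2 = [{"key": "X", "local": (1,), "agg": (5,), "p": 1.0}]
print(pasjq_bruteforce(R1, R2, 0.4))
```

In this toy input the joined tuple (0, 0) dominates (1, 0), so (1, 0) is a skyline object only in worlds where the first tuple of R1 is absent. Enumerating worlds over base tuples, rather than treating joined tuples as independent, matters because joined tuples that share a base tuple are correlated.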
Pages: 12