Top k probabilistic skyline queries on uncertain data

被引:4
作者
Yang, Zhibang [1 ]
Li, Kenli [2 ]
Zhou, Xu [2 ]
Mei, Jing [3 ]
Gao, Yunjun [4 ]
机构
[1] Changsha Univ, Coll Comp Engn & Appl Math, Changsha 410003, Hunan, Peoples R China
[2] Hunan Univ, Coll Informat Sci & Engn, Changsha 410082, Hunan, Peoples R China
[3] Hunan Normal Univ, Coll Math & Comp Sci, Changsha 410082, Hunan, Peoples R China
[4] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Skyline query; Query processing; Uncertain data; EFFICIENT; RETRIEVAL; ALGORITHMS;
D O I
10.1016/j.neucom.2018.03.052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Uncertainty of data is inherent in many applications, and query processing over uncertain data has gained widespread attention. The probabilistic skyline query is a powerful tool for managing uncertain data. However, the famous probabilistic skyline query, called p-skyline query, is likely to return unattractive objects which have no advantage in either their attributes or skyline probabilities with comparing to other query results. Moreover, it may return too many objects to offer any meaningful insight for customers. In this paper, we first propose a modified p-skyline (PS) query based on a strong dominance operator to identify truly attractive results. Then we formulate a top k MPS (TkMPS) query on the basis of a new ranking criterion. We present effective approaches for processing the MPS query, and extend these approaches to process the TkMPS query. To improve the query performance, the reuse technique is adopted. Extensive experiments verify that the proposed algorithms for the MPS and TkMPS queries are efficient and effective, our MPS query can filter out 34.44% unattractive objects from the p-skyline query results at most, and although in some cases the results of the MPS and the p-skyline queries are just the same, our MPS query needs much less CPU, I/O, and memory costs. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 36 条
[1]   Asymptotically Efficient Algorithms for Skyline Probabilities of Uncertain Data [J].
Atallah, Mikhail J. ;
Qi, Yinian ;
Yuan, Hao .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2011, 36 (02)
[2]   Computing All Skyline Probabilities for Uncertain Data [J].
Atallah, Mikhail J. ;
Qi, Yinian .
PODS'09: PROCEEDINGS OF THE TWENTY-EIGHTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2009, :279-287
[3]   Domination in the Probabilistic World: Computing Skylines for Arbitrary Correlations and Ranking Semantics [J].
Bartolini, Ilaria ;
Ciaccia, Paolo ;
Patella, Marco .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2014, 39 (02)
[4]   The Skyline of a Probabilistic Relation [J].
Bartolini, Ilaria ;
Ciaccia, Paolo ;
Patella, Marco .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (07) :1656-1669
[5]   The Skyline operator [J].
Börzsönyi, S ;
Kossmann, D ;
Stocker, K .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :421-430
[6]   ON THE AVERAGE NUMBER OF MAXIMA IN A SET OF VECTORS [J].
BUCHTA, C .
INFORMATION PROCESSING LETTERS, 1989, 33 (02) :63-65
[7]   Efficient and Progressive Algorithms for Distributed Skyline Queries over Uncertain Data [J].
Ding, Xiaofeng ;
Jin, Hai .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (08) :1448-1462
[8]   A probabilistic relational algebra for the integration of information retrieval and database systems [J].
Fuhr, N ;
Rolleke, T .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1997, 15 (01) :32-66
[9]   Efficient algorithms for finding the most desirable skyline objects [J].
Gao, Yunjun ;
Liu, Qing ;
Chen, Lu ;
Chen, Gang ;
Li, Qing .
KNOWLEDGE-BASED SYSTEMS, 2015, 89 :250-264
[10]   On efficient reverse skyline query processing [J].
Gao, Yunjun ;
Liu, Qing ;
Zheng, Baihua ;
Chen, Gang .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (07) :3237-3249