SCSA: Evaluating skyline queries in incomplete data

被引:0
作者
Yonis Gulzar
Ali A. Alwan
Radhwan Mohamed Abdullah
Qin Xin
Marwa B. Swidan
机构
[1] International Islamic University Malaysia,Department of Computer Science, Kulliyyah of Information and Communication Technology
[2] University of Mosul,Division of Basic Sciences, College of Agriculture and Forestry
[3] Universiti Putra Malaysia,Faculty of Computer Science and Information Technology
[4] University of Faroe Islands,Faculty of Science and Technology
来源
Applied Intelligence | 2019年 / 49卷
关键词
Skyline; Skyline queries; Incomplete data; Missing data; Preference queries; Query processing;
D O I
暂无
中图分类号
学科分类号
摘要
Skyline queries have been extensively incorporated in various contemporary database applications. The list includes but is not limited to multi-criteria decision-making systems, decision support systems, and recommendation systems. Due to its great benefits and wide application range, many skyline algorithms have already been proposed in numerous data settings. Nonetheless, most researchers presume the completion of data meaning that all data item values are available. Since this assumption cannot be sustained in a large number of real-world database applications, the existing algorithms are rather inadequate to be directly applied on a database with incomplete data. In such cases, processing skyline queries on incomplete data incur exhaustive pairwise comparisons between data items, which may lead to loss of the transitivity property of the skyline technique. Losing the transitivity property may in turn give rise to the problem of cyclic dominance. In order to address these issues, we propose a new skyline algorithm called Sorting-based Cluster Skyline Algorithm (SCSA) that combines the sorting and partitioning techniques and simplifies the skyline computation on an incomplete dataset. These two techniques help boost the skyline process and avoid many unnecessary pairwise comparisons between data items to prune the dominated data items. The comprehensive experiments carried out on both synthetic and real-life datasets demonstrate the effectiveness and versatility of our approach as compared to the currently used approaches.
引用
收藏
页码:1636 / 1657
页数:21
相关论文
共 77 条
[1]  
Gulzar Y(2016)A Framework for Evaluating Skyline Queries over Incomplete Data Procedia Computer Science 94 191-198
[2]  
Alwan AA(2018)Skyline queries over possibilistic RDF data Int J Approx Reason 93 277-289
[3]  
Salleh N(2016)Efficient Skyline Maintenance over Frequently Updated Evidential Databases Communications in Computer and Information Science 611 199-210
[4]  
Shaikhli IFA(2017)Processing skyline queries in incomplete database: Issues, challenges and future trends J Comput Sci 13 647-658
[5]  
Alvi SIM(2018)A Model for Skyline Query Processing in a Partially Complete Database Adv Sci Lett 24 1339-1343
[6]  
Abidi A(2007)Efficient continuous skyline computation Inf Sci 177 3411-3437
[7]  
Elmi S(2018)A Model for Processing Skyline Queries in Crowd-sourced Databases Indonesian Journal of Electrical Engineering and Computer Science 10 798-806
[8]  
Bach Tobji MA(1975)On Finding the Maxima of a Set of Vectors J ACM 22 469-476
[9]  
HadjAli A(1978)On the Average Number of Maxima in a Set of Vectors and Applications J ACM 25 536-543
[10]  
Ben Yaghlane B(1993)Fast linear expected-time algorithms for computing maxima and convex hulls Algorithmica 9 168-183