Exploring correlation for fast skyline computation

被引:3
|
作者
Yu, Boseon [1 ]
Choi, Wonik [2 ]
Liu, Ling [3 ]
机构
[1] Korea Inst Sci & Technol, Hwarang Ro 14gil 5, Seoul, South Korea
[2] Inha Univ, Sch Informat & Commun Engn, 100 Inharo, Incheon, South Korea
[3] Georgia Inst Technol, Coll Comp, 266 Ferst Dr, Atlanta, GA 30332 USA
关键词
Skyline; Information extraction; Data analysis; Parallel computing; MULTICORE; QUERIES;
D O I
10.1007/s11227-017-2064-0
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Scaling skyline queries over high-dimensional datasets remains to be challenging due to the fact that most existing algorithms assume dimensional independence when establishing the worst-case complexity by discarding correlation distribution. In this paper, we present HashSkyline, a systematic and correlation-aware approach for scaling skyline queries over high-dimensional datasets with three novel features: First, it offers a fast hash-based method to prune non-skyline points by utilizing data correlation characteristics and speed up the overall skyline evaluation for correlated datasets. Second, we develop , which can dramatically reduce the response time for anti-correlated and independent datasets by capitalizing on the parallel processing power of GPUs. Third, the HashSkyline approach uses the pivot cell-based mechanism combined with the correlation threshold to determine the correlation distribution characteristics for a given dataset, enabling adaptive configuration of HashSkyline for skyline query evaluation by auto-switching of and . We evaluate the validity of HashSkyline using both synthetic datasets and real datasets. Our experiments show that HashSkyline consumes significantly less pre-processing cost and achieves significantly higher overall query performance, compared to existing state-of-the-art algorithms.
引用
收藏
页码:5071 / 5102
页数:32
相关论文
共 50 条
  • [31] Efficient Region-Based Skyline Computation for a Group of Users
    Dehaki, Ghoncheh Babanejad
    Ibrahim, Hamidah
    Alwan, Ali A.
    Sidi, Fatimah
    Udzir, Nur Izura
    Lawal, Ma'aruf Mohammed
    IEEE ACCESS, 2022, 10 : 94496 - 94517
  • [32] A Rule-based Skyline Computation over a Dynamic Database
    Dehaki, Ghazaleh Babanejad
    Ibrahim, Hamidah
    Sidi, Fatimah
    Udzir, Nur Izura
    Alwan, Ali A.
    22ND INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2020), 2020, : 97 - 103
  • [33] SCP: Skyline Computation Planner for Distributed, Update Intensive Environment
    Kulkarni, R. D.
    Momin, B. F.
    INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS (ICTIS 2017) - VOL 1, 2018, 83 : 399 - 408
  • [34] Efficient Skyline Computation For optimal Service Composition with Fuzzy preference relationships
    Rhimi, Fatma
    Ben Yahia, Saloua
    Ben Ahmed, Samir
    2015 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC 2015), 2015,
  • [35] A Continuous Region-Based Skyline Computation for a Group of Mobile Users
    Dehaki, Ghoncheh Babanejad
    Ibrahim, Hamidah
    Alwan, Ali A.
    Sidi, Fatimah
    Udzir, Nur Izura
    Lawal, Ma'aruf Mohammed
    SYMMETRY-BASEL, 2022, 14 (10):
  • [36] Skyline Computation for Supporting Location-Based Services in a Road Network
    Xiao, Yingyuan
    Zhang, Hua
    Wang, Jingsong
    Wang, Hongya
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (05): : 1937 - 1948
  • [37] A two-phase data space partitioning for efficient skyline computation
    Nasridinov, Aziz
    Choi, Jong-Hyeok
    Park, Young-Ho
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3617 - 3628
  • [38] An Efficient Architecture for Parallel Skyline Computation over Large Distributed Datasets
    Li, He
    Jang, Sumin
    Yoo, Jaesoo
    JOURNAL OF INTERNET TECHNOLOGY, 2014, 15 (04): : 577 - 588
  • [39] A two-phase data space partitioning for efficient skyline computation
    Aziz Nasridinov
    Jong-Hyeok Choi
    Young-Ho Park
    Cluster Computing, 2017, 20 : 3617 - 3628
  • [40] Efficient Skyline Computation Over an Incomplete Database With Changing States and Structures
    Dehaki, Ghazaleh Babanejad
    Ibrahim, Hamidah
    Alwan, Ali A.
    Sidi, Fatimah
    Udzir, Nur Izura
    IEEE ACCESS, 2021, 9 : 88699 - 88723