SkyFlow: Heterogeneous streaming for skyline computation using FlowGraph and SYCL

被引:3
作者
Carlos Romero, Jose [1 ]
Navarro, Angeles [1 ]
Rodriguez, Andres [1 ]
Asenjo, Rafael [1 ]
机构
[1] Univ Malaga, Dept Comp Archirecture, Malaga, Spain
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2023年 / 141卷
关键词
Skyline; Stream of queries; Heterogeneous computing; Integrated GPU; SYCL; OneAPI; EFFICIENT; MULTICORE; QUERIES;
D O I
10.1016/j.future.2022.11.021
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The skyline is an optimization operator widely used for multi-criteria decision making. It allows minimizing an n-dimensional dataset into its smallest subset. In this work we present SkyFlow, the first heterogeneous CPU+GPU graph-based engine for skyline computation on a stream of data queries. Two data flow approaches, Coarse-grained and Fine-grained, have been proposed for different streaming scenarios. Coarse-grained aims to keep in parallel the computation of two queries using a hybrid solution with two state-of-the-art skyline algorithms: one optimized for CPU and another for GPU. We also propose a model to estimate at runtime the computation time of any arriving data query. This estimation is used by a heuristic to schedule the data query on the device queue in which it will finish earlier. On the other hand, Fine-grained splits one query computation between CPU and GPU. An experimental evaluation using as target architecture a heterogeneous system comprised of a multicore CPU and an integrated GPU for different streaming scenarios and datasets, reveals that our heterogeneous CPU+GPU approaches always outperform previous only-CPU and only-GPU state-of-the-art implementations up to 6.86xand 5.19x, respectively, and they fall below 6% of ideal peak performance at most. We also evaluate Coarse-grained vs Fine-Grained finding that each approach is better suited to different streaming scenarios.(c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:269 / 283
页数:15
相关论文
共 41 条
[1]   The negative skycube [J].
Alami, Karim ;
Hanusse, Nicolas ;
Kamnang-Wanko, Patrick ;
Maabout, Sofian .
INFORMATION SYSTEMS, 2020, 88
[2]  
[Anonymous], 2021, DATASET NBA
[3]  
[Anonymous], 2021, DATASET HOUSE
[4]  
[Anonymous], 2021, DATASET WEATHER
[5]   Efficient Sort-Based Skyline Evaluation [J].
Bartolini, Ilaria ;
Ciaccia, Paolo ;
Patella, Marco .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2008, 33 (04)
[6]  
Begh K.S., 2013, P 9 INT WORKSHOP DAT, P1
[7]  
Blackard Jock A, 1998, UCI Machine Learning Repository
[8]   Template Skycube Algorithms for Heterogeneous Parallelism on Multicore and GPU Architectures [J].
Bogh, Kenneth S. ;
Chester, Sean ;
Sidlauskas, Darius ;
Assent, Ira .
SIGMOD'17: PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2017, :447-462
[9]   SkyAlign: a portable, work-efficient skyline algorithm for multicore and GPU architectures [J].
Bogh, Kenneth S. ;
Chester, Sean ;
Assent, Ira .
VLDB JOURNAL, 2016, 25 (06) :817-841
[10]   The Skyline operator [J].
Börzsönyi, S ;
Kossmann, D ;
Stocker, K .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :421-430