GASSER: An Auto-Tunable System for General Sliding-Window Streaming Operators on GPUs

被引:17
作者
De Matteis, Tiziano [1 ]
Mencagli, Gabriele [1 ]
De Sensi, Daniele [1 ]
Torquati, Massimo [1 ]
Danelutto, Marco [1 ]
机构
[1] Univ Pisa, Dept Comp Sci, I-56127 Pisa, Italy
关键词
Data stream processing; sliding-window queries; GPU processing; autotuning; selfconfiguring systems;
D O I
10.1109/ACCESS.2019.2910312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today's stream processing systems handle high-volume data streams in an efficient manner. To achieve this goal, they are designed to scale out on large clusters of commodity machines. However, despite the efficient use of distributed architectures, they lack support to co-processors like graphical processing units (GPUs) ready to accelerate data-parallel tasks. The main reason for this lack of integration is that GPU processing and the streaming paradigm have different processing models, with GPUs needing a bulk of data present at once while the streaming paradigm advocates a tuple-at-a-time processing model. This paper contributes to fill this gap by proposing Gasser, a system for offloading the execution of sliding-window operators on GPUs. The system focuses on completely general functions by targeting the parallel processing of non-incremental queries that are not supported by the few existing GPU-based streaming prototypes. Furthermore, Gasser provides an auto-tuning approach able to automatically find the optimal value of the configuration parameters (i.e., batch length and the degree of parallelism) needed to optimize throughput and latency with the given query and data stream. The experimental part assesses the performance efficiency of Gasser by comparing its peak throughput and latency against Apache Flink, a popular and scalable streaming system. Furthermore, we evaluate the penalty induced by supporting completely general queries against the performance achieved by the state-of-the-art solution specifically optimized for incremental queries. Finally, we show the speed and accuracy of the auto-tuning approach adopted by Gasser, which is able to self-configure the system by finding the right configuration parameters without manual tuning by the users.
引用
收藏
页码:48753 / 48769
页数:17
相关论文
共 30 条
[21]  
Nabi Z., 2016, Pro spark streaming: The zen of real-time analytics using apache spark
[22]  
Pinnecke Marcus., 2015, GvDB, P78
[23]  
Shein A., 2018, EDBT, P397, DOI DOI 10.5441/002/EDBT.2018.35
[24]  
Tangwongsan K., 2017, P 11 ACM INT C DISTR, P66, DOI DOI 10.1145/3093742.3093925
[25]   Networked Drone Cameras for Sports Streaming [J].
Wang, Xiaoli ;
Chowdhery, Aakanksha ;
Chiang, Mung .
2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, :308-318
[26]  
Wesley R, 2016, PROC VLDB ENDOW, V9, P1221
[27]  
Yongpeng Zhang, 2011, 2011 International Conference on Parallel Processing, P245, DOI 10.1109/ICPP.2011.22
[28]   Revisiting the Design of Data Stream Processing Systems on Multi-Core Processors [J].
Zhang, Shuhao ;
He, Bingsheng ;
Dahlmeier, Daniel ;
Zhou, Amelie Chi ;
Heinze, Thomas .
2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, :659-670
[29]  
Zhao XW, 2017, SOFTWARE ARCHITECTURE FOR BIG DATA AND THE CLOUD, P183, DOI 10.1016/B978-0-12-805467-3.00011-9
[30]  
AI COMMUN, V13, P215