Parallel Patterns for Window-Based Stateful Operators on Data Streams: An Algorithmic Skeleton Approach

被引:25
作者
De Matteis, Tiziano [1 ]
Mencagli, Gabriele [1 ]
机构
[1] Univ Pisa, Dept Comp Sci, Largo B Pontecorvo 3, I-56127 Pisa, Italy
关键词
Parallel patterns; Algorithmic skeletons; Data stream processing; Multi-/many-core architectures;
D O I
10.1007/s10766-016-0413-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The topic of Data Stream Processing is a recent and highly active research area dealing with the in-memory, tuple-by-tuple analysis of streaming data. Continuous queries typically consume huge volumes of data received at a great velocity. Solutions that persistently store all the input tuples and then perform off-line computation are impractical. Rather, queries must be executed continuously as data cross the streams. The goal of this paper is to present parallel patterns for window-based stateful operators, which are the most representative class of stateful data stream operators. Parallel patterns are presented "A la" Algorithmic Skeleton, by explaining the rationale of each pattern, the preconditions to safely apply it, and the outcome in terms of throughput, latency and memory consumption. The patterns have been implemented in the framework targeting off-the-shelf multicores. To the best of our knowledge this is the first time that a similar effort to merge the Data Stream Processing domain and the field of Structured Parallelism has been made.
引用
收藏
页码:382 / 401
页数:20
相关论文
共 18 条
[1]  
Aggarwal CC., 2007, DATA STREAMS ADV DAT
[2]   On Designing Multicore-Aware Simulators for Systems Biology Endowed with OnLine Statistics [J].
Aldinucci, Marco ;
Calcagno, Cristina ;
Coppo, Mario ;
Damiani, Ferruccio ;
Drocco, Maurizio ;
Sciacca, Eva ;
Spinella, Salvatore ;
Torquati, Massimo ;
Troina, Angelo .
BIOMED RESEARCH INTERNATIONAL, 2014, 2014
[3]  
Aldinucci M, 2012, LECT NOTES COMPUT SC, V7484, P662, DOI 10.1007/978-3-642-32820-6_65
[4]   The CQL continuous query language: semantic foundations and query execution [J].
Arasu, A ;
Babu, S ;
Widom, J .
VLDB JOURNAL, 2006, 15 (02) :121-142
[5]  
Babcock B., 2002, PODS, P1, DOI [DOI 10.1145/543613.543615, 10.1145/543613.543615]
[6]  
Balkesen C., 2011, VLDB INT WORKSH DAT
[7]   Analyzing Memory Requirements for Pervasive Grid Applications [J].
Bertolli, Carlo ;
Mencagli, Gabriele ;
Vanneschi, Marco .
PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, :297-301
[8]   Honeycomb rectangular disks [J].
Teng, YH ;
Tan, JJM ;
Hsu, LH .
PARALLEL COMPUTING, 2005, 31 (3-4) :371-388
[9]   Processing Flows of Information: From Data Stream to Complex Event Processing [J].
Cugola, Gianpaolo ;
Margara, Alessandro .
ACM COMPUTING SURVEYS, 2012, 44 (03)
[10]  
Dobra A., 2002, P 2002 ACM SIGMOD IN