Multithreading-Enabled Active Replication for Event Stream Processing Operators

Cited by: 12
Authors
Brito, Andrey [1]
Fetzer, Christof [1]
Felber, Pascal [2]
Affiliations
[1] Tech Univ Dresden, Syst Engn Grp, Dresden, Germany
[2] Univ Neuchatel, Inst Informat, CH-2000 Neuchatel, Switzerland
Source
2009 28TH IEEE INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS | 2009
Keywords
DOI
10.1109/SRDS.2009.37
Chinese Library Classification number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Event Stream Processing (ESP) systems are very popular in monitoring applications. Algorithmic trading, network monitoring and sensor networks are good examples of applications that rely upon ESP systems. As these systems become larger and more widely deployed, they have to answer increasingly stronger requirements that are often difficult to satisfy. Fault-tolerance is a good example of such a non-trivial requirement. Making ESP operators fault-tolerant can add considerable performance overhead to the application. In this paper, we focus on active replication as an approach to provide fault-tolerance to ESP operators. More precisely, we address the performance costs of active replication for operators in distributed ESP applications. We use a speculation mechanism based on Software Transactional Memory (STM) to achieve the following goals: (i) enable replicas to make progress using optimistic delivery; (ii) enable early forwarding of speculative computation results; (iii) enable active replication of multi-threaded operators using transactional executions. Experimental evaluation shows that, using this combination of mechanisms, one can implement highly efficient fault-tolerant ESP operators.
Pages: 22 / +
Page count: 3