A Task Allocation Method for Stream Processing with Recovery Latency Constraint

被引:0
作者
Hong-Liang Li
Jie Wu
Zhen Jiang
Xiang Li
Xiao-Hui Wei
机构
[1] Jilin University,College of Computer Science and Technology
[2] Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education,Department of Computer and Information Sciences
[3] Temple University,Department of Computer Science
[4] West Chester University of Pennsylvania,undefined
来源
Journal of Computer Science and Technology | 2018年 / 33卷
关键词
stream processing; task allocation; fault-tolerance; upstream backup; recovery latency;
D O I
暂无
中图分类号
学科分类号
摘要
Stream processing applications continuously process large amounts of online streaming data in real time or near real time. They have strict latency constraints. However, the continuous processing makes them vulnerable to any failures, and the recoveries may slow down the entire processing pipeline and break latency constraints. The upstream backup scheme is one of the most widely applied fault-tolerant schemes for stream processing systems. It introduces complex backup dependencies to tasks, which increases the difficulty of controlling recovery latencies. Moreover, when dependent tasks are located on the same processor, they fail at the same time in processor-level failures, bringing extra recovery latencies that increase the impacts of failures. This paper studies the relationship between the task allocation and the recovery latency of a stream processing application. We present a correlated failure effect model to describe the recovery latency of a stream topology in processor-level failures under a task allocation plan. We introduce a recovery-latency aware task allocation problem (RTAP) that seeks task allocation plans for stream topologies that will achieve guaranteed recovery latencies. We discuss the difference between RTAP and classic task allocation problems and present a heuristic algorithm with a computational complexity of O(n log2n) to solve the problem. Extensive experiments were conducted to verify the correctness and effectiveness of our approach. It improves the resource usage by 15%–20% on average.
引用
收藏
页码:1125 / 1139
页数:14
相关论文
共 68 条
  • [1] Stonebraker M(2005)The 8 requirements of real-time stream processing ACM SIGMOD Record 34 42-47
  • [2] Çetintemel U(2013)Mill-Wheel: Fault-tolerant stream processing at Internet scale Proceedings of the VLDB Endowment 6 1033-1044
  • [3] Zdonik S(2017)A heuristic placement selection approach of partitions of mobile applications in mobile cloud computing model based on community collaboration Cluster Computing 20 3131-3146
  • [4] Akidau T(2017)Minimum backups for stream processing with recovery latency guarantees IEEE Transactions on Reliability 66 783-94
  • [5] Balikov A(2017)Building a fault tolerant framework with deadline guarantee in big data stream computing environments Journal of Computer and System Sciences 89 4-23
  • [6] Bekiroğlu K(2014)Open challenges for data stream mining research ACM SIGKDD Explorations Newsletter 16 1-10
  • [7] Chernyak S(2008)Fault-tolerance in the borealis distributed stream processing system ACM Transactions on Database Systems 33 1-44
  • [8] Zhao J(2007)WhiteWater: Distributed processing of fast streams IEEE Transactions on Knowledge and Data Engineering 19 1214-1226
  • [9] Ou S(2017)Proactive elasticity and energy awareness in data stream processing Journal of Systems and Software 127 302-319
  • [10] Hu L(1997)The balanced hypercube: A cube-based system for fault-tolerant applications IEEE Transactions on Computers 46 484-90