A Task Allocation Method for Stream Processing with Recovery Latency Constraint

被引:0
|
作者
Hong-Liang Li
Jie Wu
Zhen Jiang
Xiang Li
Xiao-Hui Wei
机构
[1] Jilin University,College of Computer Science and Technology
[2] Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education,Department of Computer and Information Sciences
[3] Temple University,Department of Computer Science
[4] West Chester University of Pennsylvania,undefined
关键词
stream processing; task allocation; fault-tolerance; upstream backup; recovery latency;
D O I
暂无
中图分类号
学科分类号
摘要
Stream processing applications continuously process large amounts of online streaming data in real time or near real time. They have strict latency constraints. However, the continuous processing makes them vulnerable to any failures, and the recoveries may slow down the entire processing pipeline and break latency constraints. The upstream backup scheme is one of the most widely applied fault-tolerant schemes for stream processing systems. It introduces complex backup dependencies to tasks, which increases the difficulty of controlling recovery latencies. Moreover, when dependent tasks are located on the same processor, they fail at the same time in processor-level failures, bringing extra recovery latencies that increase the impacts of failures. This paper studies the relationship between the task allocation and the recovery latency of a stream processing application. We present a correlated failure effect model to describe the recovery latency of a stream topology in processor-level failures under a task allocation plan. We introduce a recovery-latency aware task allocation problem (RTAP) that seeks task allocation plans for stream topologies that will achieve guaranteed recovery latencies. We discuss the difference between RTAP and classic task allocation problems and present a heuristic algorithm with a computational complexity of O(n log2n) to solve the problem. Extensive experiments were conducted to verify the correctness and effectiveness of our approach. It improves the resource usage by 15%–20% on average.
引用
收藏
页码:1125 / 1139
页数:14
相关论文
共 50 条
  • [1] A Task Allocation Method for Stream Processing with Recovery Latency Constraint
    Li, Hong-Liang
    Wu, Jie
    Jiang, Zhen
    Li, Xiang
    Wei, Xiao-Hui
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2018, 33 (06) : 1125 - 1139
  • [2] Task Allocation for Stream Processing with Recovery Latency Guarantee
    Li, Hongliang
    Wu, Jie
    Jiang, Zhen
    Li, Xiang
    Wei, Xiaohui
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 379 - 383
  • [3] Integrated Recovery and Task Allocation for Stream Processing
    Li, Hongliang
    Wu, Jie
    Jiang, Zhen
    Li, Xiang
    Wei, Xiaohui
    Zhuang, Yuan
    2017 IEEE 36TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2017,
  • [4] Topology-Aware Task Allocation for Distributed Stream Processing with Latency Guarantee
    Wei, Xiaohui
    Wei, Xun
    Li, Hongliang
    Zhuang, Yuan
    Yue, Hengshan
    ICAIP 2018: 2018 THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN IMAGE PROCESSING, 2018, : 245 - 251
  • [5] Task Allocation for Distributed Stream Processing
    Eidenbenz, Raphael
    Locher, Thomas
    IEEE INFOCOM 2016 - THE 35TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS, 2016,
  • [6] Adaptive and Collaborative Edge Inference in Task Stream with Latency Constraint
    Song, Jinduo
    Liu, Zhicheng
    Wang, Xiaofei
    Qiu, Chao
    Chen, Xu
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [7] Topology-aware task allocation for online distributed stream processing applications with latency constraints
    Wei, Xiaohui
    Wei, Xun
    Li, Hongliang
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 534
  • [8] Minimum Backups for Stream Processing With Recovery Latency Guarantees
    Li, Hongliang
    Wu, Jie
    Jiang, Zhen
    Li, Xiang
    Wei, Xiaohui
    IEEE TRANSACTIONS ON RELIABILITY, 2017, 66 (03) : 783 - 794
  • [9] Flexible Executor Allocation without Latency Increase for Stream Processing in Apache Spark
    Morisawa, Yuta
    Suzuki, Masaki
    Kitahara, Takeshi
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2198 - 2206
  • [10] Predictive method of task allocation in stream-based computing
    Aoyagi, Y
    Uehara, M
    Mori, H
    INFORMATION NETWORKING IN ASIA, 2001, 3 : 209 - 218