Atomicity of batches in stream processing

被引:0
|
作者
K. Vidyasankar
机构
[1] Memorial University,Department of Computer Science
来源
Journal of Ambient Intelligence and Humanized Computing | 2018年 / 9卷
关键词
Stream processing; Transactions; Splitting and merging of batches; Atomic batches; Concurrent execution; Compensation;
D O I
暂无
中图分类号
学科分类号
摘要
Stream processing is about processing continuous streams of data by programs in a workflow. Continuous execution is discretized by grouping input stream tuples into batches and using one batch at a time for the execution of programs. As source input batches arrive continuously, several batches may be processed in the workflow simultaneously. Ensuring correctness of these concurrent executions is important. We apply (database) transaction concept for the correctness. A general requirement is that each batch be processed completely in the workflow. That is, all the programs triggered by the batch, directly and transitively, in the workflow must be executed successfully. We say that a batch is executed atomically if it is processed completely, independently of the processing of other batches, and, if needed, the processing can be compensated without affecting the processing of other batches. The batches which can be executed atomically are called atomic batches. If batches are processed in isolation in the workflow, ensuring atomicity is fairly straightforward. However, when they are split, merged or overlapped along the workflow computation, ensuring atomicity becomes complicated. In some cases, several source input batches can be combined to form an atomic batch. In some other cases, execution can be prompted to yield atomic batches. In this paper, we study these issues.
引用
收藏
页码:19 / 29
页数:10
相关论文
共 50 条
  • [31] VeilGraph: incremental graph stream processing
    Miguel E. Coimbra
    Sérgio Esteves
    Alexandre P. Francisco
    Luís Veiga
    Journal of Big Data, 9
  • [32] Stream Processing on Clustered Edge Devices
    Dautov, Rustem
    Distefano, Salvatore
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (02) : 885 - 898
  • [33] SeQual-Stream: approaching stream processing to quality control of NGS datasets
    Óscar Castellanos-Rodríguez
    Roberto R. Expósito
    Juan Touriño
    BMC Bioinformatics, 24
  • [34] SeQual-Stream: approaching stream processing to quality control of NGS datasets
    Castellanos-Rodriguez, Oscar
    Exposito, Roberto R.
    Tourino, Juan
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [35] Processing Flows of Information: From Data Stream to Complex Event Processing
    Cugola, Gianpaolo
    Margara, Alessandro
    ACM COMPUTING SURVEYS, 2012, 44 (03)
  • [36] Speedup of Fuzzy Clustering Through Stream Processing on Graphics Processing Units
    Anderson, Derek T.
    Luke, Robert H.
    Keller, James M.
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2008, 16 (04) : 1101 - 1106
  • [37] Adaptive key partitioning in distributed stream processing
    Gang Liu
    Zeting Wang
    Amelie Chi Zhou
    Rui Mao
    CCF Transactions on High Performance Computing, 2024, 6 : 164 - 178
  • [38] Joker: Elastic stream processing with organic adaptation
    Kahveci, Basri
    Gedik, Bugra
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 137 : 205 - 223
  • [39] Pushing Intelligence to the Edge with a Stream Processing Architecture
    Dautov, Rustem
    Distefano, Salvatore
    Bruneo, Dario
    Longo, Francesco
    Merlino, Giovani
    Puliafito, Antonio
    2017 IEEE INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2017, : 792 - 799
  • [40] A decentralized control mechanism for stream processing networks
    Zhen Liu
    Ao Tang
    Cathy H. Xia
    Li Zhang
    Annals of Operations Research, 2009, 170 : 161 - 182