Amber: A Debuggable Dataflow System Based on the Actor Model

被引:14
|
作者
Kumar, Avinash [1 ]
Wang, Zuozhi [1 ]
Ni, Shengquan [1 ]
Li, Chen [1 ]
机构
[1] UC Irvine, Dept Comp Sci, Irvine, CA 92697 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 13卷 / 05期
关键词
D O I
10.14778/3377369.3377381
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A long-running analytic task on big data often leaves a developer in the dark without providing valuable feedback about the status of the execution. In addition, a failed job that needs to restart from scratch can waste earlier computing resources. An effective method to address these issues is to allow the developer to debug the task during its execution, which is unfortunately not supported by existing big data solutions. In this paper we develop a system called Amber that supports responsive debugging during the execution of a workflow task. After starting the execution, the developer can pause the job at will, investigate the states of the cluster, modify the job, and resume the computation. She can also set conditional breakpoints to pause the execution when certain conditions are satisfied. In this way, the developer can gain a much better understanding of the run-time behavior of the execution and more easily identify issues in the job or data. Amber is based on the actor model, a distributed computing paradigm that provides concurrent units of computation using actors. We give a full specification of Amber, and implement it on top of the Orleans system. Our experiments show its high performance and usability of debugging on computing clusters.
引用
收藏
页码:740 / 753
页数:14
相关论文
共 50 条
  • [31] A tree-based dataflow model for the unsymmetric multifrontal method
    Eisenstat, SC
    Liu, JWH
    ELECTRONIC TRANSACTIONS ON NUMERICAL ANALYSIS, 2005, 21 : 1 - 19
  • [32] A tree-based dataflow model for the unsymmetric multifrontal method
    Eisenstat, Stanley C.
    Liu, Joseph W. H.
    Electronic Transactions on Numerical Analysis, 2005, 21 : 1 - 19
  • [33] A DATAFLOW MODEL FOR .NET-BASED GRID COMPUTING SYSTEMS
    Jin, Chao
    Buyya, Rajkumar
    Stein, Lex
    Zhang, Zheng
    GCA 2007: PROCEEDINGS OF THE 3RD INTERNATIONAL WORKSHOP ON GRID COMPUTING AND APPLICATIONS, 2007, : 25 - +
  • [34] Integration of an Autoencoder Model with an Actor-Oriented System
    Dyubele, Sithembiso
    Cele, Noxolo Pretty
    Mbangata, Lubabalo
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (03): : 2629 - 2647
  • [35] Study on regional technical innovation triple-actor system based on the Triple Helix Model
    Shan Yuanyuan
    Zhao Yulin
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INNOVATION & MANAGEMENT, VOLS I AND II, 2007, : 272 - 276
  • [36] OpenCDN: An ICN-Based Open Content Distribution System Using Distributed Actor Model
    Narayanan, Arvind
    Ramadan, Eman
    Zhang, Zhi-Li
    IEEE INFOCOM 2018 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2018, : 268 - 273
  • [37] Advances in the dataflow computational model
    Najjar, Walid A.
    Lee, Edward A.
    Gao, Guang R.
    Parallel Computing, 1999, 25 (13): : 1907 - 1929
  • [38] Advances in the dataflow computational model
    Naggar, WA
    Lee, EA
    Gao, GR
    PARALLEL COMPUTING, 1999, 25 (13-14) : 1907 - 1929
  • [39] A Monitoring System for the LHCb Dataflow
    Barbosa, Joao
    Gaspar, Clara
    Jost, Beat
    Frank, Markus
    Cardoso, Luis G.
    2016 IEEE-NPSS REAL TIME CONFERENCE (RT), 2016,
  • [40] Naiad: A Timely Dataflow System
    Murray, Derek G.
    McSherry, Frank
    Isaacs, Rebecca
    Isard, Michael
    Barham, Paul
    Abadi, Martin
    SOSP'13: PROCEEDINGS OF THE TWENTY-FOURTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, 2013, : 439 - 455