Amber: A Debuggable Dataflow System Based on the Actor Model

被引:14
|
作者
Kumar, Avinash [1 ]
Wang, Zuozhi [1 ]
Ni, Shengquan [1 ]
Li, Chen [1 ]
机构
[1] UC Irvine, Dept Comp Sci, Irvine, CA 92697 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2020年 / 13卷 / 05期
关键词
D O I
10.14778/3377369.3377381
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A long-running analytic task on big data often leaves a developer in the dark without providing valuable feedback about the status of the execution. In addition, a failed job that needs to restart from scratch can waste earlier computing resources. An effective method to address these issues is to allow the developer to debug the task during its execution, which is unfortunately not supported by existing big data solutions. In this paper we develop a system called Amber that supports responsive debugging during the execution of a workflow task. After starting the execution, the developer can pause the job at will, investigate the states of the cluster, modify the job, and resume the computation. She can also set conditional breakpoints to pause the execution when certain conditions are satisfied. In this way, the developer can gain a much better understanding of the run-time behavior of the execution and more easily identify issues in the job or data. Amber is based on the actor model, a distributed computing paradigm that provides concurrent units of computation using actors. We give a full specification of Amber, and implement it on top of the Orleans system. Our experiments show its high performance and usability of debugging on computing clusters.
引用
收藏
页码:740 / 753
页数:14
相关论文
共 50 条
  • [21] Dataflow-based Implementation of Model Predictive Control
    Gu, Ruirui
    Bhattacharyya, Shuvra S.
    Levine, Williams S.
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 2343 - 2349
  • [22] A Clockless Computing System based on the Static Dataflow Paradigm
    Verdoscia, Lorenzo
    Vaccaro, Roberto
    Giorgi, Roberto
    2014 FOURTH WORKSHOP ON DATA-FLOW EXECUTION MODELS FOR EXTREME SCALE COMPUTING DFM 2014, 2014, : 30 - 37
  • [23] Synthesis and high level optimisation of multidimensional dataflow actor networks on FPGA
    McAllister, J
    Woods, R
    Walke, R
    Reilly, D
    2004 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS DESIGN AND IMPLEMENTATION, PROCEEDINGS, 2004, : 164 - 169
  • [24] A Markov reward model for reliable synchronous dataflow system design
    Kumar, VV
    Verma, R
    Lach, J
    Dugan, JB
    2004 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2004, : 817 - 825
  • [25] EVALUATION OF A DATAFLOW MODEL
    JARAGH, MH
    HABIB, MKR
    COMPUTER PERFORMANCE, 1983, 4 (04): : 242 - 252
  • [26] Timely Dataflow: A Model
    Abadi, Martin
    Isard, Michael
    FORMAL TECHNIQUES FOR DISTRIBUTED OBJECTS, COMPONENTS, AND SYSTEMS, FORTE 2015, 2015, 9039 : 131 - 145
  • [27] A Priority-Based Budget Scheduler with Conservative Dataflow Model
    Steine, Marcel
    Bekooij, Marco
    Wiggers, Maarten
    PROCEEDINGS OF THE 2009 12TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, ARCHITECTURES, METHODS AND TOOLS, 2009, : 37 - +
  • [28] An actor-based model for the electronic market
    Loia, V
    Scandizzo, S
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 2880 - 2885
  • [29] An actor model for rule-based systems
    Boufriche-Boufaïda, Z
    EXPERT SYSTEMS, 1999, 16 (01) : 11 - 18
  • [30] Liam: An Actor Based Programming Model for HDLs
    Skinner, Haven
    Possignolo, Rafael Trapani
    Renau, Jose
    MEMOCODE 2017: PROCEEDINGS OF THE 15TH ACM-IEEE INTERNATIONAL CONFERENCE ON FORMAL METHODS AND MODELS FOR SYSTEM DESIGN, 2017, : 185 - 188