Reconciling fault-tolerant distributed computing and systems-on-chip

Cited by: 22
Authors
Fuegger, Matthias [1]
Schmid, Ulrich [1]
Affiliations
[1] Tech Univ Wien, Embedded Comp Syst Grp E182 2, A-1040 Vienna, Austria
Funding
Austrian Science Fund
Keywords
Clock synchronization; Fault-tolerant distributed systems; Modeling approaches; VLSI; CLOCK SYNCHRONIZATION; SOFT ERRORS; DESIGN; IMPOSSIBILITY; ARCHITECTURE; CONSENSUS; CIRCUITS; ISSUES; TRENDS
DOI
10.1007/s00446-011-0151-7
Chinese Library Classification number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Classic distributed computing abstractions do not match well with the reality of digital logic gates, which are the elementary building blocks of Systems-on-Chip (SoCs) and other Very Large Scale Integrated (VLSI) circuits: massively concurrent, continuous computations undermine the concept of sequential processes executing sequences of atomic zero-time computing steps, and very limited computational resources at gate level make even simple operations prohibitively costly. In this paper, we introduce a modeling and analysis framework based on continuous computations and zero-bit message channels, and employ this framework for the correctness and performance analysis of a distributed fault-tolerant clocking approach for SoCs. Starting from a "classic" distributed Byzantine fault-tolerant tick generation algorithm, we show how to adapt it for direct implementation in clockless digital logic, rigorously prove its correctness, and derive analytic expressions for worst-case performance metrics such as synchronization precision and clock frequency. Both the algorithm's correctness and the achievable synchronization precision depend solely on the ratio of certain path delays rather than on absolute delay values. Since these ratios can be mapped directly to placement and routing constraints, there is typically no need to change the algorithm when migrating to a faster implementation technology and/or when using a slightly different layout in an SoC.
Pages: 323-355
Number of pages: 33
Related papers
50 in total
  • [31] The Role of A Priori Belief in the Design and Analysis of Fault-Tolerant Distributed Systems
    Giorgio Cignarale
    Ulrich Schmid
    Tuomas Tahko
    Roman Kuznets
    Minds and Machines, 2023, 33 : 293 - 319
  • [32] The Role of A Priori Belief in the Design and Analysis of Fault-Tolerant Distributed Systems
    Cignarale, Giorgio
    Schmid, Ulrich
    Tahko, Tuomas
    Kuznets, Roman
    MINDS AND MACHINES, 2023, 33 (02) : 293 - 319
  • [33] Distributed Fault-Tolerant Control of Multiagent Systems: An Adaptive Learning Approach
    Khalili, Mohsen
    Zhang, Xiaodong
    Cao, Yongcan
    Polycarpou, Marios M.
    Parisini, Thomas
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 420 - 432
  • [34] CLOSURE AND CONVERGENCE - A FOUNDATION OF FAULT-TOLERANT COMPUTING
    ARORA, A
    GOUDA, M
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) : 1015 - 1027
  • [35] FT-SDN: A Fault-Tolerant Distributed Architecture for Software Defined Network
    Das, Rohit Kumar
    Pohrmen, Fabiola Hazel
    Maji, Arnab Kumar
    Saha, Goutam
    WIRELESS PERSONAL COMMUNICATIONS, 2020, 114 (02) : 1045 - 1066
  • [36] A FAULT-TOLERANT DISTRIBUTED TERMINATION DETECTION SCHEME
    YE, X
    KEANE, JA
    INFORMATION PROCESSING '94, VOL I: TECHNOLOGY AND FOUNDATIONS, 1994, 51 : 300 - 305
  • [37] Cyclic storage for fault-tolerant distributed executions
    Marcelin-Jimenez, Ricardo
    Rajsbaum, Sergio
    Stevens, Brett
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2006, 17 (09) : 1028 - 1036
  • [38] A Fault-Tolerant Wideband Amplifier Based on Distributed Amplification Topology
    Zheng, Shao Yong
    Zhang, Jia Cheng
    Liu, Hai Wen
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (06) : 4516 - 4526
  • [39] A Fault-Tolerant Algorithm For Distributed Resource Allocation
    Pessolani, P.
    Jara, O.
    Gonnet, S.
    Cortes, T.
    Tinetti, F. G.
    IEEE LATIN AMERICA TRANSACTIONS, 2017, 15 (11) : 2152 - 2163
  • [40] Combination of clock-state and clock-rate correction in fault-tolerant distributed systems
    Hermann Kopetz
    Astrit Ademaj
    Alexander Hanzlik
    Real-Time Systems, 2006, 33 : 139 - 173