An adaptive programming model for fault-tolerant distributed computing

被引:14
|
作者
Gorender, Sergio
Macedo, Raimundo Jose de Araujo
Raynal, Michel
机构
[1] Univ Fed Bahia, Dept Comp Sci, Distributed Syst Lab, BR-40170110 Salvador, BA, Brazil
[2] Univ Rennes 1, IRISA, F-35042 Rennes, France
关键词
adaptability; asynchronous/synchronous distributed system; consensus; distributed computing model; fault tolerance; quality of service;
D O I
10.1109/TDSC.2007.3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The capability of dynamically adapting to distinct runtime conditions is an important issue when designing distributed systems where negotiated quality of service (QoS) cannot always be delivered between processes. Providing fault tolerance for such dynamic environments is a challenging task. Considering such a context, this paper proposes an adaptive programming model for fault-tolerant distributed computing, which provides upper-layer applications with process state information according to the current system synchrony ( or QoS). The underlying system model is hybrid, composed by a synchronous part ( where there are time bounds on processing speed and message delay) and an asynchronous part ( where there is no time bound). However, such a composition can vary over time, and, in particular, the system may become totally asynchronous ( e. g., when the underlying system QoS degrade) or totally synchronous. Moreover, processes are not required to share the same view of the system synchrony at a given time. To illustrate what can be done in this programming model and how to use it, the consensus problem is taken as a benchmark problem. This paper also presents an implementation of the model that relies on a negotiated quality of service ( QoS) for communication channels.
引用
收藏
页码:18 / 31
页数:14
相关论文
共 50 条
  • [21] Distributed Fault-Tolerant Control of Multiagent Systems: An Adaptive Learning Approach
    Khalili, Mohsen
    Zhang, Xiaodong
    Cao, Yongcan
    Polycarpou, Marios M.
    Parisini, Thomas
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 420 - 432
  • [22] An Introduction into Fault-tolerant Quantum Computing
    Paler, Alexandru
    Devitt, Simon J.
    2015 52ND ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2015,
  • [23] Distributed synthesis of fault-tolerant programs in the high atomicity model
    Bonakdarpour, Borzoo
    Kulkarni, Sandeep S.
    Abujarad, Fuad
    STABILIZATION, SAFETY, AND SECURITY OF DISTRIBUTED SYSTEMS, PROCEEDINGS, 2007, 4838 : 21 - +
  • [24] Developing fault-tolerant distributed loops
    Farrag, A. A.
    INFORMATION PROCESSING LETTERS, 2010, 111 (02) : 97 - 101
  • [25] Strategies for Fault-Tolerant Distributed Visualization
    Panagiotidis, Alexandros
    Reina, Guido
    Ertl, Thomas
    2014 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2014, : 286 - 290
  • [26] A Fault-tolerant model for tuple space coordination in distributed environments
    Kirti, Medha
    Maurya, Ashish Kumar
    Yadav, Rama Shankar
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (01)
  • [27] FT_HORB: A fault-tolerant distributed programming environment based on RMI
    Kim, S
    Hyun, M
    Yamakita, J
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (03) : 510 - 517
  • [28] Parameterized Model Checking of Fault-tolerant Distributed Algorithms by Abstraction
    John, Annu
    Konnov, Igor
    Schmid, Ulrich
    Veith, Helmut
    Widder, Josef
    2013 FORMAL METHODS IN COMPUTER-AIDED DESIGN (FMCAD), 2013, : 201 - 209
  • [29] Adaptive and technology-independent architecture for fault-tolerant distributed AAL solutions
    Schmidt, Michael
    Obermaisser, Roman
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 95 : 236 - 247
  • [30] Multi-Agent Fault-Tolerant Control Based on Distributed Adaptive Consensus
    Zhang, Pu
    Xue, Huifeng
    Gao, Shan
    IEEE ACCESS, 2019, 7 : 135882 - 135895