Engineering Adaptive Fault-Tolerance Mechanisms for Resilient Computing on ROS

被引:9
|
作者
Lauer, Michael [2 ]
Amy, Matthieu [3 ]
Fabre, Jean-Charles [3 ]
Roy, Matthieu [1 ]
Excoffon, William [3 ]
Stoicescu, Miruna [4 ]
机构
[1] CNRS, LAAS, Ave Colonel Roche, F-31400 Toulouse, France
[2] Univ Toulouse, UPS, LAAS, F-31400 Toulouse, France
[3] Univ Toulouse, INP, LAAS, F-31400 Toulouse, France
[4] ESA, ESOC, Darmstadt, Germany
来源
2016 IEEE 17TH INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING (HASE) | 2016年
关键词
D O I
10.1109/HASE.2016.30
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Systems are expected to evolve during their service life in order to cope with changes of various natures, ranging from fluctuations in available resources to additional features requested by users. For dependable embedded systems, the challenge is even greater, as evolution must not impair dependability attributes. Resilient computing implies maintaining dependability properties when facing changes. Resilience encompasses several aspects, among which evolvability, i.e., the capacity of a system to evolve during its service life. In this paper, we discuss the evolution of systems with respect to their dependability mechanisms, and show how such mechanisms can evolve accordingly. From a component-based approach that enables to clarify the concepts, the process and the techniques to be used to address resilient computing, in particular regarding the adaptation of fault tolerance (or safety) mechanisms, we show how Adaptive Fault Tolerance (AFT) can be implemented with ROS. Beyond implementation, we draw the lessons learned from this work and discuss the limits of this runtime support to implement such resilient computing features in embedded systems.
引用
收藏
页码:94 / 101
页数:8
相关论文
共 50 条
  • [31] BoboCEP: Distributed Complex Event Processing for Resilient Fault-Tolerance Support in IoT
    Power, Alexander
    Kotonya, Gerald
    2020 IEEE SIXTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (BIGDATASERVICE 2020), 2020, : 110 - 113
  • [32] An Adaptive Fault-tolerance Scheme for Distributed Load Balancing Systems
    Liu, Dan
    De Grande, Robson E.
    Boukerche, Azzedine
    48TH ANNUAL SIMULATION SYMPOSIUM (ANSS 2015), 2015, : 138 - 145
  • [33] Provisioning Spot Instances Without Employing Fault-Tolerance Mechanisms
    Alourani, Abdullah
    Kshemkalyani, Ajay D.
    2020 19TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC 2020), 2020, : 126 - 133
  • [34] Adaptive Application Scaling for Improving Fault-Tolerance and Availability in the Cloud
    Radhakrishnan, Ganesan
    BELL LABS TECHNICAL JOURNAL, 2012, 17 (02) : 5 - 14
  • [35] Fault-tolerance mechanisms for a parallel programming system - A responsiveness perspective
    Karl, H
    COMMUNICATION-BASED SYSTEMS, 2000, : 43 - 54
  • [36] Fault-Tolerance Mechanisms for Software-Defined Internet of Vehicles
    Cheng, Chien-Fu
    Srivastava, Gautam
    Lin, Jerry Chun-Wei
    Lin, Ying-Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (06) : 3859 - 3868
  • [37] Multiple Fault-tolerance Mechanisms in Cloud Systems: a Systematic Review
    Marcotte, Philippe
    Gregoire, Frederic
    Petrillo, Fabio
    2019 IEEE 30TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW 2019), 2019, : 414 - 421
  • [38] A comparative cost analysis of fault-tolerance mechanisms for availability on the cloud
    Sampaio, Altino M.
    Barbosa, Jorge G.
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2018, 19 : 315 - 323
  • [39] DDGrid: A Grid Computing Environment with Massive Concurrency and Fault-tolerance Support
    Wang, Yongjian
    Luan, Zhongzhi
    Qian, Depei
    Huang, Yuanqiang
    Chen, Ting
    Han, Biao
    Ren, Yinan
    Yu, Kunqian
    Jiang, Hualiang
    GCC 2008: SEVENTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2008, : 5 - +
  • [40] ON FAULT-TOLERANCE AND FAULT-AVOIDANCE
    REGULINSKI, TLD
    IEEE TRANSACTIONS ON RELIABILITY, 1987, 36 (02) : 161 - 161