GPU-based First Aid for System Faults

被引:0
|
作者
Kimura, Kento [1 ]
Kourai, Kenichi [1 ]
机构
[1] Kyushu Inst Technol, Iizuka, Fukuoka, Japan
关键词
fault recovery; GPUs; signals; scheduling; deadlocks;
D O I
10.1145/3546591.3547526
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
It is difficult to completely avoid system failures in recent large-scale and complex systems. Therefore, it is important to detect system faults rapidly and accurately and recover from them. Fault recovery is categorized into external one from remote hosts and internal one with processes or the operating system (OS) inside a target system. However, both methods are subject to system faults. If fault recovery fails, a hardware reset is required and can lead to losing system data and states. This paper proposes GPUfas for recovering from system faults by indirectly controlling OS behavior from a GPU, which is not easily affected by system faults. GPUfas attempts fault recovery by rewriting OS data in main memory and leveraging the capabilities of the OS itself. For example, it can mimic signal sending and process scheduling to force termination of the processes that consume excessive resources. It can also mimic unlocking to recover from some kind of deadlock. We have implemented GPUfas using the Linux kernel, CUDA, and LLVM to enable a GPU to rewrite OS data transparently. Then, we confirmed the effectiveness and efficiency of fault recovery by GPUfas.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [22] A GPU-based DEM framework for simulation of polyhedral particulate system
    Liu, Guang-Yu
    Xu, Wen-Jie
    GRANULAR MATTER, 2023, 25 (02)
  • [23] On the performance of a GPU-based SoC in a distributed spatial audio system
    Jose A. Belloch
    José M. Badía
    Diego F. Larios
    Enrique Personal
    Miguel Ferrer
    Laura Fuster
    Mihaita Lupoiu
    Alberto Gonzalez
    Carlos León
    Antonio M. Vidal
    Enrique S. Quintana-Ortí
    The Journal of Supercomputing, 2021, 77 : 6920 - 6935
  • [24] A GPU-based DEM framework for simulation of polyhedral particulate system
    Guang-Yu Liu
    Wen-Jie Xu
    Granular Matter, 2023, 25
  • [25] GPU-based Acceleration of System-level Design Tasks
    Bordoloi, Unmesh D.
    Chakraborty, Samarjit
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2010, 38 (3-4) : 225 - 253
  • [26] A GPU-based hyperbolic SVD algorithm
    Novakovic, Vedran
    Singer, Sanja
    BIT NUMERICAL MATHEMATICS, 2011, 51 (04) : 1009 - 1030
  • [27] GPU-Based Detection of Stopping Vehicles
    Gamage, Tharindu
    Samarawickrama, Jayathu G.
    Pasqual, A. A.
    INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER2012), 2012, : 222 - 222
  • [28] GPU-based calculations in digital holography
    Madrigal, R.
    Acebal, P.
    Blaya, S.
    Carretero, L.
    Fimia, A.
    Serrano, F.
    HOLOGRAPHY: ADVANCES AND MODERN TRENDS III, 2013, 8776
  • [29] GPU-based Decompression for the 842 Algorithm
    Plauth, Max
    Polze, Andreas
    2019 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING WORKSHOPS (CANDARW 2019), 2019, : 97 - 102
  • [30] Implementation of a GPU-based CFD code
    Niksiar, Pooya
    Ashrafizadeh, Ali
    Shams, Mehrzad
    Madani, Amir Hossein
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), VOL 1, 2014, : 84 - 89