Evaluating single event upsets in deep neural networks for semantic segmentation: An embedded system perspective

被引：5

作者：

Gutierrez-Zaballa, Jon ^{[1
]}

Basterretxea, Koldo ^{[1
]}

Echanobe, Javier ^{[2
]}

机构：

[1] Univ Basque Country, Dept Elect Technol, Bilbao 48013, Spain

[2] Univ Basque Country, Dept Elect & Elect, Leioa 48940, Spain

来源：

JOURNAL OF SYSTEMS ARCHITECTURE | 2024年 / 154卷

关键词：

Single bit upsets; Robustness evaluation; Model compression; Embedded artificial intelligence; Semantic segmentation;

D O I：

10.1016/j.sysarc.2024.103242

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

As the deployment of artificial intelligence (AI) algorithms at edge devices becomes increasingly prevalent, enhancing the robustness and reliability of autonomous AI-based perception and decision systems is becoming as relevant as precision and performance, especially in applications areas considered safety-critical such as autonomous driving and aerospace. This paper delves into the robustness assessment in embedded Deep Neural Networks (DNNs), particularly focusing on the impact of parameter perturbations produced by single event upsets (SEUs) on convolutional neural networks (CNN) for image semantic segmentation. By scrutinizing the layer-by-layer and bit-by-bit sensitivity of various encoder-decoder models to soft errors, this study thoroughly investigates the vulnerability of segmentation DNNs to SEUs and evaluates the consequences of techniques like model pruning and parameter quantization on the robustness of compressed models aimed at embedded implementations. The findings offer valuable insights into the mechanisms underlying SEU-induced failures that allow for evaluating the robustness of DNNs once trained in advance. Moreover, based on the collected data, we propose a set of practical lightweight error mitigation techniques with no memory or computational cost suitable for resource-constrained deployments. The code used to perform the fault injection (FI) campaign is available at https://github.com/jonGuti13/TensorFI2, while the code to implement proposed techniques is available at https://github.com/jonGuti13/parameterProtection.

引用

页数：23

共 75 条

[1] A Systematic Literature Review on Hardware Reliability Assessment Methods for Deep Neural Networks [J].

Ahmadilivani, Mohammad Hasan ;

Taheri, Mahdi ;

Raik, Jaan ;

Daneshtalab, Masoud ;

Jenihhin, Maksim .

ACM COMPUTING SURVEYS, 2024, 56 (06)

[2]

AMD-Xilinx, 2023, Device reliability report. UG116 v10.17

[3]

Arechiga AP, 2018, IEEE HIGH PERF EXTR

[4]

Arechiga AP, 2018, 2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P190, DOI 10.1109/CCWC.2018.8301749

[5] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation [J].

Bach, Sebastian ;

Binder, Alexander ;

Montavon, Gregoire ;

Klauschen, Frederick ;

Mueller, Klaus-Robert ;

Samek, Wojciech .

PLOS ONE, 2015, 10 (07)

[6] Soft errors in advanced computer systems [J].

Baumann, R .

IEEE DESIGN & TEST OF COMPUTERS, 2005, 22 (03) :258-266

[7]

Bolchini C., 2022, 2022 IEEE INT S DEF, P1

[8]

Bosio A, 2019, 2019 20TH IEEE LATIN AMERICAN TEST SYMPOSIUM (LATS), DOI 10.1109/latw.2019.8704548

[9] Zero-Overhead Protection for CNN Weights [J].

Burel, Stephane ;

Evans, Adrian ;

Anghel, Lorena .

34TH IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT 2021), 2021,

[10]

BurelT Stephane, 2022, 2022 IEEE INT S DEF, P1

← 1 2 3 4 5 6 7 8 →