Training-Free Stuck-At Fault Mitigation for ReRAM-Based Deep Learning Accelerators

被引:4
作者
Quan, Chenghao [1 ]
Fouda, Mohammed E. [2 ]
Lee, Sugil [1 ]
Jung, Giju [1 ]
Lee, Jongeun [1 ]
Eltawil, Ahmed E. [3 ]
Kurdahi, Fadi [2 ]
机构
[1] Ulsan Natl Inst Sci & Technol, Dept Elect Engn, Ulsan 44919, South Korea
[2] Univ Calif Irvine, Ctr Embedded & Cyber Phys Syst, Irvine, CA 92697 USA
[3] King Abdullah Univ Sci & Technol, CEMSE Div, Thuwal 23955, Saudi Arabia
关键词
Accelerator; artificial neural network; batch normalization (BN); ReRAM crossbar array; stuck-at fault (SAF); IR-DROP; EFFICIENT;
D O I
10.1109/TCAD.2022.3222288
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Although Resistive RAMs can support highly efficient matrix-vector multiplication, which is very useful for machine learning and other applications, the nonideal behavior of hardware, such as stuck-at fault (SAF) and IR drop is an important concern in making ReRAM crossbar array-based deep learning accelerators. Previous work has addressed the nonideality problem through either redundancy in hardware, which requires a permanent increase of hardware cost, or software retraining, which may be even more costly or unacceptable due to its need for a training dataset as well as high computation overhead. In this article, we propose a very lightweight method that can be applied on top of existing hardware or software solutions. Our method, called forward-parameter tuning (FPT), takes advantage of a certain statistical property existing in the activation data of neural network layers, and can mitigate the impact of mild nonidealities in ReRAM crossbar arrays (RCAs) for deep learning applications without using any hardware, a dataset, or gradient-based training. Our experimental results using MNIST, CIFAR-10, and CIFAR-100, and ImageNet datasets in binary and multibit networks demonstrate that our technique is very effective, both alone and together with previous methods, up to 20% fault rate, which is higher than even some of the previous remapping methods. We also evaluate our method in the presence of other nonidealities, such as variability and IR drop. Furthermore, we provide an analysis based on the concept of the effective fault rate (EFR), which not only demonstrates that EFR can be a useful tool to predict the accuracy of faulty RCA-based neural networks but also explains why mitigating the SAF problem is more difficult with multibit neural networks.
引用
收藏
页码:2174 / 2186
页数:13
相关论文
共 38 条
[1]  
AZAMAT A, 2021, INT C COMPUTER AIDED, P1
[2]   Mitigating Asymmetric Nonlinear Weight Update Effects in Hardware Neural Network Based on Analog Resistive Synapse [J].
Chang, Chih-Cheng ;
Chen, Pin-Chun ;
Chou, Teyuh ;
Wang, I-Ting ;
Hudec, Boris ;
Chang, Che-Chia ;
Tsai, Chia-Ming ;
Chang, Tian-Sheuan ;
Hou, Tuo-Hung .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2018, 8 (01) :116-124
[3]   Accurate Inference With Inaccurate RRAM Devices: A Joint Algorithm-Design Solution [J].
Charan, Gouranga ;
Mohanty, Abinash ;
Du, Xiaocong ;
Krishnan, Gokul ;
Joshi, Rajiv V. ;
Cao, Yu .
IEEE JOURNAL ON EXPLORATORY SOLID-STATE COMPUTATIONAL DEVICES AND CIRCUITS, 2020, 6 (01) :27-35
[4]   RRAM Defect Modeling and Failure Analysis Based on March Test and a Novel Squeeze-Search Scheme [J].
Chen, Ching-Yi ;
Shih, Hsiu-Chuan ;
Wu, Cheng-Wen ;
Lin, Chih-He ;
Chiu, Pi-Feng ;
Sheu, Shyh-Shyuan ;
Chen, Frederick T. .
IEEE TRANSACTIONS ON COMPUTERS, 2015, 64 (01) :180-190
[5]  
Chen LR, 2017, DES AUT TEST EUROPE, P19, DOI 10.23919/DATE.2017.7926952
[6]  
Courbariaux M, 2016, Arxiv, DOI arXiv:1602.02830
[7]   IR-QNN Framework: An IR Drop-Aware Offline Training of Quantized Crossbar Arrays [J].
Fouda, Mohammed E. ;
Lee, Sugil ;
Lee, Jongeun ;
Kim, Gun Hwan ;
Kurdahi, Fadi ;
Eltawi, Ahmed M. .
IEEE ACCESS, 2020, 8 :228392-228408
[8]   Mask Technique for Fast and Efficient Training of Binary Resistive Crossbar Arrays [J].
Fouda, Mohammed E. ;
Lee, Sugil ;
Lee, Jongeun ;
Eltawil, Ahmed ;
Kurdahi, Fadi .
IEEE TRANSACTIONS ON NANOTECHNOLOGY, 2019, 18 :704-716
[9]   Modeling and Analysis of Passive Switching Crossbar Arrays [J].
Fouda, Mohammed E. ;
Eltawil, Ahmed M. ;
Kurdahi, Fadi .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2018, 65 (01) :270-282
[10]   Noise Injection Adaption: End-to-End ReRAM Crossbar Non-ideal Effect Adaption for Neural Network Mapping [J].
He, Zhezhi ;
Lin, Jie ;
Ewetz, Rickard ;
Yuan, Jiann-Shiun ;
Fan, Deliang .
PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,