Reliability evaluation of FPGA based pruned neural networks

被引:8
作者
Gao, Zhen [1 ]
Yao, Yi [1 ]
Wei, Xiaohui [1 ]
Yan, Tong [1 ]
Zeng, Shulin [2 ]
Ge, Guangjun [2 ]
Wang, Yu [2 ]
Ullah, Anees [3 ]
Reviriego, Pedro [4 ]
机构
[1] Tianjin Univ, Tianjin 300072, Peoples R China
[2] Tsinghua Univ, Sch Elect Engn, Beijing 100084, Peoples R China
[3] Univ Engn & Technol, Peshawar 220101, Abbottabad, Pakistan
[4] Univ Carlos III Madrid, Leganes 28911, Spain
基金
中国国家自然科学基金;
关键词
Convolutional Neural Networks (CNNs); Pruning; Reliability; FPGAs; Fault injection; RADIATION;
D O I
10.1016/j.microrel.2022.114498
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional Neural Networks (CNNs) are widely used for image classification. To fit the implementation of CNNs on resource-limited systems like FPGAs, pruning is a popular technique to reduce the complexity. In this paper, the robustness of the pruned CNNs against errors on weights and configuration memory of the FPGA accelerator is evaluated with VGG16 as a case study, and two popular pruning methods (magnitude-based and filter pruning) are considered. In particular, the accuracy loss of the original VGG16 and the ones with different pruning rates is tested based on fault injection experiments, and the results show that the effect of errors on weights and configuration memories are different for the two pruning methods. For errors on weights, the networks pruned using both methods demonstrate higher reliability with higher pruning rates, but the ones using filter pruning are relatively less reliable. For errors on configuration memory, errors on about 30% of the configuration bits will affect the CNN operation, and only 14% of them will introduce significant accuracy loss. However, the effect of the same critical bits is different for the two pruning methods. The pruned networks using magnitude-based method are less reliable than the original VGG16, but the ones using filter pruning are more reliable than the original VGG16. The different effects are explained based on the structure of the CNN accelerator and the properties of the two pruning methods. The impact of quantization on the CNN reliability is also evaluated for the magnitude-based pruning method.
引用
收藏
页数:11
相关论文
共 38 条
[1]  
Arechiga AP, 2018, IEEE HIGH PERF EXTR
[2]  
Arechiga AP, 2018, 2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), P190, DOI 10.1109/CCWC.2018.8301749
[3]   SATELLITE ANOMALIES FROM GALACTIC COSMIC-RAYS [J].
BINDER, D ;
SMITH, EC ;
HOLMAN, AB .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 1975, 22 (06) :2675-2680
[4]  
Bosio A, 2019, 2019 20TH IEEE LATIN AMERICAN TEST SYMPOSIUM (LATS), DOI 10.1109/latw.2019.8704548
[5]   Current and Future Challenges in Radiation Effects on CMOS Electronics [J].
Dodd, P. E. ;
Shaneyfelt, M. R. ;
Schwank, J. R. ;
Felix, J. A. .
IEEE TRANSACTIONS ON NUCLEAR SCIENCE, 2010, 57 (04) :1747-1763
[6]  
Fabio B., 2018, 2018 IEEE 19 LAT AM, P1
[7]  
Gao Z., 2020, 33 IEEE INT S DEF FA
[8]  
github.com, ALUMLUTHER
[9]  
Goldstein B. F., 2020 IEEE 11 LAT AM
[10]  
Guo K., 2018, IEEE T COMPUT AIDED, V37