Error Resilient In-Memory Computing Architecture for CNN Inference on the Edge

被引：3

作者：

Rios, Marco ^{[1
]}

Ponzina, Flavio ^{[1
]}

Ansaloni, Giovanni ^{[1
]}

Levisse, Alexandre ^{[1
]}

Atienza, David ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne EPFL, Embedded Syst Lab, Lausanne, Switzerland

来源：

PROCEEDINGS OF THE 32ND GREAT LAKES SYMPOSIUM ON VLSI 2022, GLSVLSI 2022 | 2022年

关键词：

In-Memory Computing; Fault Tolerant Architectures; Deep Neural Networks;

D O I：

10.1145/3526241.3530351

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The growing popularity of edge computing has fostered the development of diverse solutions to support Artificial Intelligence (AI) in energy-constrained devices. Nonetheless, comparatively few efforts have focused on the resiliency exhibited by AI workloads (such as Convolutional Neural Networks, CNNs) as an avenue towards increasing their run-time efficiency, and even fewer have proposed strategies to increase such resiliency. We herein address this challenge in the context of Bit-line Computing architectures, an embodiment of the in-memory computing paradigm tailored towards CNN applications. We show that little additional hardware is required to add highly effective error detection and mitigation in such platforms. In turn, our proposed scheme can cope with high error rates when performing memory accesses with no impact on CNNs accuracy, allowing for very aggressive voltage scaling. Complementary, we also show that CNN resiliency can be increased by algorithmic optimizations in addition to architectural ones, adopting a combined ensembling and pruning strategy that increases robustness while not inflating workload requirements. Experiments on different quantized CNN models reveal that our combined hardware/software approach enables the supply voltage to be reduced to just 650mV, decreasing the energy per inference up to 51.3%, without affecting the baseline CNN classification accuracy.

引用

页码：249 / 254

页数：6

共 50 条

[1] Digital In-Memory Computing to Accelerate Deep Learning Inference on the Edge
Perri, Stefania
Zambelli, Cristian
Ielmini, Daniele
Silvano, Cristina
2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 130 - 133
[2] Temperature-Resilient RRAM-Based In-Memory Computing for DNN Inference
Meng, Jian
Shim, Wonbo
Yang, Li
Yeo, Injune
Fan, Deliang
Yu, Shimeng
Seo, Jae-sun
IEEE MICRO, 2022, 42 (01) : 89 - 98
[3] LUTIC: A CRAM-based Architecture for Power Failure Resilient In-Memory Computing
Akhunov, Khakim
Yildirim, Kasim Sinan
2023 26TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, DDECS, 2023, : 69 - 72
[4] IMCE: An In-Memory Computing and Encrypting Hardware Architecture for Robust Edge Security
Shao, Hanyong
Fu, Boyi
Yang, Jinghao
Li, Wenpu
Su, Chang
Fu, Zhiyuan
Tango, Kechao
Huang, Ru
2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
[5] High-Accuracy Spintronic Approximate Compressors for Error-Resilient In-Memory Computing
Eghlimi, Yasin
Moaiyeri, Mohammad Hossein
Ahmadinejad, Mohammad
SPIN, 2022, 12 (01)
[6] A 12T SRAM in-Memory Computing differential current architecture for CNN implementations
Domenech-Asensi, Gines
Ruiz-Merino, Ramon
Zapata-Perez, Juan
Diaz-Madrid, Jose A.
2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
[7] Neural in-memory checksums: an error detection and correction technique for safe in-memory inference
Parrini, Luca
Soliman, Taha
Hettwer, Benjamin
de la Parra, Cecilia
Borrmann, Jan Micha
Singh, Simranjeet
Bende, Ankit
Rana, Vikas
Merchant, Farhad
Wehn, Norbert
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2025, 383 (2288):
[8] Linear Error Correction Codec Implementation Based on an In-Memory Computing Architecture for Nonvolatile Memories
Luo, Lichuan
Liu, Xiao
Jiang, Linjun
Zhang, He
Zhang, Youguang
Liu, Dijun
Kang, Wang
IEEE TRANSACTIONS ON ELECTRON DEVICES, 2022, 69 (06) : 3455 - 3461
[9] Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems
Choong, Benjamin Chen Ming
Luo, Tao
Liu, Cheng
He, Bingsheng
Zhang, Wei
Zhou, Joey Tianyi
JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 128
[10] An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices
Elbtity, Mohammed
Singh, Abhishek
Reidy, Brendan
Guo, Xiaochen
Zand, Ramtin
2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 188 - 193

← 1 2 3 4 5 →