Error Resilient In-Memory Computing Architecture for CNN Inference on the Edge

被引:3
|
作者
Rios, Marco [1 ]
Ponzina, Flavio [1 ]
Ansaloni, Giovanni [1 ]
Levisse, Alexandre [1 ]
Atienza, David [1 ]
机构
[1] Ecole Polytech Fed Lausanne EPFL, Embedded Syst Lab, Lausanne, Switzerland
关键词
In-Memory Computing; Fault Tolerant Architectures; Deep Neural Networks;
D O I
10.1145/3526241.3530351
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The growing popularity of edge computing has fostered the development of diverse solutions to support Artificial Intelligence (AI) in energy-constrained devices. Nonetheless, comparatively few efforts have focused on the resiliency exhibited by AI workloads (such as Convolutional Neural Networks, CNNs) as an avenue towards increasing their run-time efficiency, and even fewer have proposed strategies to increase such resiliency. We herein address this challenge in the context of Bit-line Computing architectures, an embodiment of the in-memory computing paradigm tailored towards CNN applications. We show that little additional hardware is required to add highly effective error detection and mitigation in such platforms. In turn, our proposed scheme can cope with high error rates when performing memory accesses with no impact on CNNs accuracy, allowing for very aggressive voltage scaling. Complementary, we also show that CNN resiliency can be increased by algorithmic optimizations in addition to architectural ones, adopting a combined ensembling and pruning strategy that increases robustness while not inflating workload requirements. Experiments on different quantized CNN models reveal that our combined hardware/software approach enables the supply voltage to be reduced to just 650mV, decreasing the energy per inference up to 51.3%, without affecting the baseline CNN classification accuracy.
引用
收藏
页码:249 / 254
页数:6
相关论文
共 50 条
  • [1] Digital In-Memory Computing to Accelerate Deep Learning Inference on the Edge
    Perri, Stefania
    Zambelli, Cristian
    Ielmini, Daniele
    Silvano, Cristina
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 130 - 133
  • [2] Temperature-Resilient RRAM-Based In-Memory Computing for DNN Inference
    Meng, Jian
    Shim, Wonbo
    Yang, Li
    Yeo, Injune
    Fan, Deliang
    Yu, Shimeng
    Seo, Jae-sun
    IEEE MICRO, 2022, 42 (01) : 89 - 98
  • [3] LUTIC: A CRAM-based Architecture for Power Failure Resilient In-Memory Computing
    Akhunov, Khakim
    Yildirim, Kasim Sinan
    2023 26TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, DDECS, 2023, : 69 - 72
  • [4] IMCE: An In-Memory Computing and Encrypting Hardware Architecture for Robust Edge Security
    Shao, Hanyong
    Fu, Boyi
    Yang, Jinghao
    Li, Wenpu
    Su, Chang
    Fu, Zhiyuan
    Tango, Kechao
    Huang, Ru
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [5] High-Accuracy Spintronic Approximate Compressors for Error-Resilient In-Memory Computing
    Eghlimi, Yasin
    Moaiyeri, Mohammad Hossein
    Ahmadinejad, Mohammad
    SPIN, 2022, 12 (01)
  • [6] A 12T SRAM in-Memory Computing differential current architecture for CNN implementations
    Domenech-Asensi, Gines
    Ruiz-Merino, Ramon
    Zapata-Perez, Juan
    Diaz-Madrid, Jose A.
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [7] Neural in-memory checksums: an error detection and correction technique for safe in-memory inference
    Parrini, Luca
    Soliman, Taha
    Hettwer, Benjamin
    de la Parra, Cecilia
    Borrmann, Jan Micha
    Singh, Simranjeet
    Bende, Ankit
    Rana, Vikas
    Merchant, Farhad
    Wehn, Norbert
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2025, 383 (2288):
  • [8] Linear Error Correction Codec Implementation Based on an In-Memory Computing Architecture for Nonvolatile Memories
    Luo, Lichuan
    Liu, Xiao
    Jiang, Linjun
    Zhang, He
    Zhang, Youguang
    Liu, Dijun
    Kang, Wang
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2022, 69 (06) : 3455 - 3461
  • [9] Hardware-software co-exploration with racetrack memory based in-memory computing for CNN inference in embedded systems
    Choong, Benjamin Chen Ming
    Luo, Tao
    Liu, Cheng
    He, Bingsheng
    Zhang, Wei
    Zhou, Joey Tianyi
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 128
  • [10] An In-Memory Analog Computing Co-Processor for Energy-Efficient CNN Inference on Mobile Devices
    Elbtity, Mohammed
    Singh, Abhishek
    Reidy, Brendan
    Guo, Xiaochen
    Zand, Ramtin
    2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 188 - 193