A Case for Emerging Memories in DNN Accelerators

被引:0
|
作者
Mukherjee, Avilash [1 ]
Saurav, Kumar [2 ]
Nair, Prashant [1 ]
Shekhar, Sudip [1 ]
Lis, Mieszko [1 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] QUALCOMM India, Bengaluru, Karnataka, India
关键词
Machine-Learning; Convolutional Neural Networks; Non-Volatile Memories; PCM; RRAM; MRAM; STT-MRAM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The popularity of Deep Neural Networks (DNNs) has led to many DNN accelerator architectures, which typically focus on the on-chip storage and computation costs. However, much of the energy is spent on accesses to off-chip DRAM memory. While emerging resistive memory technologies such as MRAM, PCM, and RRAM can potentially reduce this energy component, they suffer from drawbacks such as low endurance that prevent them from being a DRAM replacement in DNN applications. In this paper, we examine how DNN accelerators can be designed to overcome these limitations and how emerging memories can be used for off-chip storage. We demonstrate that through (a) careful mapping of DNN computation to the accelerator and (b) a hybrid setup (both DRAM and an emerging memory), we can reduce inference energy over a DRAM-only design by a factor ranging from 1.12x on EfficientNetB7 to 6.3x on ResNet-50, while also increasing the endurance from 2 weeks to over a decade. As the energy benefits vary dramatically across DNN models, we also develop a simple analytical heuristic solely based on DNN model parameters that predicts the suitability of a given DNN for emerging-memory-based accelerators.
引用
收藏
页码:938 / 941
页数:4
相关论文
共 50 条
  • [1] A Model-to-Circuit Compiler for Evaluation of DNN Accelerators based on Systolic Arrays and Multibit Emerging Memories
    Knoedtel, Johannes
    Fritscher, Markus
    Reiser, Daniel
    Fey, Dietmar
    Breiling, Marco
    Reichenbach, Marc
    2020 9TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2020,
  • [2] GoldenEye: A Platform for Evaluating Emerging Numerical Data Formats in DNN Accelerators
    Mahmoud, Abdulrahman
    Tambe, Thierry
    Aloui, Tarek
    Brooks, David
    Wei, Gu-Yeon
    2022 52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2022), 2022, : 206 - 214
  • [3] Energy Profiling of DNN Accelerators
    Wess, Matthias
    Dallinger, Dominik
    Schnoell, Daniel
    Bittner, Matthias
    Goetzinger, Maximilian
    Jantsch, Axel
    2023 26TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN, DSD 2023, 2023, : 53 - 60
  • [4] CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories
    Shi, Man
    Colleman, Steven
    VanDeMieroop, Charlotte
    Joseph, Antony
    Meijer, Maurice
    Dehaene, Wim
    Verhelst, Marian
    2023 24TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED, 2023, : 172 - 179
  • [5] Reliable Brain-inspired AI Accelerators using Classical and Emerging Memories
    Yayla, Mikail
    Thomann, Simon
    Islam, Md Mazharul
    Wei, Ming-Liang
    Ho, Shu-Yin
    Aziz, Ahmedullah
    Yang, Chia-Lin
    Chen, Jian-Jia
    Amrouch, Hussam
    2023 IEEE 41ST VLSI TEST SYMPOSIUM, VTS, 2023,
  • [6] Analog Weights in ReRAM DNN Accelerators
    Eshraghian, Jason K.
    Kang, Sung-Mo
    Baek, Seungbum
    Orchard, Garrick
    Iu, Herbert Ho-Ching
    Lei, Wen
    2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2019), 2019, : 267 - 271
  • [7] Rapid Emulation of Approximate DNN Accelerators
    Farahbakhsh, Amirreza
    Hosseini, Seyedmehdi
    Kachuee, Sajjad
    Sharilkhani, Mohammad
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [8] Control Variate Approximation for DNN Accelerators
    Zervakis, Georgios
    Spantidi, Ourania
    Anagnostopoulos, Iraklis
    Amrouch, Hussam
    Henkel, Joerg
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 481 - 486
  • [9] Soft errors in DNN accelerators: A comprehensive review
    Ibrahim, Younis
    Wang, Haibin
    Liu, Junyang
    Wei, Jinghe
    Chen, Li
    Rech, Paolo
    Adam, Khalid
    Guo, Gang
    MICROELECTRONICS RELIABILITY, 2020, 115 (115)
  • [10] Targeting DNN Inference Via Efficient Utilization of Heterogeneous Precision DNN Accelerators
    Spantidi, Ourania
    Zervakis, Georgios
    Alsalamin, Sami
    Roman-Ballesteros, Isai
    Henkel, Joerg
    Amrouch, Hussam
    Anagnostopoulos, Iraklis
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2023, 11 (01) : 112 - 125