A Case for Emerging Memories in DNN Accelerators

被引:0
|
作者
Mukherjee, Avilash [1 ]
Saurav, Kumar [2 ]
Nair, Prashant [1 ]
Shekhar, Sudip [1 ]
Lis, Mieszko [1 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] QUALCOMM India, Bengaluru, Karnataka, India
关键词
Machine-Learning; Convolutional Neural Networks; Non-Volatile Memories; PCM; RRAM; MRAM; STT-MRAM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The popularity of Deep Neural Networks (DNNs) has led to many DNN accelerator architectures, which typically focus on the on-chip storage and computation costs. However, much of the energy is spent on accesses to off-chip DRAM memory. While emerging resistive memory technologies such as MRAM, PCM, and RRAM can potentially reduce this energy component, they suffer from drawbacks such as low endurance that prevent them from being a DRAM replacement in DNN applications. In this paper, we examine how DNN accelerators can be designed to overcome these limitations and how emerging memories can be used for off-chip storage. We demonstrate that through (a) careful mapping of DNN computation to the accelerator and (b) a hybrid setup (both DRAM and an emerging memory), we can reduce inference energy over a DRAM-only design by a factor ranging from 1.12x on EfficientNetB7 to 6.3x on ResNet-50, while also increasing the endurance from 2 weeks to over a decade. As the energy benefits vary dramatically across DNN models, we also develop a simple analytical heuristic solely based on DNN model parameters that predicts the suitability of a given DNN for emerging-memory-based accelerators.
引用
收藏
页码:938 / 941
页数:4
相关论文
共 50 条
  • [21] Shaped Pruning for Efficient Memory Addressing in DNN Accelerators
    Woo, Yunhee
    Kim, Dongyoung
    Jeong, Jaemin
    Lee, Jeong-Gun
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,
  • [22] A DNN Protection Solution for PIM accelerators with Model Compression
    Zhao, Lei
    Zhang, Youtao
    Yang, Jun
    2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 320 - 325
  • [23] Special Session: Approximation and Fault Resiliency of DNN Accelerators
    Ahmadilivani, Mohammad Hasan
    Barbareschi, Mario
    Barone, Salvatore
    Bosio, Alberto
    Daneshtalab, Masoud
    Della Torca, Salvatore
    Gavarini, Gabriele
    Jenihhin, Maksim
    Raik, Jaan
    Ruospo, Annachiara
    Sanchez, Ernesto
    Taheri, Mahdi
    2023 IEEE 41ST VLSI TEST SYMPOSIUM, VTS, 2023,
  • [24] Lightning Talk: Efficiency and Programmability of DNN Accelerators and GPUs
    Ro, Won Woo
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [25] Fault Resilience Techniques for Flash Memory of DNN Accelerators
    Lu, Shyue-Kung
    Wu, Yu-Sheng
    Hong, Jin-Hua
    Miyase, Kohei
    2022 IEEE INTERNATIONAL TEST CONFERENCE IN ASIA (ITC-ASIA 2022), 2022, : 1 - 6
  • [26] NeuroSpector: Systematic Optimization of Dataflow Scheduling in DNN Accelerators
    Park, Chanho
    Kim, Bogil
    Ryu, Sungmin
    Song, William J.
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (08) : 2279 - 2294
  • [27] A survey on modeling and improving reliability of DNN algorithms and accelerators
    Mittal, Sparsh
    JOURNAL OF SYSTEMS ARCHITECTURE, 2020, 104
  • [28] AdaPT: Fast Emulation of Approximate DNN Accelerators in PyTorch
    Danopoulos, Dimitrios
    Zervakis, Georgios
    Siozios, Kostas
    Soudris, Dimitrios
    Henkel, Joerg
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2023, 42 (06) : 2074 - 2078
  • [29] Heterogeneous Dataflow Accelerators for Multi-DNN Workloads
    Kwon, Hyoukjun
    Lai, Liangzhen
    Pellauer, Michael
    Krishna, Tushar
    Chen, Yu-Hsin
    Chandra, Vikas
    2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021), 2021, : 71 - 83
  • [30] Fault Resilience Techniques for Flash Memory of DNN Accelerators
    Lu, Shyue-Kung
    Wu, Yu-Sheng
    Hong, Jin-Hua
    Miyase, Kohei
    2022 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2022, : 591 - 600