A Case for Emerging Memories in DNN Accelerators

被引:0
|
作者
Mukherjee, Avilash [1 ]
Saurav, Kumar [2 ]
Nair, Prashant [1 ]
Shekhar, Sudip [1 ]
Lis, Mieszko [1 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] QUALCOMM India, Bengaluru, Karnataka, India
关键词
Machine-Learning; Convolutional Neural Networks; Non-Volatile Memories; PCM; RRAM; MRAM; STT-MRAM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The popularity of Deep Neural Networks (DNNs) has led to many DNN accelerator architectures, which typically focus on the on-chip storage and computation costs. However, much of the energy is spent on accesses to off-chip DRAM memory. While emerging resistive memory technologies such as MRAM, PCM, and RRAM can potentially reduce this energy component, they suffer from drawbacks such as low endurance that prevent them from being a DRAM replacement in DNN applications. In this paper, we examine how DNN accelerators can be designed to overcome these limitations and how emerging memories can be used for off-chip storage. We demonstrate that through (a) careful mapping of DNN computation to the accelerator and (b) a hybrid setup (both DRAM and an emerging memory), we can reduce inference energy over a DRAM-only design by a factor ranging from 1.12x on EfficientNetB7 to 6.3x on ResNet-50, while also increasing the endurance from 2 weeks to over a decade. As the energy benefits vary dramatically across DNN models, we also develop a simple analytical heuristic solely based on DNN model parameters that predicts the suitability of a given DNN for emerging-memory-based accelerators.
引用
收藏
页码:938 / 941
页数:4
相关论文
共 50 条
  • [41] Compiler-assisted Operator Template Library for DNN Accelerators
    Li, Jiansong
    Cao, Wei
    Dong, Xiao
    Li, Guangli
    Wang, Xueying
    Zhao, Peng
    Liu, Lei
    Feng, Xiaobing
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 49 (05) : 628 - 645
  • [42] Klotski: DNN Model Orchestration Framework for Dataflow Architecture Accelerators
    Bai, Chen
    Wei, Xuechao
    Zhuo, Youwei
    Cai, Yi
    Zheng, Hongzhong
    Yu, Bei
    Xie, Yuan
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
  • [43] A Uniform Latency Model for DNN Accelerators with Diverse Architectures and Dataflows
    Mei, Linyan
    Liu, Huichu
    Wu, Tony
    Sumbul, H. Ekin
    Verhelst, Marian
    Beigne, Edith
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 220 - 225
  • [44] Analysis and Mitigation of DRAM Faults in Sparse-DNN Accelerators
    Kundu, Shamik
    Raha, Arnab
    Banerjee, Suvadeep
    Natarajan, Suriyaprakash
    Basu, Kanad
    IEEE DESIGN & TEST, 2023, 40 (02) : 90 - 99
  • [45] Precision and Performance-Aware Voltage Scaling in DNN Accelerators
    Rathore, Mallika
    Milder, Peter
    Salman, Emre
    PROCEEDINGS OF THE GREAT LAKES SYMPOSIUM ON VLSI 2023, GLSVLSI 2023, 2023, : 237 - 242
  • [46] A SIMD Dynamic Fixed Point Processing Engine for DNN Accelerators
    Raut, Gopal
    Edavoor, Pranose J.
    Selvakumar, David
    Thakur, Ritambhara
    2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
  • [47] A Fast Precision Tuning Solution for Always-On DNN Accelerators
    Wang, Ying
    He, Yintao
    Cheng, Long
    Li, Huawei
    Li, Xiaowei
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (05) : 1236 - 1248
  • [48] Architecting Decentralization and Customizability in DNN Accelerators for Hardware Defect Adaptation
    Ozen, Elbruz
    Orailoglu, Alex
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 3934 - 3945
  • [49] Benchmarking DNN Mapping Methods for the in-Memory Computing Accelerators
    Wang, Yimin
    Fong, Xuanyao
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2023, 13 (04) : 1040 - 1051
  • [50] An Error Compensation Technique for Low-Voltage DNN Accelerators
    Ji, Daehan
    Shin, Dongyeob
    Park, Jongsun
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (02) : 397 - 408