A Case for Emerging Memories in DNN Accelerators

被引：0

作者：

Mukherjee, Avilash ^{[1
]}

Saurav, Kumar ^{[2
]}

Nair, Prashant ^{[1
]}

Shekhar, Sudip ^{[1
]}

Lis, Mieszko ^{[1
]}

机构：

[1] Univ British Columbia, Vancouver, BC, Canada

[2] QUALCOMM India, Bengaluru, Karnataka, India

来源：

PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021) | 2021年

关键词：

Machine-Learning; Convolutional Neural Networks; Non-Volatile Memories; PCM; RRAM; MRAM; STT-MRAM;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The popularity of Deep Neural Networks (DNNs) has led to many DNN accelerator architectures, which typically focus on the on-chip storage and computation costs. However, much of the energy is spent on accesses to off-chip DRAM memory. While emerging resistive memory technologies such as MRAM, PCM, and RRAM can potentially reduce this energy component, they suffer from drawbacks such as low endurance that prevent them from being a DRAM replacement in DNN applications. In this paper, we examine how DNN accelerators can be designed to overcome these limitations and how emerging memories can be used for off-chip storage. We demonstrate that through (a) careful mapping of DNN computation to the accelerator and (b) a hybrid setup (both DRAM and an emerging memory), we can reduce inference energy over a DRAM-only design by a factor ranging from 1.12x on EfficientNetB7 to 6.3x on ResNet-50, while also increasing the endurance from 2 weeks to over a decade. As the energy benefits vary dramatically across DNN models, we also develop a simple analytical heuristic solely based on DNN model parameters that predicts the suitability of a given DNN for emerging-memory-based accelerators.

引用

页码：938 / 941

页数：4

共 50 条

[41] Compiler-assisted Operator Template Library for DNN Accelerators
Li, Jiansong
Cao, Wei
Dong, Xiao
Li, Guangli
Wang, Xueying
Zhao, Peng
Liu, Lei
Feng, Xiaobing
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 49 (05) : 628 - 645
[42] Klotski: DNN Model Orchestration Framework for Dataflow Architecture Accelerators
Bai, Chen
Wei, Xuechao
Zhuo, Youwei
Cai, Yi
Zheng, Hongzhong
Yu, Bei
Xie, Yuan
2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
[43] A Uniform Latency Model for DNN Accelerators with Diverse Architectures and Dataflows
Mei, Linyan
Liu, Huichu
Wu, Tony
Sumbul, H. Ekin
Verhelst, Marian
Beigne, Edith
PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 220 - 225
[44] Analysis and Mitigation of DRAM Faults in Sparse-DNN Accelerators
Kundu, Shamik
Raha, Arnab
Banerjee, Suvadeep
Natarajan, Suriyaprakash
Basu, Kanad
IEEE DESIGN & TEST, 2023, 40 (02) : 90 - 99
[45] Precision and Performance-Aware Voltage Scaling in DNN Accelerators
Rathore, Mallika
Milder, Peter
Salman, Emre
PROCEEDINGS OF THE GREAT LAKES SYMPOSIUM ON VLSI 2023, GLSVLSI 2023, 2023, : 237 - 242
[46] A SIMD Dynamic Fixed Point Processing Engine for DNN Accelerators
Raut, Gopal
Edavoor, Pranose J.
Selvakumar, David
Thakur, Ritambhara
2024 25TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED 2024, 2024,
[47] A Fast Precision Tuning Solution for Always-On DNN Accelerators
Wang, Ying
He, Yintao
Cheng, Long
Li, Huawei
Li, Xiaowei
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (05) : 1236 - 1248
[48] Architecting Decentralization and Customizability in DNN Accelerators for Hardware Defect Adaptation
Ozen, Elbruz
Orailoglu, Alex
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 3934 - 3945
[49] Benchmarking DNN Mapping Methods for the in-Memory Computing Accelerators
Wang, Yimin
Fong, Xuanyao
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2023, 13 (04) : 1040 - 1051
[50] An Error Compensation Technique for Low-Voltage DNN Accelerators
Ji, Daehan
Shin, Dongyeob
Park, Jongsun
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (02) : 397 - 408

← 1 2 3 4 5 →