Smart Memory: Deep Learning Acceleration in 3D-Stacked Memories

Cited by: 0
Authors
Rezaei, Seyyed Hossein SeyyedAghaei [1]
Moghaddam, Parham Zilouchian [1]
Modarressi, Mehdi [1]
Affiliations
[1] Univ Tehran, Sch Elect & Comp Engn, Tehran 25529, Iran
Keywords
Network-on-memory; processing-in-memory; 3D-stacked memory; deep learning accelerator
DOI
10.1109/LCA.2023.3287976
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Processing-in-memory (PIM) is the most promising paradigm to address the bandwidth bottleneck in deep neural network (DNN) accelerators. However, the algorithmic and dataflow structure of DNNs still necessitates moving a large amount of data across banks inside the memory device to bring input data and their corresponding model parameters together, negatively shifting part of the bandwidth bottleneck to the in-memory data communication infrastructure. To alleviate this bottleneck, we present Smart Memory, a highly parallel in-memory DNN accelerator for 3D memories that benefits from a scalable high-bandwidth in-memory network. Whereas the existing PIM designs implement the compute units and network-on-chip on the logic die of the underlying 3D memory, in Smart Memory the computation and data transmission tasks are distributed across the memory banks. To this end, each memory bank is equipped with (1) a very simple processing unit to run neural networks, and (2) a circuit-switched router to interconnect memory banks by a 3D network-on-memory. Our evaluation shows 44% average performance improvement over state-of-the-art in-memory DNN accelerators.
Pages: 137-141
Number of pages: 5