Smart Memory: Deep Learning Acceleration in 3D-Stacked Memories

被引:0
|
作者
Rezaei, Seyyed Hossein SeyyedAghaei [1 ]
Moghaddam, Parham Zilouchian [1 ]
Modarressi, Mehdi [1 ]
机构
[1] Univ Tehran, Sch Elect & Comp Engn, Tehran 25529, Iran
关键词
Network-on-memory; processing-in-memory; 3D-stacked memory; deep learning accelerator;
D O I
10.1109/LCA.2023.3287976
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Processing-in-memory (PIM) is the most promising paradigm to address the bandwidth bottleneck in deep neural network (DNN) accelerators. However, the algorithmic and dataflow structure of DNNs still necessitates moving a large amount of data across banks inside the memory device to bring input data and their corresponding model parameters together, negatively shifting part of the bandwidth bottleneck to the in-memory data communication infrastructure. To alleviate this bottleneck, we present Smart Memory, a highly parallel in-memory DNN accelerator for 3D memories that benefits from a scalable high-bandwidth in-memory network. Whereas the existing PIM designs implement the compute units and network-on-chip on the logic die of the underlying 3D memory, in Smart Memory the computation and data transmission tasks are distributed across the memory banks. To this end, each memory bank is equipped with (1) a very simple processing unit to run neural networks, and (2) a circuit-switched router to interconnect memory banks by a 3D network-on-memory. Our evaluation shows 44% average performance improvement over state-of-the-art in-memory DNN accelerators.
引用
收藏
页码:137 / 141
页数:5
相关论文
共 50 条
  • [41] Thermal Management for 3D-Stacked Systems via Unified Core-Memory Power Regulation
    Shen, Yixian
    Schreuders, Leo
    Pathania, Anuj
    Pimentel, Andy D.
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (05)
  • [42] A 3D-Stacked Logic-in-Memory Accelerator for Application-Specific Data Intensive Computing
    Zhu, Qiuling
    Akin, Berkin
    Sumbul, H. Ekin
    Sadi, Fazle
    Hoe, James C.
    Pileggi, Larry
    Franchetti, Franz
    2013 IEEE INTERNATIONAL 3D SYSTEMS INTEGRATION CONFERENCE (3DIC), 2013,
  • [43] Test and Debug Solutions for 3D-Stacked Integrated Circuits
    Deutsch, Sergej
    Chakraharty, Krishnendu
    2015 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2015,
  • [44] The Failure Analysis Of 3D-Stacked Die Packaging Flash
    Fang, Jianming
    Guan, Xinbin
    Li, Enliang
    2024 25TH INTERNATIONAL CONFERENCE ON ELECTRONIC PACKAGING TECHNOLOGY, ICEPT, 2024,
  • [45] IMEC demonstrates feasibility of 3D-stacked IC integration
    不详
    ELECTRONICS WORLD, 2007, 113 (1855): : 4 - 4
  • [46] NDC: Analyzing the Impact of 3D-Stacked Memory plus Logic Devices on Map Reduce Workloads
    Pugsley, Seth H.
    Jestes, Jeffrey
    Zhang, Huihui
    Balasubramonian, Rajeev
    Srinivasan, Vijayalakshmi
    Buyuktosunoglu, Alper
    Davis, Al
    Li, Feifei
    2014 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2014, : 190 - 200
  • [47] Algorithm/Hardware Co-optimized SAR Image Reconstruction with 3D-stacked Logic in Memory
    Sadi, Fazle
    Akin, Berkin
    Popovici, Doru T.
    Hoe, James C.
    Pileggi, Larry
    Franchetti, Franz
    2014 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2014,
  • [48] Simultaneous Multi-Layer Access: Improving 3D-Stacked Memory Bandwidth at Low Cost
    Lee, Donghyuk
    Ghose, Saugata
    Pekhimenko, Gennady
    Khan, Samira
    Mutlu, Onur
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2016, 12 (04)
  • [49] ApproxPIM: Exploiting Realistic 3D-stacked DRAM for Energy-Efficient Processing In-memory
    Tang, Yibin
    Wang, Ying
    Li, Huawei
    Li, Xiaowei
    2017 22ND ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2017, : 396 - 401
  • [50] Memory-Side Prefetching Scheme Incorporating Dynamic Page Mode in 3D-Stacked DRAM
    Rafique, Muhammad M.
    Zhu, Zhichun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (11) : 2734 - 2747