Reconfigurable Dataflow Graphs For Processing-In-Memory

被引:0
作者
Shelor, Charles F. [1 ]
Kavi, Krishna M. [1 ]
机构
[1] Univ North Texas, Comp Sci & Engn, Denton, TX 76203 USA
来源
ICDCN '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING | 2019年
关键词
Dataflow Architectures; Coarse Grained Reconfigurable Logic; Processing in Memory; 3D-Stacked Memories;
D O I
10.1145/3288599.3288605
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In order to meet the ever-increasing speed differences between processor clocks and memory access times, there has been an interest in moving computation closer to memory. The near data processing or processing-in-memory is particularly suited for very high bandwidth memories such as the 3D-DRAMs. There are different ideas proposed for PIMs, including simple in-order processors, GPUs, specialized ASICs and reconfigurable designs. In our case, we use Coarse-Grained Reconfigurable Logic to build dataflow graphs for computational kernels as the PIM. We show that our approach can achieve significant speedups and save energy consumed by computations. We evaluated our designs using several processing technologies for building the coarse-gained logic units. The DFPIM concept showed good performance improvement and excellent energy efficiency for the streaming benchmarks that were analyzed. The DFPIM in a 28 nm process with an implementation in each of 16 vaults of a 3D-DRAM logic layer showed an average speed-up of 7.2 over that using 32 cores of an Intel Xeon server system. The server processor required 368 times more energy to execute the benchmarks than the DFPIM implementation.
引用
收藏
页码:110 / 119
页数:10
相关论文
共 50 条
  • [21] A Compiler for Automatic Selection of Suitable Processing-in-Memory Instructions
    Ahmed, Hameeza
    Santos, Paulo C.
    Lima, Joao P. C.
    Moura, Rafael F.
    Alves, Marco A. Z.
    Beck, Antonio C. S.
    Carro, Luigi
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 564 - 569
  • [22] Heterogeneous Memory Architecture Accommodating Processing-In-Memory on SoC For AIoT Applications
    Qiu, Kangyi
    Zhang, Yaojun
    Yan, Bonan
    Huang, Ru
    27TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2022, 2022, : 383 - 388
  • [23] Accelerating Neural Network Training with Processing-in-Memory GPU
    Fei, Xiang
    Han, Jianhui
    Huang, Jianqiang
    Zheng, Weimin
    Zhang, Youhui
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 414 - 421
  • [24] Processing-in-Memory Using Optically-Addressed Phase Change Memory
    Yang, Guowei
    Demirkiran, Cansu
    Kizilates, Zeynep Ece
    Ocampo, Carlos A. Rios
    Coskun, Ayse K.
    Joshi, Ajay
    2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023,
  • [25] A programmable shared-memory system for an array of processing-in-memory devices
    Sangkuen Lee
    Hyogi Sim
    Youngjae Kim
    Sudharshan S. Vazhkudai
    Cluster Computing, 2019, 22 : 385 - 398
  • [26] Towards Memory-Efficient Allocation of CNNs on Processing-in-Memory Architecture
    Wang, Yi
    Chen, Weixuan
    Yang, Jing
    Li, Tao
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (06) : 1428 - 1441
  • [27] NeuroPIM: Felxible Neural Accelerator for Processing-in-Memory Architectures
    Bidgoli, Ali Monavari
    Fattahi, Sepideh
    Rezaei, Seyyed Hossein Seyyedaghaei
    Modarressi, Mehdi
    Daneshtalab, Masoud
    2023 26TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, DDECS, 2023, : 51 - 56
  • [28] Machine Learning Training on a Real Processing-in-Memory System
    Gomez-Luna, Juan
    Guo, Yuxin
    Brocard, Sylvan
    Legriel, Julien
    Cimadomo, Remy
    Oliveira, Geraldo F.
    Singh, Gagandeep
    Mutlu, Onur
    2022 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2022), 2022, : 292 - 295
  • [29] Thermal-aware processing-in-memory instruction offloading
    Nai, Lifeng
    Hadidi, Ramyad
    Xiao, He
    Kim, Hyojong
    Sim, Jaewoong
    Kim, Hyesoon
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 130 : 193 - 207
  • [30] PIMS: A Lightweight Processing-in-Memory Accelerator for Stencil Computations
    Li, Jie
    Wang, Xi
    Tumeo, Antonino
    Williams, Brody
    Leidel, John D.
    Chen, Yong
    MEMSYS 2019: PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON MEMORY SYSTEMS, 2019, : 41 - 52