Reconfigurable Dataflow Graphs For Processing-In-Memory

Cited by: 0
Authors
Shelor, Charles F. [1]
Kavi, Krishna M. [1]
Affiliation
[1] Univ North Texas, Comp Sci & Engn, Denton, TX 76203 USA
Source
ICDCN '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING | 2019
Keywords
Dataflow Architectures; Coarse Grained Reconfigurable Logic; Processing in Memory; 3D-Stacked Memories
DOI
10.1145/3288599.3288605
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
To address the ever-increasing gap between processor clock speeds and memory access times, there has been interest in moving computation closer to memory. Near-data processing, or processing-in-memory (PIM), is particularly suited to very high bandwidth memories such as 3D-DRAMs. Several approaches have been proposed for PIMs, including simple in-order processors, GPUs, specialized ASICs, and reconfigurable designs. In our case, we use Coarse-Grained Reconfigurable Logic to build dataflow graphs for computational kernels as the PIM. We show that our approach can achieve significant speedups and reduce the energy consumed by computations. We evaluated our designs using several processing technologies for building the coarse-grained logic units. The DFPIM concept showed good performance improvement and excellent energy efficiency for the streaming benchmarks that were analyzed. The DFPIM in a 28 nm process, with an implementation in each of the 16 vaults of a 3D-DRAM logic layer, showed an average speed-up of 7.2 over 32 cores of an Intel Xeon server system. The server processor required 368 times more energy to execute the benchmarks than the DFPIM implementation.
Pages: 110-119
Page count: 10