ILP-based Multi-Branch CNNs Mapping on Processing-in-Memory Architecture

Cited by: 0

Authors:

Han, Haodong [1]

Wang, Junpeng [1]

Ding, Bo [1]

Chen, Song [1]

Affiliations:

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

Source:

2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024 | 2024

Keywords:

convolutional neural network; mapping; 3D-stacked DRAM; processing in memory

DOI:

10.1109/AICAS59952.2024.10595921

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

3D-stacked-DRAM-based processing-in-memory (DRAM-PIM) architectures demonstrate benefits in memory access bandwidth and energy efficiency and effectively mitigate the storage wall challenge posed by CNNs. However, DRAM-PIM architectures have a huge mapping space for multi-branch CNNs, and an inadequate mapping increases both the latency of CNNs and the memory requirement of nodes. In this work, we propose an integer linear programming (ILP) method that integrates layer scheduling and resource quantity allocation to minimize overall latency. An ILP-based binding method is then introduced to bind layers onto a node array of DRAM-PIM architectures while reducing the maximum memory requirement of nodes. Experimental results demonstrate that our method reduces the latency of branching structures in CNNs and achieves better memory balancing between nodes compared to the baseline method.
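The allocation trade-off mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' ILP formulation: it covers only the split of nodes among parallel branches (no layer scheduling or binding), and it uses exhaustive enumeration as a stand-in for an ILP solver. The branch names, workloads, node count, and the even-split latency model are all hypothetical assumptions made for illustration.

```python
from itertools import product

# Hypothetical workloads (arbitrary units) for three parallel CNN branches.
BRANCH_WORK = {"branch_a": 120, "branch_b": 60, "branch_c": 30}
TOTAL_NODES = 7  # hypothetical number of PIM nodes available


def branch_latency(work, nodes):
    """Assumed latency model: work splits evenly over nodes (ceiling division)."""
    return -(-work // nodes)


def best_allocation(branch_work, total_nodes):
    """Try every integer allocation that gives each branch at least one node
    and uses at most total_nodes; keep the one whose slowest branch is fastest."""
    names = list(branch_work)
    best = None
    for alloc in product(range(1, total_nodes + 1), repeat=len(names)):
        if sum(alloc) > total_nodes:
            continue  # violates the node budget
        # Parallel branches finish when the slowest one finishes.
        latency = max(
            branch_latency(branch_work[name], count)
            for name, count in zip(names, alloc)
        )
        if best is None or latency < best[0]:
            best = (latency, dict(zip(names, alloc)))
    return best


latency, allocation = best_allocation(BRANCH_WORK, TOTAL_NODES)
print(latency, allocation)
```

Under these assumed numbers, giving the heaviest branch the most nodes equalizes the branch latencies. An ILP solver would search the same kind of integer decision space without enumerating it explicitly.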


Pages: 179-183

Page count: 5

50 items in total

[21] Optimizing Weight Mapping and Data Flow for Convolutional Neural Networks on Processing-in-Memory Architectures
Peng, Xiaochen
Liu, Rui
Yu, Shimeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (04) : 1333 - 1343
[22] AR-PIM: An Adaptive-Range Processing-in-Memory Architecture
Chou, Teyuh
Garcia-Redondo, Fernando
Whatmough, Paul
Zhang, Zhengya
2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023
[23] HydraNet: Multi-branch Convolution Neural Network Architecture for MRI Denoising
Gregory, Stephen
Cheng, Hu
Newman, Sharlene
Gan, Yu
MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
[24] A Processing-in-Memory Architecture Programming Paradigm for Wireless Internet-of-Things Applications
Yang, Xu
Hou, Yumin
He, Hu
SENSORS, 2019, 19 (01)
[25] RoPIM: A Processing-in-Memory Architecture for Accelerating Rotary Positional Embedding in Transformer Models
Jeon, Yunhyeong
Jang, Minwoo
Lee, Hwanjun
Jung, Yeji
Jung, Jin
Lee, Jonggeon
So, Jinin
Kim, Daehoon
IEEE COMPUTER ARCHITECTURE LETTERS, 2025, 24 (01) : 41 - 44
[26] PAIRS: Pruning-AIded Row-Skipping for SDK-Based Convolutional Weight Mapping in Processing-In-Memory Architectures
Rhe, Johnny
Jeon, Kang Eun
Ko, Jong Hwan
2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023
[27] Sky-Sorter: A Processing-in-Memory Architecture for Large-Scale Sorting
Zokaee, Farzaneh
Chen, Fan
Sun, Guangyu
Jiang, Lei
IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (02) : 480 - 493
[28] abstractPIM: Bridging the Gap Between Processing-In-Memory Technology and Instruction Set Architecture
Eliahu, Adi
Ben-Hur, Rotem
Ronen, Ronny
Kvatinsky, Shahar
2020 IFIP/IEEE 28TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2020 : 28 - 33
[29] VW-SDK: Efficient Convolutional Weight Mapping Using Variable Windows for Processing-In-Memory Architectures
Rhe, Johnny
Moon, Sungmin
Ko, Jong Hwan
PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022 : 214 - 219
[30] MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator
Yu, Chao
Liu, Sihang
Khan, Samira
IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (01) : 54 - 57
