ILP-based Multi-Branch CNNs Mapping on Processing-in-Memory Architecture

被引：0

作者：

Han, Haodong ^{[1
]}

Wang, Junpeng ^{[1
]}

Ding, Bo ^{[1
]}

Chen, Song ^{[1
]}

机构：

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

来源：

2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024 | 2024年

关键词：

convolutional neural network; mapping; 3D-stacked DRAM; processing in memory;

D O I：

10.1109/AICAS59952.2024.10595921

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D-stacked-DRAM-based processing-in-memory (DRAM-PIM) architectures demonstrate benefits in memory access bandwidth and energy efficiency and effectively mitigate the storage wall challenge posed by CNNs. However, DRAM-PIM architectures have a huge mapping space for multi-branch CNNs and inadequate mapping increases the latency of CNNs and the memory requirement of nodes. In this work, we propose an integer linear programming (ILP) method that integrates layer scheduling and resource quantity allocation to minimize overall latency. An ILP-based binding method is introduced to bind layers onto a node array of DRAM-PIM architectures with the maximum memory requirement of nodes reduced. Experimental results demonstrate that our method reduces the latency of branching structures in CNNs and achieves better memory balancing between nodes compared to the baseline method.

引用

页码：179 / 183

页数：5

共 50 条

[1] Towards Memory-Efficient Allocation of CNNs on Processing-in-Memory Architecture
Wang, Yi
Chen, Weixuan
Yang, Jing
Li, Tao
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2018, 29 (06) : 1428 - 1441
[2] DDAM: Data Distribution-Aware Mapping of CNNs on Processing-In-Memory Systems
Wang, Junpeng
Du, Haitao
Ding, Bo
Xu, Qi
Chen, Song
Kang, Yi
ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2023, 28 (03)
[3] Study on Processing-in-Memory Technology based on Dataflow Architecture
Choi, Kyu Hyun
Hwang, Taeho
2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
[4] A Study of Data Layout in Multi-channel Processing-In-Memory Architecture
Jeong, Taeyang
Choi, Duheon
Han, Sangwoo
Chung, Eui-Young
PROCEEDINGS OF 2018 7TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2018), 2018, : 134 - 138
[5] Optimizing Weight Mapping and Data Flow for Convolutional Neural Networks on RRAM based Processing-In-Memory Architecture
Peng, Xiaochen
Liu, Rui
Yu, Shimeng
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
[6] RETRANSFORMER: ReRAM-based Processing-in-Memory Architecture for Transformer Acceleration
Yang, Xiaoxuan
Yan, Bonan
Li, Hai
Chen, Yiran
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
[7] A Novel ReRAM-Based Processing-in-Memory Architecture for Graph Traversal
Han, Lei
Shen, Zhaoyan
Liu, Duo
Shao, Zili
Huang, H. Howie
Li, Tao
ACM TRANSACTIONS ON STORAGE, 2018, 14 (01)
[8] Genetic Algorithm-Based Energy-Aware CNN Quantization for Processing-In-Memory Architecture
Kang, Beomseok
Lu, Anni
Long, Yun
Kim, Daehyun
Yu, Shimeng
Mukhopadhyay, Saibal
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2021, 11 (04) : 649 - 662
[9] Flexible Instruction Set Architecture for Programmable Look-up Table based Processing-in-Memory
Connolly, Mark
Sutradhar, Purab Ranjan
Indovina, Mark
Ganguly, Amlan
2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 66 - 73
[10] Implementation of a Low-Overhead Processing-in-Memory Architecture
Jang, Young-Jong
Kim, Byung-Soo
Kim, Dong-Sun
Hwang, Tae-ho
2016 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2016, : 185 - 186

← 1 2 3 4 5 →