ILP-based Multi-Branch CNNs Mapping on Processing-in-Memory Architecture

被引:0
|
作者
Han, Haodong [1 ]
Wang, Junpeng [1 ]
Ding, Bo [1 ]
Chen, Song [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
来源
2024 IEEE 6TH INTERNATIONAL CONFERENCE ON AI CIRCUITS AND SYSTEMS, AICAS 2024 | 2024年
关键词
convolutional neural network; mapping; 3D-stacked DRAM; processing in memory;
D O I
10.1109/AICAS59952.2024.10595921
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D-stacked-DRAM-based processing-in-memory (DRAM-PIM) architectures demonstrate benefits in memory access bandwidth and energy efficiency and effectively mitigate the storage wall challenge posed by CNNs. However, DRAM-PIM architectures have a huge mapping space for multi-branch CNNs and inadequate mapping increases the latency of CNNs and the memory requirement of nodes. In this work, we propose an integer linear programming (ILP) method that integrates layer scheduling and resource quantity allocation to minimize overall latency. An ILP-based binding method is introduced to bind layers onto a node array of DRAM-PIM architectures with the maximum memory requirement of nodes reduced. Experimental results demonstrate that our method reduces the latency of branching structures in CNNs and achieves better memory balancing between nodes compared to the baseline method.
引用
收藏
页码:179 / 183
页数:5
相关论文
共 50 条
  • [21] Optimizing Weight Mapping and Data Flow for Convolutional Neural Networks on Processing-in-Memory Architectures
    Peng, Xiaochen
    Liu, Rui
    Yu, Shimeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (04) : 1333 - 1343
  • [22] AR-PIM: An Adaptive-Range Processing-in-Memory Architecture
    Chou, Teyuh
    Garcia-Redondo, Fernando
    Whatmough, Paul
    Zhang, Zhengya
    2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023,
  • [23] HydraNet: Multi-branch Convolution Neural Network Architecture for MRI Denoising
    Gregory, Stephen
    Cheng, Hu
    Newman, Sharlene
    Gan, Yu
    MEDICAL IMAGING 2021: IMAGE PROCESSING, 2021, 11596
  • [24] A Processing-in-Memory Architecture Programming Paradigm for Wireless Internet-of-Things Applications
    Yang, Xu
    Hou, Yumin
    He, Hu
    SENSORS, 2019, 19 (01)
  • [25] RoPIM: A Processing-in-Memory Architecture for Accelerating Rotary Positional Embedding in Transformer Models
    Jeon, Yunhyeong
    Jang, Minwoo
    Lee, Hwanjun
    Jung, Yeji
    Jung, Jin
    Lee, Jonggeon
    So, Jinin
    Kim, Daehoon
    IEEE COMPUTER ARCHITECTURE LETTERS, 2025, 24 (01) : 41 - 44
  • [26] PAIRS: Pruning-AIded Row-Skipping for SDK-Based Convolutional Weight Mapping in Processing-In-Memory Architectures
    Rhe, Johnny
    Jeon, Kang Eun
    Ko, Jong Hwan
    2023 IEEE/ACM INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, ISLPED, 2023,
  • [27] Sky-Sorter: A Processing-in-Memory Architecture for Large-Scale Sorting
    Zokaee, Farzaneh
    Chen, Fan
    Sun, Guangyu
    Jiang, Lei
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (02) : 480 - 493
  • [28] abstractPIM: Bridging the Gap Between Processing-In-Memory Technology and Instruction Set Architecture
    Eliahu, Adi
    Ben-Hur, Rotem
    Ronen, Ronny
    Kvatinsky, Shahar
    2020 IFIP/IEEE 28TH INTERNATIONAL CONFERENCE ON VERY LARGE SCALE INTEGRATION (VLSI-SOC), 2020, : 28 - 33
  • [29] VW-SDK: Efficient Convolutional Weight Mapping Using Variable Windows for Processing-In-Memory Architectures
    Rhe, Johnny
    Moon, Sungmin
    Ko, Jong Hwan
    PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 214 - 219
  • [30] MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator
    Yu, Chao
    Liu, Sihang
    Khan, Samira
    IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (01) : 54 - 57