Processing-in-Memory Accelerator for Dynamic Neural Network with Run-Time Tuning of Accuracy, Power and Latency

被引:2
|
作者
Yang, Li [1 ]
He, Zhezhi [1 ]
Angizi, Shaahin [1 ]
Fan, Deliang [1 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA
来源
2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC) | 2020年
基金
美国国家科学基金会;
关键词
Processing-in-Memory; Dynamic neural network;
D O I
10.1109/SOCC49529.2020.9524770
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the widely deployment of powerful deep neural network (DNN) into smart, but resource limited IoT devices, many prior works have been proposed to compress DNN in a hardware-aware manner to reduce the computing complexity, while maintaining accuracy, such as weight quantization, pruning, convolution decomposition, etc. However, in typical DNN compression methods, a smaller, but fixed, network structure is generated from a relative large background model for resource limited hardware accelerator deployment. However, such optimization lacks the ability to tune its structure on-the-fly to best fit for a dynamic computing hardware resource allocation and workloads. In this paper, we mainly review two of our prior works [1], [2] to address this issue, discussing how to construct a dynamic DNN structure through either uniform or non-uniform channel selection based sub-network sampling. The constructed dynamic DNN could tune its computing path to involve different number of channels, thus providing the ability to trade-off between speed, power and accuracy on-the-fly after model deployment. Correspondingly, an emerging Spin-Orbit Torque Magnetic Random-Access-Memory (SOT-MRAM) based Processing-In-Memory (PIM) accelerator will also be discussed for such dynamic neural network structure.
引用
收藏
页码:117 / 122
页数:6
相关论文
共 9 条
  • [1] NeuroPIM: Felxible Neural Accelerator for Processing-in-Memory Architectures
    Bidgoli, Ali Monavari
    Fattahi, Sepideh
    Rezaei, Seyyed Hossein Seyyedaghaei
    Modarressi, Mehdi
    Daneshtalab, Masoud
    2023 26TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, DDECS, 2023, : 51 - 56
  • [2] Accelerating Neural Network Training with Processing-in-Memory GPU
    Fei, Xiang
    Han, Jianhui
    Huang, Jianqiang
    Zheng, Weimin
    Zhang, Youhui
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 414 - 421
  • [3] Functionality-Based Processing-in-Memory Accelerator for Deep Convolutional Neural Networks
    Kim, Min-Jae
    Kim, Jeong-Geun
    Yoon, Su-Kyung
    Kim, Shin-Dug
    IEEE ACCESS, 2021, 9 : 145098 - 145108
  • [4] PyGim : An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures
    Giannoula, Christina
    Yang, Peiming
    Fernandez, Ivan
    Yang, Jiacheng
    Durvasula, Sankeerth
    Li, Yu xin
    Sadrosadati, Mohammad
    Luna, Juan gomez
    Mutlu, Onur
    Pekhimenko, Gennady
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2024, 8 (03)
  • [5] Energy Harvesting-assisted Ultra-Low-Power Processing-in-Memory Accelerator for ML Applications
    Shukla, Sanket
    Bavikadi, Sathwika
    Dinakarrao, Sai Manoj Pudukotai
    PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 633 - 638
  • [6] PRIME: A Novel Processing-in-memory Architecture for Neural Network Computation in ReRAM-based Main Memory
    Chi, Ping
    Li, Shuangchen
    Xu, Cong
    Zhang, Tao
    Zhao, Jishen
    Liu, Yongpan
    Wang, Yu
    Xie, Yuan
    2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, : 27 - 39
  • [7] Processing-in-Memory (PIM) Based Defect Prediction of Metal Surfaces Using Spiking Neural Network
    Siyad, Mohammed B.
    Mohan, R.
    JOURNAL OF THE CHINESE SOCIETY OF MECHANICAL ENGINEERS, 2023, 44 (05): : 379 - 388
  • [8] Task Parallelism-Aware Deep Neural Network Scheduling on Multiple Hybrid Memory Cube-Based Processing-in-Memory
    Lee, Young Sik
    Han, Tae Hee
    IEEE ACCESS, 2021, 9 : 68561 - 68572
  • [9] Parasitic-Aware Modeling and Neural Network Training Scheme for Energy-Efficient Processing-in-Memory With Resistive Crossbar Array
    Cao, Tiancheng
    Liu, Chen
    Gao, Yuan
    Goh, Wang Ling
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 436 - 444