Processing-in-Memory Accelerator for Dynamic Neural Network with Run-Time Tuning of Accuracy, Power and Latency

被引：2

作者：

Yang, Li ^{[1
]}

He, Zhezhi ^{[1
]}

Angizi, Shaahin ^{[1
]}

Fan, Deliang ^{[1
]}

机构：

[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85281 USA

来源：

2020 IEEE 33RD INTERNATIONAL SYSTEM-ON-CHIP CONFERENCE (SOCC) | 2020年

基金：

美国国家科学基金会;

关键词：

Processing-in-Memory; Dynamic neural network;

D O I：

10.1109/SOCC49529.2020.9524770

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the widely deployment of powerful deep neural network (DNN) into smart, but resource limited IoT devices, many prior works have been proposed to compress DNN in a hardware-aware manner to reduce the computing complexity, while maintaining accuracy, such as weight quantization, pruning, convolution decomposition, etc. However, in typical DNN compression methods, a smaller, but fixed, network structure is generated from a relative large background model for resource limited hardware accelerator deployment. However, such optimization lacks the ability to tune its structure on-the-fly to best fit for a dynamic computing hardware resource allocation and workloads. In this paper, we mainly review two of our prior works [1], [2] to address this issue, discussing how to construct a dynamic DNN structure through either uniform or non-uniform channel selection based sub-network sampling. The constructed dynamic DNN could tune its computing path to involve different number of channels, thus providing the ability to trade-off between speed, power and accuracy on-the-fly after model deployment. Correspondingly, an emerging Spin-Orbit Torque Magnetic Random-Access-Memory (SOT-MRAM) based Processing-In-Memory (PIM) accelerator will also be discussed for such dynamic neural network structure.

引用

页码：117 / 122

页数：6

共 9 条

[1] NeuroPIM: Felxible Neural Accelerator for Processing-in-Memory Architectures
Bidgoli, Ali Monavari
Fattahi, Sepideh
Rezaei, Seyyed Hossein Seyyedaghaei
Modarressi, Mehdi
Daneshtalab, Masoud
2023 26TH INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS AND SYSTEMS, DDECS, 2023, : 51 - 56
[2] Accelerating Neural Network Training with Processing-in-Memory GPU
Fei, Xiang
Han, Jianhui
Huang, Jianqiang
Zheng, Weimin
Zhang, Youhui
2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 414 - 421
[3] Functionality-Based Processing-in-Memory Accelerator for Deep Convolutional Neural Networks
Kim, Min-Jae
Kim, Jeong-Geun
Yoon, Su-Kyung
Kim, Shin-Dug
IEEE ACCESS, 2021, 9 : 145098 - 145108
[4] PyGim : An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures
Giannoula, Christina
Yang, Peiming
Fernandez, Ivan
Yang, Jiacheng
Durvasula, Sankeerth
Li, Yu xin
Sadrosadati, Mohammad
Luna, Juan gomez
Mutlu, Onur
Pekhimenko, Gennady
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2024, 8 (03)
[5] Energy Harvesting-assisted Ultra-Low-Power Processing-in-Memory Accelerator for ML Applications
Shukla, Sanket
Bavikadi, Sathwika
Dinakarrao, Sai Manoj Pudukotai
PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 633 - 638
[6] PRIME: A Novel Processing-in-memory Architecture for Neural Network Computation in ReRAM-based Main Memory
Chi, Ping
Li, Shuangchen
Xu, Cong
Zhang, Tao
Zhao, Jishen
Liu, Yongpan
Wang, Yu
Xie, Yuan
2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, : 27 - 39
[7] Processing-in-Memory (PIM) Based Defect Prediction of Metal Surfaces Using Spiking Neural Network
Siyad, Mohammed B.
Mohan, R.
JOURNAL OF THE CHINESE SOCIETY OF MECHANICAL ENGINEERS, 2023, 44 (05): : 379 - 388
[8] Task Parallelism-Aware Deep Neural Network Scheduling on Multiple Hybrid Memory Cube-Based Processing-in-Memory
Lee, Young Sik
Han, Tae Hee
IEEE ACCESS, 2021, 9 : 68561 - 68572
[9] Parasitic-Aware Modeling and Neural Network Training Scheme for Energy-Efficient Processing-in-Memory With Resistive Crossbar Array
Cao, Tiancheng
Liu, Chen
Gao, Yuan
Goh, Wang Ling
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2022, 12 (02) : 436 - 444

← 1 →