DNN pruning and mapping on NoC-Based communication infrastructure

被引:18
作者
Mirmahaleh, Seyedeh Yasaman Hosseini [1 ]
Rahmani, Amir Masoud [1 ]
机构
[1] Islamic Azad Univ, Sci & Res Branch, Dept Comp Engn, Tehran, Iran
关键词
Deep neural network (DNN); Network on chip (NoC); DNN mapping; Dataflow mapping; Weight and neuron pruning (WNP);
D O I
10.1016/j.mejo.2019.104655
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning algorithm-based applications have been deployed for supporting the intemet of things (IoT) and web search engines without losing accuracy in order to satisfy human requests. Developments in deep learning-based applications and complexity of machine learning algorithms increase the depth of artificial neural networks (ANN). Increasing depth of neural network (NN) is challenging regarding the delay, energy consumption, learning, and inference speed up. We train a deep neural network (DNN) gradient descent-based method based on two Booth and Matyas standard generating functions. We also propose a method for pruning weights, neurons, and layers of DNNs based on minimal distance error before and after pruning in a range of safety margin error. This paper employs a new elastic dataflow and DNN mapping on the mesh topology for decreasing delay and energy consumption. Simulation results show reducing the delay and energy consumption of training and inference phases by approximately 22.56%-77% and 65.94%-88.54% compared with not employing a DNN pruning.
引用
收藏
页数:12
相关论文
共 57 条
[1]  
[Anonymous], 2017, IEEE INT C NOCS
[2]  
[Anonymous], IEEE INT C INF KNOWL
[3]  
[Anonymous], IEEE INT J IET
[4]  
[Anonymous], IEEE INT
[5]  
[Anonymous], IEEE INT C NOCS 19
[6]  
[Anonymous], 2007, IEEE INT C ICML
[7]  
[Anonymous], 2016, IEEE INT C CVPR
[8]  
[Anonymous], IEEE INT C NOCS 18
[9]  
[Anonymous], IEEE INT C ASPLOS 18
[10]  
[Anonymous], IEEE INT C DATE