High performance reconfigurable accelerator for deep convolutional neural networks

被引:0
作者
Qiao R. [1 ,2 ]
Chen G. [1 ]
Gong G. [1 ]
Lu H. [1 ,2 ,3 ,4 ]
机构
[1] Institute of Semiconductors, Chinese Academy of Sciences, Beijing
[2] University of the Chinese Academy of Sciences, Beijing
[3] Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai
[4] Semiconductor Neural Network Intelligent Perception and Computing Technology Beijing Key Lab, Beijing
来源
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University | 2019年 / 46卷 / 03期
关键词
Accelerator; Deep neural networks; High performance; Reconfigurable architecture; Very large scale integrated circuit;
D O I
10.19665/j.issn1001-2400.2019.03.020
中图分类号
学科分类号
摘要
In deep convolutional neural networks, the diversity of channel sizes and kernel sizes makes it difficult for existing accelerators to achieve efficient calculations. Therefore, based on the biological brain neuron mechanism, a deep convolutional neural network accelerator is proposed which can provide not only multiple clustering methods for brain-like neurons and link organization among brain-like neurons towards different channel sizes, but also three mapping methods for different convolution kernel sizes. The accelerator implements efficient reuse of local memory data, which greatly reduces the amount of data movement and improves the computing performance. Tested by the object classification network and object detection network, the accelerator's computational performance is 498.6 GOPS and 571.3 GOPS, respectively; the energy efficiency is 582.0 GOPS/W and 651.7 GOPS/W, respectively. © 2019, The Editorial Board of Journal of Xidian University. All right reserved.
引用
收藏
页码:130 / 139
页数:9
相关论文
共 18 条
[1]  
Sehgal A., Kehtarnavaz N., A Convolutional Neural Network Smartphone App for Real-time Voice Activity Detection, IEEE Access, 6, pp. 9017-9026, (2018)
[2]  
Wang K., Wang K., Li Y., Text Image Refocusing by Using the Convolutional Neural Network, Journal of Xidian University, 45, 4, pp. 80-85, (2018)
[3]  
Krizhevsky A., Sutskever I., Hinton G.E., ImageNet Classification with Deep Convolutional Neural Networks, Advances in Neural Information Processing Systems: 2, pp. 1097-1105, (2012)
[4]  
Akcay S., Kundegorski M.E., Willcocks C.G., Et al., Using Deep Convolutional Neural Network Architectures for Object Classification and Detection within X-ray Baggage Security Imagery, IEEE Transactions on Information Forensics and Security, 13, 9, pp. 2203-2215, (2018)
[5]  
Sarikaya D., Corso J.J., Guru K.A., Detection and Localization of Robotic Tools in Robot-assisted Surgery Videos Using Deep Neural Networks For Region Proposal and Detection, IEEE Transactions on Medical Imaging, 36, 7, pp. 1542-1549, (2017)
[6]  
Nie L., Li Y., Kong X., Spatio-temporal Network Traffic Estimation and Anomaly Detection Based on Convolutional Neural Network in Vehicular Ad-hoc Networks, IEEE Access, 6, pp. 40168-40176, (2018)
[7]  
Liu Z., Cao C., Ding S., Et al., Towards Clinical Diagnosis: Automated Stroke Lesion Segmentation on Multi-spectral MR Image Using Convolutional Neural Network, IEEE Access, 6, pp. 57006-57016, (2018)
[8]  
Simonyan K., Zisserman A., Very Deep Convolutional Networks for Large-scale Image Recognition, Computer Science, 41, 5, pp. 1409-1556, (2014)
[9]  
Zhao B., Zhang L., Shi G., Et al., Design of the Programmable Neural Network Processor Based on the Transport Triggered Architecture, Journal of Xidian University, 45, 4, pp. 92-98, (2018)
[10]  
Du Z., Fasthuber R., Chen T., Et al., ShiDianNao: Shifting Vision Processing Closer to the Sensor, Proceedings of the 2015 International Symposium on Computer Architecture, pp. 92-104, (2015)