Heterogeneous FPGA Based Convolutional Network Accelerator

被引:0
|
作者
Zhou X. [1 ]
Zhong S. [1 ]
Zhang W. [1 ]
Wang J. [1 ]
机构
[1] School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan
来源
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence | 2019年 / 32卷 / 10期
基金
中国国家自然科学基金;
关键词
Accelerator; Convolutional Neural Network; Field Programmable Gate Array(FPGA); Fixed-Point; Parallelism;
D O I
10.16451/j.cnki.issn1003-6059.201910007
中图分类号
学科分类号
摘要
Computational complexity of neural network methods is high, and its application in embedded scenarios is limited. To solve this problem, a convolutional network accelerator based on heterogeneous field programmable gate array is proposed. The sliding window is employed to accelerate the convolution calculation process, and thus the convolution process of different input and output channels can be handled. A 8 bit fixed-point accelerator is designed combining network quantization process, and the usage of computing resources is reduced. Experiments demonstrate that the proposed fixed-point accelerator achieves a higher computing speed and a lower power consumption with a less performance loss. © 2019, Science Press. All right reserved.
引用
收藏
页码:927 / 935
页数:8
相关论文
共 22 条
  • [1] Lecun Y., Bengio Y., Hinto G., Deep Learning, Nature, 521, 7553, pp. 436-444, (2015)
  • [2] Russakovshy O., Deng J., Su H., Et al., Imagenet Large Scale Visual Recognition Challenge, International Journal of Computer Vision, 115, 3, pp. 211-252, (2015)
  • [3] Girshick R., Fast R-CNN, Proc of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)
  • [4] Shellhamer E., Long J., Darrell T., Fully Convolutional Networks for Semantic Segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 4, pp. 640-651, (2017)
  • [5] Zhang Y., Pezeshki M., Brakel P., Et al., Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
  • [6] Simonyan K., Zisserman A., Very Deep Convolutional Networks for Large-Scale Image Recognition
  • [7] Nurvitadhi E., Venkatesh G., Sim J., Et al., Can FPGAs Beat GPUs in Accelerating Next-Generation Deep Neural Networks?, Proc of the ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, pp. 5-14, (2017)
  • [8] Wu J.X., Gao B.B., Wei X.S., Et al., Resource-Constrained Deep Learning: Challenges and Practices, Chinese Science(Information Science), 48, 5, pp. 501-510, (2018)
  • [9] Gokhale V., Jin J., Dundar A., Et al., A 240 G-ops/s Mobile Coprocessor for Deep Neural Networks, Proc of the IEEE Confe-rence on Computer Vision and Pattern Recognition, pp. 682-687, (2014)
  • [10] Liu Z.C., Zhu Y.X., Wang H., Et al., Parallel Acceleration Structure Design of Convolutional Neural Network Based on FPGA, Microelectronics and Computer, 35, 10, pp. 80-84, (2018)