VHDL Generator for A High Performance Convolutional Neural Network FPGA-Based Accelerator

被引:0
作者
Hamdan, Muhammad K. [1 ]
Rover, Diane T. [1 ]
机构
[1] Iowa State Univ Sci & Technol, Elect & Comp Engn Dept, Ames, IA 50011 USA
来源
2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG) | 2017年
关键词
VHDL generator; CNNs; AlexNet; parallelism; reconfigurable; adaptability; pipeline; scalable; FPGA; COPROCESSOR;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Network (CNN) has been proven as a highly accurate and effective algorithm that has been used in a variety of applications such as handwriting digit recognition, visual recognition, and image classification. As a matter of fact, state-of-the-art CNNs are computationally intensive; however, their parallel and modular nature make platforms like FPGAs well suited for the acceleration process. A typical CNN takes a very long development round on FPGAs, hence in this paper, we propose a tool which allows developers, through a configurable user-interface, to automatically generate VHDL code for their desired CNN model. The generated code or architecture is modular, massively parallel, reconfigurable, scalable, fully pipelined, and adaptive to different CNN models. We demonstrate the automatic VHDL generator and its adaptability by implementing a small-scale CNN model "LeNet" and a large-scale one "AlexNet". The parameters of small scale models are automatically hard-coded as constants (part of the programmable logic) to overcome the memory bottleneck issue. On a Xilinx Virtex-7 running at 200 MHz, the system is capable of processing up to 125k images/s of size 28x28 for LeNet and achieved a peak performance of 611.52 GOP/s and 414 FPS for AlexNet.
引用
收藏
页数:6
相关论文
共 15 条
[11]  
Poulet C., CNP : AN FPGA-BASED PROCESSOR FOR CONVOLUTIONAL NETWORKS Cl ', V1
[12]   Going Deeper with Embedded FPGA Platform for Convolutional Neural Network [J].
Qiu, Jiantao ;
Wang, Jie ;
Yao, Song ;
Guo, Kaiyuan ;
Li, Boxun ;
Zhou, Erjin ;
Yu, Jincheng ;
Tang, Tianqi ;
Xu, Ningyi ;
Song, Sen ;
Wang, Yu ;
Yang, Huazhong .
PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, :26-35
[13]   A Massively Parallel Coprocessor for Convolutional Neural Networks [J].
Sankaradas, Murugan ;
Jakkula, Venkata ;
Cadambi, Srihari ;
Chakradhar, Srimat ;
Durdanovic, Igor ;
Cosatto, Eric ;
Graf, Hans Peter .
2009 20TH IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, 2009, :53-60
[14]  
Zhang C, 2015, P 2015 ACM SIGDA INT, P161, DOI 10.1145/2684746.2689060
[15]  
Zhou YM, 2015, PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), P829, DOI 10.1109/ICCSNT.2015.7490869