VHDL Generator for A High Performance Convolutional Neural Network FPGA-Based Accelerator

被引:0
|
作者
Hamdan, Muhammad K. [1 ]
Rover, Diane T. [1 ]
机构
[1] Iowa State Univ Sci & Technol, Elect & Comp Engn Dept, Ames, IA 50011 USA
来源
2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG) | 2017年
关键词
VHDL generator; CNNs; AlexNet; parallelism; reconfigurable; adaptability; pipeline; scalable; FPGA; COPROCESSOR;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Network (CNN) has been proven as a highly accurate and effective algorithm that has been used in a variety of applications such as handwriting digit recognition, visual recognition, and image classification. As a matter of fact, state-of-the-art CNNs are computationally intensive; however, their parallel and modular nature make platforms like FPGAs well suited for the acceleration process. A typical CNN takes a very long development round on FPGAs, hence in this paper, we propose a tool which allows developers, through a configurable user-interface, to automatically generate VHDL code for their desired CNN model. The generated code or architecture is modular, massively parallel, reconfigurable, scalable, fully pipelined, and adaptive to different CNN models. We demonstrate the automatic VHDL generator and its adaptability by implementing a small-scale CNN model "LeNet" and a large-scale one "AlexNet". The parameters of small scale models are automatically hard-coded as constants (part of the programmable logic) to overcome the memory bottleneck issue. On a Xilinx Virtex-7 running at 200 MHz, the system is capable of processing up to 125k images/s of size 28x28 for LeNet and achieved a peak performance of 611.52 GOP/s and 414 FPS for AlexNet.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Heterogeneous FPGA Based Convolutional Network Accelerator
    Zhou X.
    Zhong S.
    Zhang W.
    Wang J.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (10): : 927 - 935
  • [32] Calculation Optimization for Convolutional Neural Networks and FPGA-based Accelerator Design Using the Parameters Sparsity
    Liu Qinrang
    Liu Chongyang
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (06) : 1368 - 1374
  • [33] A FPGA-based Neural Accelerator for Small IoT Devices
    Hong, Seongmin
    Park, Yongjun
    PROCEEDINGS INTERNATIONAL SOC DESIGN CONFERENCE 2017 (ISOCC 2017), 2017, : 294 - 295
  • [34] A Review of FPGA-Based Custom Computing Architecture for Convolutional Neural Network Inference
    Peng Xiyuan
    Yu Jinxiang
    Yao Bowen
    Liu Liansheng
    Peng Yu
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (01) : 1 - 17
  • [35] Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators
    Guan, Yijin
    Xu, Ningyi
    Zhang, Chen
    Yuan, Zhihang
    Cong, Jason
    ADVANCED PARALLEL PROCESSING TECHNOLOGIES, 2017, 10561 : 14 - 26
  • [36] Implementation of FPGA-based Accelerator for Deep Neural Networks
    Tsai, Tsung-Han
    Ho, Yuan-Chen
    Sheu, Ming-Hwa
    2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
  • [37] An FPGA-Based accelerator for multiphysics modeling
    Huang, XM
    Ma, J
    ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 209 - 212
  • [38] An Efficient FPGA-Based Convolutional Neural Network for Classification: Ad-MobileNet
    Bouguezzi, Safa
    Ben Fredj, Hana
    Belabed, Tarek
    Valderrama, Carlos
    Faiedh, Hassene
    Souani, Chokri
    ELECTRONICS, 2021, 10 (18)
  • [39] The Shunt: An FPGA-Based Accelerator for Network Intrusion Prevention
    Weaver, Nicholas
    Paxson, Vern
    Gonzalez, Jose M.
    FPGA 2007: FIFTEENTH ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2007, : 199 - 206
  • [40] A convolutional neural network accelerator on FPGA for crystallography spot screening
    Jiang, Yuwei
    Feng, Yingqi
    Ren, Tao
    Zhu, Yongxin
    PROCEEDINGS OF THE 2024 IEEE 10TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC 2024, 2024, : 66 - 70