VHDL Generator for A High Performance Convolutional Neural Network FPGA-Based Accelerator

被引：0

作者：

Hamdan, Muhammad K. ^{[1
]}

Rover, Diane T. ^{[1
]}

机构：

[1] Iowa State Univ Sci & Technol, Elect & Comp Engn Dept, Ames, IA 50011 USA

来源：

2017 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG) | 2017年

关键词：

VHDL generator; CNNs; AlexNet; parallelism; reconfigurable; adaptability; pipeline; scalable; FPGA; COPROCESSOR;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Network (CNN) has been proven as a highly accurate and effective algorithm that has been used in a variety of applications such as handwriting digit recognition, visual recognition, and image classification. As a matter of fact, state-of-the-art CNNs are computationally intensive; however, their parallel and modular nature make platforms like FPGAs well suited for the acceleration process. A typical CNN takes a very long development round on FPGAs, hence in this paper, we propose a tool which allows developers, through a configurable user-interface, to automatically generate VHDL code for their desired CNN model. The generated code or architecture is modular, massively parallel, reconfigurable, scalable, fully pipelined, and adaptive to different CNN models. We demonstrate the automatic VHDL generator and its adaptability by implementing a small-scale CNN model "LeNet" and a large-scale one "AlexNet". The parameters of small scale models are automatically hard-coded as constants (part of the programmable logic) to overcome the memory bottleneck issue. On a Xilinx Virtex-7 running at 200 MHz, the system is capable of processing up to 125k images/s of size 28x28 for LeNet and achieved a peak performance of 611.52 GOP/s and 414 FPS for AlexNet.

引用

页数：6

共 50 条

[31] Heterogeneous FPGA Based Convolutional Network Accelerator
Zhou X.
Zhong S.
Zhang W.
Wang J.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2019, 32 (10): : 927 - 935
[32] Calculation Optimization for Convolutional Neural Networks and FPGA-based Accelerator Design Using the Parameters Sparsity
Liu Qinrang
Liu Chongyang
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (06) : 1368 - 1374
[33] A FPGA-based Neural Accelerator for Small IoT Devices
Hong, Seongmin
Park, Yongjun
PROCEEDINGS INTERNATIONAL SOC DESIGN CONFERENCE 2017 (ISOCC 2017), 2017, : 294 - 295
[34] A Review of FPGA-Based Custom Computing Architecture for Convolutional Neural Network Inference
Peng Xiyuan
Yu Jinxiang
Yao Bowen
Liu Liansheng
Peng Yu
CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (01) : 1 - 17
[35] Using Data Compression for Optimizing FPGA-Based Convolutional Neural Network Accelerators
Guan, Yijin
Xu, Ningyi
Zhang, Chen
Yuan, Zhihang
Cong, Jason
ADVANCED PARALLEL PROCESSING TECHNOLOGIES, 2017, 10561 : 14 - 26
[36] Implementation of FPGA-based Accelerator for Deep Neural Networks
Tsai, Tsung-Han
Ho, Yuan-Chen
Sheu, Ming-Hwa
2019 IEEE 22ND INTERNATIONAL SYMPOSIUM ON DESIGN AND DIAGNOSTICS OF ELECTRONIC CIRCUITS & SYSTEMS (DDECS), 2019,
[37] An FPGA-Based accelerator for multiphysics modeling
Huang, XM
Ma, J
ERSA '04: THE 2004 INTERNATIONAL CONFERENCE ON ENGINEERING OF RECONFIGURABLE SYSTEMS AND ALGORITHMS, 2004, : 209 - 212
[38] An Efficient FPGA-Based Convolutional Neural Network for Classification: Ad-MobileNet
Bouguezzi, Safa
Ben Fredj, Hana
Belabed, Tarek
Valderrama, Carlos
Faiedh, Hassene
Souani, Chokri
ELECTRONICS, 2021, 10 (18)
[39] The Shunt: An FPGA-Based Accelerator for Network Intrusion Prevention
Weaver, Nicholas
Paxson, Vern
Gonzalez, Jose M.
FPGA 2007: FIFTEENTH ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2007, : 199 - 206
[40] A convolutional neural network accelerator on FPGA for crystallography spot screening
Jiang, Yuwei
Feng, Yingqi
Ren, Tao
Zhu, Yongxin
PROCEEDINGS OF THE 2024 IEEE 10TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC 2024, 2024, : 66 - 70

← 1 2 3 4 5 →