FPGA-Based Implementation of a Real-Time Object Recognition System Using Convolutional Neural Network

Cited by: 35
Authors
Gilan, Ali Azarmi [1]
Emad, Mohammad [1]
Alizadeh, Bijan [1]
Affiliations
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran 14395515, Iran
Keywords
Micromechanical devices; Convolution; Kernel; Bandwidth; Object recognition; Arrays; Real-time systems; Convolutional neural network; FPGA; Configurable architecture
DOI
10.1109/TCSII.2019.2922372
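Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809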
Abstract
High computational complexity and power consumption make convolutional neural networks (CNNs) unsuitable for real-time embedded applications. In this brief, we introduce a low-power and flexible platform as a hardware accelerator for CNNs. The proposed architecture is fully configurable through a software library, so the same reconfigurable hardware can run different CNN models. The hardware accelerator is evaluated on a ZC706 evaluation board. We use the AlexNet architecture in a real-time object recognition application to demonstrate the effectiveness of the proposed CNN accelerator. The results show that performance rates of 198.1 GOP/s using 512 DSP blocks and 23.14 GOP/s using 64 DSP blocks are achievable for the convolution and fully connected layers, respectively. Moreover, images are processed at 82 frames/s, which is significantly higher than the rate of existing implementations.
Pages: 755-759
Page count: 5