Implementation of deep neural networks on FPGA-CPU platform using Xilinx SDSOC

被引:6
作者
Hassan, Rania O. [1 ]
Mostafa, Hassan [1 ,2 ]
机构
[1] Cairo Univ, Fac Engn, Elect & Elect Commun, Giza, Egypt
[2] Univ Sci & Technol, Nanotechnol & Nanoelect Program, Giza 12578, Egypt
关键词
Convolutional neural networks (CNNs); Alex-Net; Accelerating CNNs; FPGA; Virtex; HW; SW co-design partitioning; SDSOC; HLS;
D O I
10.1007/s10470-020-01638-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Convolutional Neural Networks (CNNs) are the state-of-the-art systems for image classification due to their high accuracy but on the other hand their high computational complexity is very costly. The acceleration is the target in this field nowadays for using these systems in real time applications. The Graphics Processing Units is the solution but its high-power consumption prevents its utilization in daily-used equipment moreover the Field Programmable Gate Array (FPGA) has low power consumption and flexible architecture which fits more for CNN implementations. This work discusses this problem and provides a solution that compromises between the speed of the CNN and the power consumption of the FPGA. This solution depends on two main techniques for speeding up: parallelism of layers resources and pipelining inside some layers. On the other hand, we added a new methodology to compromise the area requirements with the speed and design time by implementing CNN using Xilinx SDSOC tool (including processor and FPGA on the same board). Implementing design using HW/SW partitioning will enhance time design based on high level language(C or C++) in Vivado HLS (High Level Synthesis). It also fits for more large designs than using FPGA only and faster in design time.
引用
收藏
页码:399 / 408
页数:10
相关论文
共 15 条
[1]  
ADEL E, 2018, 30 INT C MICR ICM
[2]  
[Anonymous], ISFPGA
[3]  
[Anonymous], 2016, Proceedings of the 2016 ACM/SIGDA international symposium on field-programmable gate arrays, P26, DOI DOI 10.1145/2847263.2847265
[4]  
Chen YH, 2016, ISSCC DIG TECH PAP I, V59, P262, DOI 10.1109/ISSCC.2016.7418007
[5]  
Chen ZL, 2017, IEEE INT VEH SYM, P1856, DOI 10.1109/IVS.2017.7995975
[6]  
COKUN M, 2017, MEES
[7]  
ELNABAWY A, 2018, IEEE INT NEWCAS
[8]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[9]  
GYSEL PM, 2012, THESIS
[10]  
KANG Y, NRF2014R1A1A2A160552