Implementation of deep neural networks on FPGA-CPU platform using Xilinx SDSOC

Cited by: 5
Authors
Hassan, Rania O. [1]
Mostafa, Hassan [1, 2]
Affiliations
[1] Cairo Univ, Fac Engn, Elect & Elect Commun, Giza, Egypt
[2] Univ Sci & Technol, Nanotechnol & Nanoelect Program, Giza 12578, Egypt
Keywords
Convolutional neural networks (CNNs); Alex-Net; Accelerating CNNs; FPGA; Virtex; HW/SW co-design partitioning; SDSOC; HLS
DOI
10.1007/s10470-020-01638-5
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Deep convolutional neural networks (CNNs) are the state of the art for image classification because of their high accuracy, but their high computational complexity is costly. Accelerating them is therefore a current goal in this field, so that they can be used in real-time applications. Graphics processing units (GPUs) are one solution, but their high power consumption prevents their use in everyday equipment, whereas the field-programmable gate array (FPGA) has low power consumption and a flexible architecture that suits CNN implementations better. This work addresses this problem and provides a solution that trades off the speed of the CNN against the power consumption of the FPGA. The solution relies on two main speed-up techniques: parallelism across layer resources and pipelining inside some layers. In addition, we introduce a methodology that balances area requirements against speed and design time by implementing the CNN with the Xilinx SDSOC tool, which targets a processor and an FPGA on the same board. Implementing the design with HW/SW partitioning shortens design time, because the design is written in a high-level language (C or C++) in Vivado HLS (High-Level Synthesis). It also accommodates larger designs than an FPGA-only implementation.
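As a rough illustration of the two speed-up techniques named in the abstract, the sketch below writes a small 1-D convolution in the C++ style that Vivado HLS accepts. It is not the authors' code: the function name `conv1d`, the array sizes, and the choice of where to place the pragmas are assumptions made for this example. `#pragma HLS PIPELINE` and `#pragma HLS UNROLL` are standard Vivado HLS directives; an ordinary C++ compiler ignores them, so the same source also runs as plain software.

```cpp
#include <cassert>

// Hypothetical sizes chosen for illustration; not taken from the paper.
constexpr int IN_LEN = 8;                    // input samples
constexpr int K_LEN = 3;                     // kernel taps
constexpr int OUT_LEN = IN_LEN - K_LEN + 1;  // "valid" output length

// 1-D sliding-window multiply-accumulate, the core operation of a CNN
// convolution layer reduced to one dimension.
void conv1d(const int in[IN_LEN], const int kernel[K_LEN], int out[OUT_LEN]) {
    assert(in != nullptr && kernel != nullptr && out != nullptr);

    for (int i = 0; i < OUT_LEN; ++i) {
        // Pipelining: ask the tool to start a new output every clock cycle.
#pragma HLS PIPELINE II=1
        int acc = 0;
        for (int k = 0; k < K_LEN; ++k) {
            // Parallelism: replicate the multiplier so all taps run at once.
#pragma HLS UNROLL
            acc += in[i + k] * kernel[k];
        }
        out[i] = acc;
    }
}
```

Calling `conv1d` with the input `{1, 2, 3, 4, 5, 6, 7, 8}` and the kernel `{1, 0, -1}` fills every output element with `in[i] - in[i + 2]`, which is -2 for this input. In a HW/SW partitioned flow, a function of this kind is the sort of candidate that might be moved to the FPGA fabric while the rest of the program stays on the processor.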
Pages: 399-408
Number of pages: 10