Implementation of deep neural networks on FPGA-CPU platform using Xilinx SDSOC

Cited by: 5

Authors:

Hassan, Rania O. [1]

Mostafa, Hassan [1,2]

Affiliations:

[1] Cairo Univ, Fac Engn, Elect & Elect Commun, Giza, Egypt

[2] Univ Sci & Technol, Nanotechnol & Nanoelect Program, Giza 12578, Egypt

Source:

ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING | 2021 / Vol. 106 / Issue 02

Keywords:

Convolutional neural networks (CNNs); Alex-Net; Accelerating CNNs; FPGA; Virtex; HW/SW co-design partitioning; SDSOC; HLS

DOI:

10.1007/s10470-020-01638-5

Chinese Library Classification number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Deep Convolutional Neural Networks (CNNs) are the state-of-the-art systems for image classification because of their high accuracy, but their high computational complexity is very costly. Acceleration is therefore the current target in this field, so that these systems can be used in real-time applications. Graphics Processing Units (GPUs) are one solution, but their high power consumption prevents their use in everyday equipment. The Field Programmable Gate Array (FPGA), by contrast, has low power consumption and a flexible architecture that is better suited to CNN implementations. This work discusses this problem and provides a solution that balances the speed of the CNN against the power consumption of the FPGA. The solution relies on two main speed-up techniques: parallelism of layer resources and pipelining inside some layers. In addition, we add a new methodology that trades off area requirements against speed and design time by implementing the CNN with the Xilinx SDSOC tool (targeting a processor and an FPGA on the same board). Implementing the design with HW/SW partitioning improves design time, because the design is written in a high-level language (C or C++) in Vivado HLS (High Level Synthesis). It also accommodates larger designs than an FPGA-only approach and shortens design time.
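
The two speed-up techniques named in the abstract, pipelining inside a layer and parallel use of layer resources, can be illustrated with a minimal C++ sketch in the style accepted by Vivado HLS. This is not the authors' code: the function name `conv_relu`, the layer dimensions, and the choice of pragmas are assumptions made for illustration only. A standard C++ compiler ignores the `#pragma HLS` lines, so the same source can also be run in software, which is the kind of workflow HW/SW partitioning relies on.

```cpp
#include <cstddef>

// Hypothetical layer dimensions, chosen only for illustration.
constexpr std::size_t IN_H = 6, IN_W = 6, K = 3;
constexpr std::size_t OUT_H = IN_H - K + 1, OUT_W = IN_W - K + 1;

// Single-channel KxK convolution (valid padding, stride 1) followed by ReLU.
void conv_relu(const float in[IN_H][IN_W],
               const float w[K][K],
               float bias,
               float out[OUT_H][OUT_W]) {
    for (std::size_t r = 0; r < OUT_H; ++r) {
        for (std::size_t c = 0; c < OUT_W; ++c) {
// Pipelining: ask the tool to overlap successive output pixels.
// Whether the requested interval is met depends on the target device.
#pragma HLS PIPELINE II=1
            float acc = bias;
            for (std::size_t i = 0; i < K; ++i) {
// Parallelism: request that the kernel loops become parallel hardware.
#pragma HLS UNROLL
                for (std::size_t j = 0; j < K; ++j) {
#pragma HLS UNROLL
                    acc += in[r + i][c + j] * w[i][j];
                }
            }
            // ReLU activation.
            out[r][c] = acc > 0.0f ? acc : 0.0f;
        }
    }
}
```

In an SDSoC-style flow, a function like this would be a candidate for marking as a hardware function while the rest of the network stays on the processor; which layers to move is a design decision the abstract describes only at a high level.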

Pages: 399-408

Page count: 10

50 items in total

[41] Hardware Implementation of Spiking Neural Networks on FPGA
Jianhui Han
Zhaolin Li
Weimin Zheng
Youhui Zhang
Tsinghua Science and Technology, 2020, 25 (04) : 479 - 486
[42] Noise Convolutional Neural Networks and FPGA Implementation
Munakata, Atsuki
Nakahara, Hiroki
Sato, Shimpei
2019 IEEE 49TH INTERNATIONAL SYMPOSIUM ON MULTIPLE-VALUED LOGIC (ISMVL), 2019, : 85 - 90
[43] An FPGA Implementation of Deep Spiking Neural Networks for Low-Power and Fast Classification
Ju, Xiping
Fang, Biao
Yan, Rui
Xu, Xiaoliang
Tang, Huajin
NEURAL COMPUTATION, 2020, 32 (01) : 182 - 204
[44] Hardware Implementation of Spiking Neural Networks on FPGA
Han, Jianhui
Li, Zhaolin
Zheng, Weimin
Zhang, Youhui
TSINGHUA SCIENCE AND TECHNOLOGY, 2020, 25 (04) : 479 - 486
[45] Accelerating Binarized Neural Networks: Comparison of FPGA, CPU, GPU, and ASIC
Nurvitadhi, Eriko
Sheffield, David
Sim, Jaewoong
Mishra, Asit
Venkatesh, Ganesh
Marr, Debbie
2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 77 - 84
[46] Acceleration of the Secure Hash Algorithm-256 (SHA-256) on an FPGA-CPU Cluster Using OpenCL
Bensalem, Hachem
Blaquiere, Yves
Savaria, Yvon
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
[47] AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers
Faraone, Julian
Kumm, Martin
Hardieck, Martin
Zipf, Peter
Liu, Xueyuan
Boland, David
Leong, Philip H. W.
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (01) : 115 - 128
[48] FPGA-based implementation of deep neural network using stochastic computing
Nobari, Maedeh
Jahanirad, Hadi
APPLIED SOFT COMPUTING, 2023, 137
[49] Implementation of GSC Based Subarray Adaptive LMS Algorithm using Xilinx FPGA
Salim, T.
Trinkle, M.
Drake, R.
2008 INTERNATIONAL CONFERENCE ON RADAR, VOLS 1 AND 2, 2008, : 525 - 529
[50] FPGA Implementation of Matrix-Vector Multiplication Using Xilinx System Generator
Sayahi, Intissar
Machhout, Mohsen
Tourki, Rached
2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 290 - 295
