Efficient Design of Pruned Convolutional Neural Networks on FPGA

被引：6

作者：

Vestias, Mario ^{[1
]}

机构：

[1] Inst Politecn Lisboa, INESC ID, Inst Super Engn Lisboa, Lisbon, Portugal

来源：

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2021年 / 93卷 / 05期

关键词：

Deep learning; Convolutional neural network; FPGA; Block pruning; Edge computing;

D O I：

10.1007/s11265-020-01606-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional Neural Networks (CNNs) have improved several computer vision applications, like object detection and classification, when compared to other machine learning algorithms. Running these models in edge computing devices close to data sources is attracting the attention of the community since it avoids high-latency data communication of private data for cloud processing and permits real-time decisions turning these systems into smart embedded devices. Running these models is computationally very demanding and requires a large amount of memory, which are scarce in edge devices compared to a cloud center. In this paper, we proposed an architecture for the inference of pruned convolutional neural networks in any density FPGAs. A configurable block pruning method is proposed together with an architecture that supports the efficient execution of pruned networks. Also, pruning and batching are studied together to determine how they influence each other. With the proposed architecture, we run the inference of a CNN with an average performance of 322 GOPs for 8-bit data in a XC7Z020 FPGA. The proposed architecture running AlexNet processes 240 images/s in a ZYNQ7020 and 775 images/s in a ZYNQ7045 with only 1.2% accuracy degradation.

引用

页码：531 / 544

页数：14

共 50 条

[1] Efficient Design of Pruned Convolutional Neural Networks on FPGA
Mário Véstias
Journal of Signal Processing Systems, 2021, 93 : 531 - 544
[2] Efficient Implementation of Convolutional Neural Networks on FPGA
Hadnagy, A.
Feher, B.
Kovacshazy, T.
2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018, : 359 - 364
[3] Fast and Efficient Implementation of Convolutional Neural Networks on FPGA
Podili, Abhinav
Zhang, Chi
Prasanna, Viktor
2017 IEEE 28TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2017, : 11 - 18
[4] Efficient Utilization of FPGA Multipliers for Convolutional Neural Networks
Boulasikis, M. A.
Birbas, M.
Tsafas, N.
Kanakaris, N.
2021 10TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2021,
[5] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
Hwang, Wen-Jyi
Jhang, Yun-Jie
Tai, Tsung-Ming
2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
[6] Reliability evaluation of FPGA based pruned neural networks
Gao, Zhen
Yao, Yi
Wei, Xiaohui
Yan, Tong
Zeng, Shulin
Ge, Guangjun
Wang, Yu
Ullah, Anees
Reviriego, Pedro
MICROELECTRONICS RELIABILITY, 2022, 130
[7] Efficient Design of Low Bitwidth Convolutional Neural Networks on FPGA with Optimized Dot Product Units
Vestias, Mario
Duarte, Rui P.
de Sousa, Jose T.
Neto, Horacio
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (01)
[8] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
Qin Huabiao
Cao Qinping
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2599 - 2605
[9] Hardware Acceleration Design of Convolutional Neural Networks Based on FPGA
Zhang, Guoning
Hu, Jing
Li, Laiquan
Jiang, Haoyang
2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 11 - 15
[10] Design Space Exploration of FPGA Accelerators for Convolutional Neural Networks
Rahman, Atul
Oh, Sangyun
Lee, Jongeun
Choi, Kiyoung
PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1147 - 1152

← 1 2 3 4 5 →