Efficient Design of Pruned Convolutional Neural Networks on FPGA

被引:6
|
作者
Vestias, Mario [1 ]
机构
[1] Inst Politecn Lisboa, INESC ID, Inst Super Engn Lisboa, Lisbon, Portugal
关键词
Deep learning; Convolutional neural network; FPGA; Block pruning; Edge computing;
D O I
10.1007/s11265-020-01606-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks (CNNs) have improved several computer vision applications, like object detection and classification, when compared to other machine learning algorithms. Running these models in edge computing devices close to data sources is attracting the attention of the community since it avoids high-latency data communication of private data for cloud processing and permits real-time decisions turning these systems into smart embedded devices. Running these models is computationally very demanding and requires a large amount of memory, which are scarce in edge devices compared to a cloud center. In this paper, we proposed an architecture for the inference of pruned convolutional neural networks in any density FPGAs. A configurable block pruning method is proposed together with an architecture that supports the efficient execution of pruned networks. Also, pruning and batching are studied together to determine how they influence each other. With the proposed architecture, we run the inference of a CNN with an average performance of 322 GOPs for 8-bit data in a XC7Z020 FPGA. The proposed architecture running AlexNet processes 240 images/s in a ZYNQ7020 and 775 images/s in a ZYNQ7045 with only 1.2% accuracy degradation.
引用
收藏
页码:531 / 544
页数:14
相关论文
共 50 条
  • [1] Efficient Design of Pruned Convolutional Neural Networks on FPGA
    Mário Véstias
    Journal of Signal Processing Systems, 2021, 93 : 531 - 544
  • [2] Efficient Implementation of Convolutional Neural Networks on FPGA
    Hadnagy, A.
    Feher, B.
    Kovacshazy, T.
    2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018, : 359 - 364
  • [3] Fast and Efficient Implementation of Convolutional Neural Networks on FPGA
    Podili, Abhinav
    Zhang, Chi
    Prasanna, Viktor
    2017 IEEE 28TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2017, : 11 - 18
  • [4] Efficient Utilization of FPGA Multipliers for Convolutional Neural Networks
    Boulasikis, M. A.
    Birbas, M.
    Tsafas, N.
    Kanakaris, N.
    2021 10TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2021,
  • [5] An Efficient FPGA-Based Architecture for Convolutional Neural Networks
    Hwang, Wen-Jyi
    Jhang, Yun-Jie
    Tai, Tsung-Ming
    2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 582 - 588
  • [6] Reliability evaluation of FPGA based pruned neural networks
    Gao, Zhen
    Yao, Yi
    Wei, Xiaohui
    Yan, Tong
    Zeng, Shulin
    Ge, Guangjun
    Wang, Yu
    Ullah, Anees
    Reviriego, Pedro
    MICROELECTRONICS RELIABILITY, 2022, 130
  • [7] Efficient Design of Low Bitwidth Convolutional Neural Networks on FPGA with Optimized Dot Product Units
    Vestias, Mario
    Duarte, Rui P.
    de Sousa, Jose T.
    Neto, Horacio
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (01)
  • [8] Design of Convolutional Neural Networks Hardware Acceleration Based on FPGA
    Qin Huabiao
    Cao Qinping
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2019, 41 (11) : 2599 - 2605
  • [9] Hardware Acceleration Design of Convolutional Neural Networks Based on FPGA
    Zhang, Guoning
    Hu, Jing
    Li, Laiquan
    Jiang, Haoyang
    2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 11 - 15
  • [10] Design Space Exploration of FPGA Accelerators for Convolutional Neural Networks
    Rahman, Atul
    Oh, Sangyun
    Lee, Jongeun
    Choi, Kiyoung
    PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 1147 - 1152