Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices

被引:0
|
作者
Preusser, Thomas B. [1 ]
Gambardella, Giulio [1 ]
Fraser, Nicholas [1 ]
Blott, Michaela [1 ]
机构
[1] Xilinx Res Labs, Dublin, Ireland
基金
欧盟地平线“2020”;
关键词
all-programmable; quantized neural networks; object detection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural networks have established as a generic and powerful means to approach challenging problems such as image classification, object detection or decision making. Their successful employment foots on an enormous demand of compute. The quantization of network parameters and the processed data has proven a valuable measure to reduce the challenges of network inference so effectively that the feasible scope of applications is expanded even into the embedded domain. This paper describes the making of a real-time object detection in a live video stream processed on an embedded all programmable device. The presented case illustrates how the required processing is tamed and parallelized across both the CPU cores and the programmable logic and how the most suitable resources and powerful extensions, such as NEON vectorization, are leveraged for the individual processing steps. The crafted result is an extended Darknet framework implementing a fully integrated, end-to-end solution from video capture over object annotation to video output applying neural network inference at different quantization levels running at 16 frames per second on an embedded Zynq UltraScale+ (XCZU3EG) platform.
引用
收藏
页码:833 / 838
页数:6
相关论文
共 50 条
  • [21] Inference and Energy Efficient Design of Deep Neural Networks for Embedded Devices
    Galanis, Ioannis
    Anagnostopoulos, Iraklis
    Nguyen, Chinh
    Bares, Guillermo
    Burkard, Dona
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 36 - 41
  • [22] Iterative neural networks for adaptive inference on resource-constrained devices
    Sam Leroux
    Tim Verbelen
    Pieter Simoens
    Bart Dhoedt
    Neural Computing and Applications, 2022, 34 : 10321 - 10336
  • [23] Iterative neural networks for adaptive inference on resource-constrained devices
    Leroux, Sam
    Verbelen, Tim
    Simoens, Pieter
    Dhoedt, Bart
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13): : 10321 - 10336
  • [24] Neural Networks Meet Physical Networks: Distributed Inference Between Edge Devices and the Cloud
    Chinchali, Sandeep P.
    Cidon, Eyal
    Pergament, Evgenya
    Chu, Tianshu
    Katti, Sachin
    HOTNETS-XVII: PROCEEDINGS OF THE 2018 ACM WORKSHOP ON HOT TOPICS IN NETWORKS, 2018, : 50 - 56
  • [25] Smart Classrooms aided by Deep Neural Networks inference on Mobile Devices
    Pacheco, Alberto
    Flores, Ever
    Sanchez, Raul
    Almanza-Garcia, Salvador
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 605 - 609
  • [26] Biomass estimation using Artificial Neural Networks on Field Programmable Analog Devices
    Ascencio, MRL
    Galicia, CA
    PROCEEDINGS OF THE 2000 IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, VOL 1 AND 2, 2000, : 61 - 66
  • [27] Programmable chalcogenide-based all-optical deep neural networks
    Teo, Ting Yu
    Ma, Xiaoxuan
    Pastor, Ernest
    Wang, Hao
    George, Jonathan K.
    Yang, Joel K. W.
    Wall, Simon
    Miscuglio, Mario
    Simpson, Robert E.
    Sorger, Volker J.
    NANOPHOTONICS, 2022, 11 (17) : 4073 - 4088
  • [28] Enabling Mixed-Precision Quantized Neural Networks in Extreme-Edge Devices
    Bruschi, Nazareno
    Garofalo, Angelo
    Conti, Francesco
    Tagliavini, Giuseppe
    Rossi, Davide
    17TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2020 (CF 2020), 2020, : 217 - 220
  • [29] Zac: Towards Automatic Optimization and Deployment of Quantized Deep Neural Networks on Embedded Devices
    Xiao, Qingcheng
    Liang, Yun
    2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
  • [30] Experiment Planning for Heterogeneous Programmable Networks
    Sultana, Nik
    18TH ANNUAL INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SENSOR SYSTEMS (DCOSS 2022), 2022, : 434 - 437