Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices

被引:0
|
作者
Preusser, Thomas B. [1 ]
Gambardella, Giulio [1 ]
Fraser, Nicholas [1 ]
Blott, Michaela [1 ]
机构
[1] Xilinx Res Labs, Dublin, Ireland
基金
欧盟地平线“2020”;
关键词
all-programmable; quantized neural networks; object detection;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural networks have established as a generic and powerful means to approach challenging problems such as image classification, object detection or decision making. Their successful employment foots on an enormous demand of compute. The quantization of network parameters and the processed data has proven a valuable measure to reduce the challenges of network inference so effectively that the feasible scope of applications is expanded even into the embedded domain. This paper describes the making of a real-time object detection in a live video stream processed on an embedded all programmable device. The presented case illustrates how the required processing is tamed and parallelized across both the CPU cores and the programmable logic and how the most suitable resources and powerful extensions, such as NEON vectorization, are leveraged for the individual processing steps. The crafted result is an extended Darknet framework implementing a fully integrated, end-to-end solution from video capture over object annotation to video output applying neural network inference at different quantization levels running at 16 frames per second on an embedded Zynq UltraScale+ (XCZU3EG) platform.
引用
收藏
页码:833 / 838
页数:6
相关论文
共 50 条
  • [1] Implementation of Binarized Neural Networks in All-Programmable System-on-Chip Platforms
    Xiang, Maoyang
    Teo, Tee Hui
    ELECTRONICS, 2022, 11 (04)
  • [2] Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
    Wu, Hai
    He, Ruifei
    Tan, Haoru
    Qi, Xiaojuan
    Huang, Kaibin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15964 - 15978
  • [3] RETRACTION: Implementation of Binarized Neural Networks in All-Programmable System-on-Chip Platforms.
    Xiang, Maoyang
    Teo, Tee Hui
    ELECTRONICS, 2025, 14 (02):
  • [4] Simulating quantized inference on convolutional neural networks
    Finotti, Vitor
    Albertini, Bruno
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 95
  • [5] Simulating quantized inference on convolutional neural networks
    Finotti, Vitor
    Albertini, Bruno
    Computers and Electrical Engineering, 2021, 95
  • [6] Quantized Convolutional Neural Networks for Mobile Devices
    Wu, Jiaxiang
    Leng, Cong
    Wang, Yuhang
    Hu, Qinghao
    Cheng, Jian
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4820 - 4828
  • [7] Partitioning Deep Neural Networks for Optimally Pipelined Inference on Heterogeneous IoT Devices with Low Latency Networks
    Sect, Woobean
    Kim, Saehwa
    Hong, Seongsoo
    2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 1470 - 1471
  • [8] Implementation of a GNSS Rebroadcaster in an All-Programmable System-On-Chip Platform
    Majoral, M.
    Arribas, J.
    Fernandez-Prades, C.
    2022 10TH WORKSHOP ON SATELLITE NAVIGATION TECHNOLOGY (NAVITEC 2022), 2022,
  • [9] A Reconfigurable MapReduce Accelerator for multi-core all-programmable SoCs
    Kachris, Christoforos
    Sirakoulis, Georgios Ch.
    Soudris, Dimitrios
    2014 INTERNATIONAL SYMPOSIUM ON SYSTEM-ON-CHIP (SOC), 2014,
  • [10] Security Solutions in the First-Generation Zynq All-Programmable SoC
    Trimberger, Steve
    2014 IEEE INTERNATIONAL TEST CONFERENCE (ITC), 2014,