Inference of Quantized Neural Networks on Heterogeneous All-Programmable Devices

Cited by: 0

Authors:

Preusser, Thomas B. [1]

Gambardella, Giulio [1]

Fraser, Nicholas [1]

Blott, Michaela [1]

Affiliations:

[1] Xilinx Res Labs, Dublin, Ireland

Source:

PROCEEDINGS OF THE 2018 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE) | 2018

Funding:

European Union Horizon 2020;

Keywords:

all-programmable; quantized neural networks; object detection;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Neural networks have established themselves as a generic and powerful means to approach challenging problems such as image classification, object detection, or decision making. Their successful employment rests on an enormous demand for compute. The quantization of network parameters and the processed data has proven a valuable measure to reduce the challenges of network inference so effectively that the feasible scope of applications is expanded even into the embedded domain. This paper describes the making of a real-time object detection system for a live video stream processed on an embedded all-programmable device. The presented case illustrates how the required processing is tamed and parallelized across both the CPU cores and the programmable logic, and how the most suitable resources and powerful extensions, such as NEON vectorization, are leveraged for the individual processing steps. The crafted result is an extended Darknet framework implementing a fully integrated, end-to-end solution from video capture through object annotation to video output, applying neural network inference at different quantization levels and running at 16 frames per second on an embedded Zynq UltraScale+ (XCZU3EG) platform.


Pages: 833-838

Page count: 6
