Efficient FPGA-based Accelerator for Post-Processing in Object Detection

被引:1
|
作者
Guo, Zibo [1 ]
Liu, Kai [1 ]
Liu, Wei [2 ]
Li, Shangrong [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] State Key Lab Geoinformat Engn, Xian, Peoples R China
来源
2023 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY, ICFPT | 2023年
基金
中国国家自然科学基金;
关键词
Object Detection; FPGA; NMS; Post-Processing; Hardware Accelerator;
D O I
10.1109/ICFPT59805.2023.00019
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The fast object detection algorithms such as YOLO, etc., have two main components: convolutional neural network (CNN) and post-processing. In this paper, we propose an efficient FPGA-based accelerator for the post-processing of YOLO object detection. We schedule a pipelined data path for post-processing, which fuses data scanning/caching, decoding, class identification, and non-maximum suppression (NMS) operations. This data path effectively hides the processing time of operations without data dependencies, providing up to 43x speedup over primitive serial processes. Moreover, We propose a parallel hardware architecture for NMS, allowing for parallel intersection over union (IoU) computation and threshold comparison. This architecture significantly reduces the latency of NMS, providing up to 811x speedup over primitive serial processes. Our accelerator implemented on a Xilinx Virtex-7 690t FPGA runs at 150MHz. Impressively, our accelerator achieves minimal latency of only 0.19 mu s and a processing time of merely 4.46 mu s for the postprocessing of the YOLOv3Tiny algorithm. This represents an 378x speedup compared to the Intel i7-8700 CPU with 3.2GHz. Additionally, our NMS unit exhibits a latency of 0.07 mu s, which is nearly 2x faster than the state-of-the-art FPGA acceleration method.
引用
收藏
页码:125 / 131
页数:7
相关论文
共 50 条
  • [1] Efficient Hardware Post Processing of Anchor-Based Object Detection on FPGA
    Zhang, Hui
    Wu, Wei
    Ma, Yufei
    Wang, Zhongfeng
    2020 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2020), 2020, : 580 - 585
  • [2] FPGA-based accelerator for object detection: a comprehensive survey
    Kai Zeng
    Qian Ma
    Jia Wen Wu
    Zhe Chen
    Tao Shen
    Chenggang Yan
    The Journal of Supercomputing, 2022, 78 : 14096 - 14136
  • [3] FPGA-based accelerator for object detection: a comprehensive survey
    Zeng, Kai
    Ma, Qian
    Wu, Jia Wen
    Chen, Zhe
    Shen, Tao
    Yan, Chenggang
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (12) : 14096 - 14136
  • [4] An FPGA-Based YOLOv6 Accelerator for High-Throughput and Energy-Efficient Object Detection
    Sha, Xingan
    Yanagisawa, Masao
    Shi, Youhua
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2025, E108A (03) : 473 - 481
  • [5] WGeod: A General and Efficient FPGA Accelerator for Object Detection
    Wang, Zihan
    Zhao, Mengying
    Gong, Lei
    Wang, Chao
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 730 - 738
  • [6] A FPGA-based accelerator implementaion for YOLOv2 object detection using Winograd algorithm
    Lv, Peng
    Liu, Wei
    Li, Jinghui
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 1894 - 1898
  • [7] An Efficient FPGA-Based Accelerator Design for Convolution
    Song, Peng-Fei
    Pan, Jeng-Shyang
    Yang, Chun-Sheng
    Lee, Chiou-Yng
    2017 IEEE 8TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2017, : 494 - 500
  • [8] An Efficient FPGA-based Accelerator for Deep Forest
    Zhu, Mingyu
    Luo, Jiapeng
    Mao, Wendong
    Wang, Zhongfeng
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3334 - 3338
  • [9] FPGA-Based Vehicle Detection and Tracking Accelerator
    Zhai, Jiaqi
    Li, Bin
    Lv, Shunsen
    Zhou, Qinglei
    SENSORS, 2023, 23 (04)
  • [10] FPGA-based Object Detection for Autonomous Driving System
    Harada, Kenichi
    Kanazawa, Kenji
    Yasunaga, Moritoshi
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 465 - 468