Realization of high-precision heterogeneous anchor-free detection model based on PYNQ framework

被引：0

作者：

Zhang R. ^{[1
,2
]}

Jiang X. ^{[1
]}

An J. ^{[1
]}

Cui T. ^{[1
]}

机构：

[1] Key Laboratory of Electronics and Information Technology for Space Systems(National Space Science Center, Chinese Academy of Sciences), Beijing

[2] University of Chinese Academy of Sciences, Beijing

来源：

Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology | 2022年 / 54卷 / 05期

关键词：

Anchor-free; Object detection; Optical remote sensing image; Overall scaling factor; !text type='Python']Python[!/text] productivity for ZYNQ;

D O I：

10.11918/202111015

中图分类号：

学科分类号：

摘要：

Due to the large number of parameters and large amount of calculation of deep convolutional networks, it is difficult to quickly and accurately deploy multi-scale target detection networks on many platforms with limited resources and power consumption. To solve this problem, based on the Python productivity for ZYNQ (PYNQ) framework, this paper realizes the IP core design and heterogeneous system architecture deployment of CTiny model, which is an anchor-free object detection model. First, a method of segmental quantization of the overall scaling factors in the convolution kernel was proposed, so that the pre-trained high-precision algorithm could be deployed on the field programmable gate array (FPGA) with low loss. Then, the system of the CTiny model was constructed based on the PYNQ framework, including ResNet backbone network, deconvolution network, and branch detection network. Finally, the time-consuming calculation such as picture preprocessing and post-processing was moved from serial ARM to parallel FPGA, further reducing the total processing time. Experimental results show that after deploying the CTiny model on the PYNQ-Z2 development board, the proposed quantization method achieved a mean average precision of 81.60% in the public optical remote sensing dataset NWPU VHR-10, which increased by 14.27% than truncated quantization. It has realized the requirement of deploying a tiny anchor-free object detection network with low loss. In addition, the processing time of post-processing was reduced from 9.228 s on the ARM side to 0.008 s on the FPGA side, which improved the speed of the detection model. Copyright ©2022 Journal of Harbin Institute of Technology.All rights reserved.

引用

页码：24 / 33

页数：9

共 16 条

[1]

LU Xiaoyan, ZHONG Yanfei, ZHENG Zhuo, A novel global-aware deep network for road detection of very high resolution remote sensing imagery, 2020 IEEE International Geoscience and Remote Sensing Symposium, (2020)

[2]

DONG Zhipeng, WANG Mi, WANG Yanli, Et al., Object detection in high resolution remote sensing imagery based on convolutional neural networks with suitable object scale features, IEEE Transactions on Geoscience and Remote Sensing, 58, 3, (2020)

[3]

KRISHNAMOORTHI R., Quantizing deep convolutional networks for efficient inference: A whitepaper

[4]

JACOB B, KLIGYS S, CHEN Bo, Et al., Quantization and training of neural networks for efficient integer-arithmetic-only inference, Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2018)

[5]

ZHANG Xiaofan, HAO Cong, LU Haoming, Et al., SkyNet: A champion model for DAC-SDC on low power object detection

[6]

REDMON J, FARHADI A., YOLO9000: Better, faster, stronger, Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR), (2017)

[7]

ZHOU Xingyi, WANG Dequan, KRAHENBVHL P., Objects as points

[8]

HE Kaiming, ZHANG Xiangyu, REN Shaoqing, Et al., Deep residual learning for image recognition, Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition, (2016)

[9]

MELONI P, DERIU G, CONTI F, Et al., A high-efficiency runtime reconfigurable IP for CNN acceleration on a mid-range all-programmable SoC, Proceedings of 2016 International Conference on ReConFigurable Computing and FPGAs (ReConFig), (2016)

[10]

LIU Ye, WANG Yin, CHANG Liang, Et al., A fast and efficient FPGA-based level set hardware accelerator for image segmentation, Proceedings of 2020 IEEE International Conference on Integrated Circuits, Technologies and Applications(ICTA), (2020)

← 1 2 →