Design of hardware accelerator for target recognition convolutional neural network based on DSP plus FPGA architecture

被引:0
作者
Lu Bocheng [1 ]
Xu Chuangang [1 ]
Li Shiliang [1 ]
Sun Ping [1 ]
Zhang Jingfeng [1 ]
机构
[1] Tianjin Jinhang Inst Phys Technol, Tianjin 300308, Peoples R China
来源
AOPC 2024: INFRARED TECHNOLOGY AND APPLICATIONS | 2025年 / 13493卷
关键词
FPGA; hardware accelerator; target recognition; YOLO;
D O I
10.1117/12.3045653
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
In response to the deployment requirements of target recognition algorithms based on convolutional neural networks (CNN) on embedded platforms, this paper proposes a hardware accelerator based on the Digital Signal Processor (DSP) + Field Programmable Gate Array (FPGA) architecture, aiming to overcome the current issues of low recognition speed and low hardware resource utilization. The system is designed based on the 6678 series DSP and the V7 series FPGA. The DSP is used as the host computer, and the design of various operators required in the CNN are completed in the FPGA as the core computing module of the accelerator. This architecture enables flexibility in implementing different neural networks by configuring different instruction sets. Within the core computing module of the FPGA, this design completes the implementation of operators such as Convolution, MaxPool, Upsample, Concat, and Split, and further focuses on optimizing the Convolution operator. For the Convolution operator, a parallel acceleration method is adopted, which improves computational speed. Moreover, a method of input reuse is adopted to minimize the number of memory accesses. Additionally, a new feature map data input cache module is designed, which reduces the occupancy of on- chip hardware resources while minimizing the time from inputting image to the start of convolution calculation, enhancing the system's real-time performance. Finally, the system is used to accelerate the You Only Look Once (YOLO) neural network. It achieves a single-frame recognition time of 135.63ms for a 640x640-sized image at 200MHZ, with a throughput of up to 100.132GOPS.
引用
收藏
页数:9
相关论文
共 10 条
[1]  
[Anonymous], 2015, P 2015 ACM SIGDA INT, DOI DOI 10.1145/2684746.2689060
[2]   High-Utilization, High-Flexibility Depth-First CNN Coprocessor for Image Pixel Processing on FPGA [J].
Colleman, Steven ;
Verhelst, Marian .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2021, 29 (03) :461-471
[3]  
Eid O., 2021, 2021 INT C MICR ICM, P270, DOI [10.1109/ICM52667.2021.9664943, DOI 10.1109/ICM52667.2021.9664943]
[4]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[5]   Infrared ship target detection algorithm based on YOLOv5 [J].
Liu F. ;
Sun J. ;
Zhang S. ;
Sang H. ;
Sun X. .
Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2023, 52 (10)
[6]   An Efficient Hardware Accelerator for Sparse Convolutional Neural Networks on FPGAs [J].
Lu, Liqiang ;
Xie, Jiaming ;
Huang, Ruirui ;
Zhang, Jiansong ;
Lin, Wei ;
Liang, Yun .
2019 27TH IEEE ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2019, :17-25
[7]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[8]   Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks [J].
Suda, Naveen ;
Chandra, Vikas ;
Dasika, Ganesh ;
Mohanty, Abinash ;
Ma, Yufei ;
Vrudhula, Sarma ;
Seo, Jae-Sun ;
Cao, Yu .
PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, :16-25
[9]  
Wei GJ, 2018, IEEE INT CONF COMMUN, P734, DOI 10.1109/ICCChina.2018.8641256
[10]   An infrared vehicle detection method based on improved YOLOv5 [J].
Zhang X. ;
Zhao H. ;
Liu W. ;
Zhao Y. ;
Guan S. .
Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2023, 52 (08)