Optimizing FPGA-Based Convolutional Neural Network Performance

被引：3

作者：

Kao, Chi-Chou ^{[1
]}

机构：

[1] Natl Univ Tainan, Dept Comp Sci & Informat Engn, Tainan 700, Taiwan

来源：

JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS | 2023年 / 32卷 / 15期

关键词：

CNN; FPGA; optimize; performance; architecture;

D O I：

10.1142/S0218126623502547

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In deep learning, convolutional neural networks (CNNs) are a class of artificial neural networks (ANNs), most commonly applied to analyze visual imagery. They are also known as Shift-Invariant or Space-Invariant Artificial Neural Networks (SIANNs), based on the shared-weight architecture of the convolution kernels or filters that slide along input features and provide translation-equivariant responses known as feature maps. Recently, various architectures for CNN based on FPGA platform have been proposed because it has the advantages of high performance and fast development cycle. However, some key issues including how to optimize the performance of CNN layers with different structures, high-performance heterogeneous accelerator design, and how to reduce the neural network framework integration overhead need to be improved. To overcome and improve these problems, we propose dynamic cycle pipeline tiling, data layout optimization, and a pipelined software and hardware (SW-HW)-integrated architecture with flexibility and integration. Some benchmarks have been tested and implemented on the FPGA board for the proposed architecture. The proposed dynamic tiling and data layout transformation improved by 2.3 times in the performance. Moreover, with two-level pipelining, we achieve up to five times speedup and the proposed system is 3.8 times more energy-efficient than the GPU.

引用

页数：19

共 50 条

[41] Edge FPGA-based Onsite Neural Network Training
Chen, Ruiqi
Zhang, Haoyang
Li, Yu
Zhang, Runzhou
Li, Guoyu
Yu, Jun
Wang, Kun
2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
[42] An FPGA-based memristor emulator for artificial neural network
Zhang, Zhang
Li, Chao
Zhang, Weiqi
Zhou, Jing
Liu, Gang
MICROELECTRONICS JOURNAL, 2023, 131
[43] FPGA based convolution and memory architecture for Convolutional Neural Network
Shahan, K. A.
Rani, Sheeba J.
2020 33RD INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2020 19TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2020, : 183 - 188
[44] An FPGA-Based High-Throughput Keypoint Detection Accelerator Using Convolutional Neural Network for Mobile Robot Applications
Li, Jingyuan
Liu, Ye
Huang, Kun
Zhou, Liang
Chang, Liang
Zhou, Jun
2022 IEEE ASIA PACIFIC CONFERENCE ON POSTGRADUATE RESEARCH IN MICROELECTRONICS AND ELECTRONICS, PRIMEASIA, 2022, : 81 - 84
[45] FPGA-based Accelerator for Deep Convolutional Neural Networks for the SPARK Environment
Morcel, Raghid
Ezzeddine, Mazen
Akkary, Haitham
2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 126 - 133
[46] FPGA-Based Memristor Emulator Circuit for Binary Convolutional Neural Networks
Tolba, Mohammed F.
Halawani, Yasmin
Saleh, Hani
Mohammad, Baker
Al-Qutayri, Mahmoud
IEEE ACCESS, 2020, 8 : 117736 - 117745
[47] Efficient Two-Stage Max-Pooling Engines for an FPGA-Based Convolutional Neural Network
Hong, Eonpyo
Choi, Kang-A
Joo, Jhihoon
ELECTRONICS, 2023, 12 (19)
[48] FPGA-Based Implementation of a Real-Time Object Recognition System Using Convolutional Neural Network
Gilan, Ali Azarmi
Emad, Mohammad
Alizadeh, Bijan
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (04) : 755 - 759
[49] A Study on the Design Procedure of Re-Configurable Convolutional Neural Network Engine for FPGA-Based Applications
Kumar, Pervesh
Ali, Imran
Kim, Dong-Gyun
Byun, Sung-June
Kim, Dong-Gyu
Pu, Young-Gun
Lee, Kang-Yoon
ELECTRONICS, 2022, 11 (23)
[50] FPGA-Based High-Performance Data Compression Deep Neural Network Accelerator
Wang, Hanze
Fu, Yingxun
Ma, Li
2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 563 - 569

← 1 2 3 4 5 →