Convolutional Neural Networks using FPGA-based Pipelining

Cited by: 0

Authors:

Ali G.A. [1]

Ali A.H. [2]

Affiliations:

[1] AL-Rafidain University College, Iraq

[2] Al-Iraqia University, Iraq

Source:

Iraqi Journal for Computer Science and Mathematics | 2023 / Vol. 4 / No. 02

Keywords:

Accelerators; Convolutional Neural Networks; FPGA; Hardware; Performance; Pipelining

DOI:

10.52866/ijcsm.2023.02.02.019

Abstract:

This study surveys the use of FPGA-based pipelining for hardware acceleration of convolutional neural networks (CNNs). CNNs are now widely used for computer vision tasks such as image classification and object recognition. Their processing and memory demands, however, can be excessive, especially for real-time applications. FPGA-based pipelining has emerged as a viable way to speed up CNNs thanks to the parallel processing capabilities and low power consumption of FPGAs. The survey describes the fundamentals of FPGA-based pipelining and the basic structure of CNNs. Current best practices for developing pipelined CNN accelerators on FPGAs are then reviewed, covering topics such as partitioning and pipelining. The difficulties and trade-offs discussed include area and power limits, memory needs, and latency considerations. In addition, the survey compares the various pipelined FPGA accelerators for CNNs in terms of performance, energy consumption, and resource utilisation. Future directions and potential research areas are also discussed, such as the use of approximate computing techniques, the integration of reconfigurable architectures with emerging memory technologies, and the exploration of hybrid architectures that combine FPGAs and other hardware accelerators. By providing a thorough overview of current trends and issues in FPGA-based pipelining for CNNs, the survey aims to aid researchers and practitioners in developing efficient and effective hardware accelerators for neural networks. © 2023 Authors. All rights reserved.

Citation

Pages: 215-223

Page count: 8

References (11 total):

[1]

Zhang C., Li P., Sun G., Guan Y., Xiao B., Cong J., Optimizing FPGA-based accelerator design for deep convolutional neural networks, Proceedings of the 2015 ACM/SIGDA international symposium on field-programmable gate arrays, pp. 161-170, (2015)

[2]

Yi Q., Sun H., Fujita M., FPGA based accelerator for neural networks computation with flexible pipelining, (2021)

[3]

Gong L., Wang C., Li X., Chen H., Zhou X., MALOC: A fully pipelined FPGA accelerator for convolutional neural networks with all layers mapped on chip, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 37, 11, pp. 2601-2612, (2018)

[4]

Mohammed Rajaa, Kadhem M., Iraqi Sign Language Translator system using Deep Learning, Al-Salam Journal for Engineering and Technology, 2, 1, pp. 109-116, (2023)

[5]

Alajanbi M., Malerba D., Liu H., Distributed reduced convolution neural networks, Mesopotamian Journal of Big Data, 2021, pp. 26-29, (2021)

[6]

Bajao N. A., Sarucam J.-a., Threats Detection in the Internet of Things Using Convolutional neural networks, long short-term memory, and gated recurrent units, Mesopotamian Journal of Cybersecurity, 2023, 2023, pp. 22-29

[7]

Shimoda M., Sada Y., Nakahara H., FPGA-based inter-layer pipelined accelerators for filter-wise weight-balanced sparse fully convolutional networks with overlapped tiling, Journal of Signal Processing Systems, 93, pp. 499-512, (2021)

[8]

Salman S. A., Dheyab S. A., Salih Q. M., Hammood W. A., Parallel Machine Learning Algorithms, Mesopotamian Journal of Big Data, 2023, 2023, pp. 13-17

[9]

Feng G., Hu Z., Chen S., Wu F., Energy-efficient and high-throughput FPGA-based accelerator for Convolutional Neural Networks, 2016 13th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT), pp. 624-626, (2016)

[10]

Bank-Tavakoli E., Ghasemzadeh S. A., Kamal M., Afzali-Kusha A., Pedram M., Polar: A pipelined/overlapped FPGA-based LSTM accelerator, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 28, 3, pp. 838-842, (2019)
