Optimizing FPGA-Based Convolutional Neural Network Performance

被引:3
|
作者
Kao, Chi-Chou [1 ]
机构
[1] Natl Univ Tainan, Dept Comp Sci & Informat Engn, Tainan 700, Taiwan
关键词
CNN; FPGA; optimize; performance; architecture;
D O I
10.1142/S0218126623502547
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In deep learning, convolutional neural networks (CNNs) are a class of artificial neural networks (ANNs), most commonly applied to analyze visual imagery. They are also known as Shift-Invariant or Space-Invariant Artificial Neural Networks (SIANNs), based on the shared-weight architecture of the convolution kernels or filters that slide along input features and provide translation-equivariant responses known as feature maps. Recently, various architectures for CNN based on FPGA platform have been proposed because it has the advantages of high performance and fast development cycle. However, some key issues including how to optimize the performance of CNN layers with different structures, high-performance heterogeneous accelerator design, and how to reduce the neural network framework integration overhead need to be improved. To overcome and improve these problems, we propose dynamic cycle pipeline tiling, data layout optimization, and a pipelined software and hardware (SW-HW)-integrated architecture with flexibility and integration. Some benchmarks have been tested and implemented on the FPGA board for the proposed architecture. The proposed dynamic tiling and data layout transformation improved by 2.3 times in the performance. Moreover, with two-level pipelining, we achieve up to five times speedup and the proposed system is 3.8 times more energy-efficient than the GPU.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Edge FPGA-based Onsite Neural Network Training
    Chen, Ruiqi
    Zhang, Haoyang
    Li, Yu
    Zhang, Runzhou
    Li, Guoyu
    Yu, Jun
    Wang, Kun
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,
  • [42] An FPGA-based memristor emulator for artificial neural network
    Zhang, Zhang
    Li, Chao
    Zhang, Weiqi
    Zhou, Jing
    Liu, Gang
    MICROELECTRONICS JOURNAL, 2023, 131
  • [43] FPGA based convolution and memory architecture for Convolutional Neural Network
    Shahan, K. A.
    Rani, Sheeba J.
    2020 33RD INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2020 19TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID), 2020, : 183 - 188
  • [44] An FPGA-Based High-Throughput Keypoint Detection Accelerator Using Convolutional Neural Network for Mobile Robot Applications
    Li, Jingyuan
    Liu, Ye
    Huang, Kun
    Zhou, Liang
    Chang, Liang
    Zhou, Jun
    2022 IEEE ASIA PACIFIC CONFERENCE ON POSTGRADUATE RESEARCH IN MICROELECTRONICS AND ELECTRONICS, PRIMEASIA, 2022, : 81 - 84
  • [45] FPGA-based Accelerator for Deep Convolutional Neural Networks for the SPARK Environment
    Morcel, Raghid
    Ezzeddine, Mazen
    Akkary, Haitham
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 126 - 133
  • [46] FPGA-Based Memristor Emulator Circuit for Binary Convolutional Neural Networks
    Tolba, Mohammed F.
    Halawani, Yasmin
    Saleh, Hani
    Mohammad, Baker
    Al-Qutayri, Mahmoud
    IEEE ACCESS, 2020, 8 : 117736 - 117745
  • [47] Efficient Two-Stage Max-Pooling Engines for an FPGA-Based Convolutional Neural Network
    Hong, Eonpyo
    Choi, Kang-A
    Joo, Jhihoon
    ELECTRONICS, 2023, 12 (19)
  • [48] FPGA-Based Implementation of a Real-Time Object Recognition System Using Convolutional Neural Network
    Gilan, Ali Azarmi
    Emad, Mohammad
    Alizadeh, Bijan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (04) : 755 - 759
  • [49] A Study on the Design Procedure of Re-Configurable Convolutional Neural Network Engine for FPGA-Based Applications
    Kumar, Pervesh
    Ali, Imran
    Kim, Dong-Gyun
    Byun, Sung-June
    Kim, Dong-Gyu
    Pu, Young-Gun
    Lee, Kang-Yoon
    ELECTRONICS, 2022, 11 (23)
  • [50] FPGA-Based High-Performance Data Compression Deep Neural Network Accelerator
    Wang, Hanze
    Fu, Yingxun
    Ma, Li
    2022 INTERNATIONAL CONFERENCE ON BIG DATA, INFORMATION AND COMPUTER NETWORK (BDICN 2022), 2022, : 563 - 569