An Adaptive Row-based Weight Reuse Scheme for FPGA Implementation of Convolutional Neural Networks

被引:0
作者
Je, Hyeonseung [1 ]
Duy Thanh Nguyen [1 ]
Lee, Kyujoong [2 ]
Lee, Hyuk-Jae [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Sunmoon Univ, Dept Elect Engn, Asan, South Korea
来源
2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC) | 2021年
关键词
FPGA; Convolutional neural networks; U-Net; Row-reuse scheme; Adaptive;
D O I
10.1109/ITC-CSCC52171.2021.9501490
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is an increasing need to implement the Convolutional Neural network (CNN) with an FPGA thanks to its design flexibility over an ASIC and low power consumption over a GPU. The size of the network and the resource of the target FPGA board should be considered to deploy the CNN Network successfully. However, previous works use the fixed dataflow which is not optimized for each layer. As a result, high on-chip buffer utilization and frequent memory access are required. The row-based weight reuse scheme is efficient in reducing input/output buffer size. However, it causes resource underutilization for layers with small feature maps size. This paper proposes an adaptive row reuse scheme by applying each level of row-reuse for each layer depending on its characteristic. Finally, the proposed design is implemented with a Xilinx KCU1500 board, and the accelerator achieves 994.74 GOPS of the throughput for U-Net. For general CNN implementation, the proposed scheme achieves 1080.9 GOPS when running VGG16 with 1.7 times less buffer size compared to previous works.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] FPGA Implementation of Radio Frequency Neural Networks
    Bhatia, Amit
    Robinson, Josh
    Carmack, Joseph
    Kuzdeba, Scott
    2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 613 - 618
  • [42] Implementation of Data-optimized FPGA-based Accelerator for Convolutional Neural Network
    Cho, Mannhee
    Kim, Youngmin
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [43] All Binarized Convolutional Neural Network and Its implementation on an FPGA
    Shimoda, Masayuki
    Sato, Shimpei
    Nakahara, Hiroki
    2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 291 - 294
  • [44] Optimizing Loop Operation and Dataflow in FPGA Acceleration of Deep Convolutional Neural Networks
    Ma, Yufei
    Cao, Yu
    Vrudhula, Sarma
    Seo, Jae-sun
    FPGA'17: PROCEEDINGS OF THE 2017 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS, 2017, : 45 - 54
  • [45] FPGA implementation of a near computation free image compression scheme based on adaptive decimation
    Wu, A
    Tsang, PWM
    Tang, J
    INTEGRATION-THE VLSI JOURNAL, 2003, 36 (03) : 121 - 143
  • [46] Adaptive activation functions in convolutional neural networks
    Qian, Sheng
    Liu, Hua
    Liu, Cheng
    Wu, Si
    Wong, Hau San
    NEUROCOMPUTING, 2018, 272 : 204 - 212
  • [47] A Block-Floating-Point Arithmetic Based FPGA Accelerator for Convolutional Neural Networks
    Zhang, Heshan
    Liu, Zhenyu
    Zhang, Guanwen
    Dai, Jiwu
    Lian, Xiaocong
    Zhou, Wei
    Ji, Xiangyang
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [48] Morphological Galaxy Classification Using Convolutional Neural Networks on FPGA
    Barnwal, Rahul
    Kala, S.
    2024 IEEE SPACE, AEROSPACE AND DEFENCE CONFERENCE, SPACE 2024, 2024, : 190 - 193
  • [49] Throughput-Optimized OpenCL-based FPGA Accelerator for Large-Scale Convolutional Neural Networks
    Suda, Naveen
    Chandra, Vikas
    Dasika, Ganesh
    Mohanty, Abinash
    Ma, Yufei
    Vrudhula, Sarma
    Seo, Jae-Sun
    Cao, Yu
    PROCEEDINGS OF THE 2016 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'16), 2016, : 16 - 25
  • [50] Scheme of Signal Processing in a Multimode Communication Receiver Based on Convolutional Neural Networks
    O. S. Sidelnikov
    A. A. Redyuk
    M. P. Fedoruk
    Bulletin of the Lebedev Physics Institute, 2023, 50 : S336 - S342