FPGA based convolution and memory architecture for Convolutional Neural Network

被引:2
作者
Shahan, K. A. [1 ]
Rani, Sheeba J. [1 ]
机构
[1] Indian Inst Space Sci & Technol, Dept Avion, Thiruvananthapuram, Kerala, India
来源
2020 33RD INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2020 19TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID) | 2020年
关键词
convolution; neural network; winograd efficient; hardware; architecture; deep convolutional neural network; memory reuse; FPGA;
D O I
10.1109/VLSID49098.2020.00049
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional Neural Networks(CNNs) are widely used in vision based applications to increase the performance but at the cost of higher storage and increase in computation. Hardware implementations of CNN are limited by the computational complexity and bandwidth while accessing off-chip memory. In this work a novel FPGA based hardware architecture for 2D convolution operation with reduced computational complexity using Winograd's 2D minimal filtering algorithm and a memory architecture to reduce on-chip read operations to access adjacent input data tiles for convolution operations is proposed to accelerate CNNs. An on-chip memory bank reuse architecture is also utilized to reduce the number of memory read and write operations to off-chip memory. The proposed architecture for convolution operation achieves lower computational complexity by reducing the number of multiplication operations without proportionate increase in number of addition operations compared to prior implementations. The number of data read operations from on-chip memory is reduced by 4 times and using the on-chip memory bank reuse scheme latency associated with accessing intermediate data is reduced. The implemented uses 16-bit fixed point representation which could reduce bit width to save area and energy. Virtex Ultra scale+ VCU118 Evaluation Board 2.0 populated with XCVU9P-L2FLGA2104 is used as the platform for implementing the design. VGG Net based CNN is used for the implementation. The computation time for individual convolutional layer is also estimated and it is found to be reduced. For a 3x3 kernel the number of multiplications is reduced to 4 from 9 compared to standard convolution operation and the number of addition operations reduced to 12 from 14 compared to prior hardware implementations of Winograd's 2D minimal filtering algorithm.
引用
收藏
页码:183 / 188
页数:6
相关论文
共 50 条
  • [21] Cordic-based Softmax Acceleration Method of Convolution Neural Network on FPGA
    Cao, Yongxiang
    Xiao, Wan'ang
    Jia, Jingdun
    Wu, Dehua
    Zhou, Weixin
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 66 - 70
  • [22] FPGA-based Accelerator for Convolutional Neural Network Application in Mobile Robotics
    Mazzetto, Lucas F. R.
    Castanho, Jose E. C.
    2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 433 - 438
  • [23] An Energy-Efficient FPGA-based Convolutional Neural Network Implementation
    Irmak, Hasan
    Alachiotis, Nikolaos
    Ziener, Daniel
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [24] An Efficient FPGA-Based Dilated and Transposed Convolutional Neural Network Accelerator
    Wu, Tsung-Hsi
    Shu, Chang
    Liu, Tsung-Te
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (11) : 5178 - 5186
  • [25] Implementation of energy-efficient fast convolution algorithm for deep convolutional neural networks based on FPGA
    Li, W. -J.
    Ruan, S. -J.
    Yang, D. -S.
    ELECTRONICS LETTERS, 2020, 56 (10) : 485 - 487
  • [26] An FPGA-Based Computation-Efficient Convolutional Neural Network Accelerator
    Archana, V. S.
    2022 IEEE INTERNATIONAL POWER AND RENEWABLE ENERGY CONFERENCE, IPRECON, 2022,
  • [27] Scalable FPGA-Based Convolutional Neural Network Accelerator for Embedded Systems
    Zhao, Jingyuan
    Yin, Zhendong
    Zhao, Yanlong
    Wu, Mingyang
    Xu, Mingdong
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND APPLICATIONS (ICCIA 2019), 2019, : 36 - 40
  • [28] An FPGA-Based Microinstruction Sequence Driven Spaceborne Convolution Neural Network Accelerator
    Guo Z.-B.
    Liu K.
    Hu H.-T.
    Li Y.-D.
    Qu Z.-X.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (10): : 2047 - 2064
  • [29] FPNet: Customized Convolutional Neural Network for FPGA Platforms
    Yang, Yang
    Wang, Chao
    Gong, Lei
    Zhou, Xuehai
    2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019), 2019, : 399 - 402
  • [30] EdgeNet: SqueezeNet like Convolution Neural Network on Embedded FPGA
    Pradeep, Kathirgamaraja
    Kamalavasan, Kamalakkannan
    Natheesan, Ratnasegar
    Pasqual, Ajith
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2018, : 85 - 88