A Deep Convolutional Neural Network Based on Nested Residue Number System

Cited by: 0
Authors
Nakahara, Hiroki [1]
Sasao, Tsutomu [2]
Affiliations
[1] Ehime Univ, Matsuyama, Ehime 790, Japan
[2] Meiji Univ, Tokyo 101, Japan
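Keywords
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;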
Abstract
A pre-trained deep convolutional neural network (DCNN) performs feed-forward computation and is widely used in embedded vision systems. In the DCNN, the 2D convolutional operation accounts for more than 90% of the computation time. Because the 2D convolutional operation performs a massive number of multiply-accumulate (MAC) operations, conventional realizations could not implement a fully parallel DCNN. The residue number system (RNS) decomposes an integer into a tuple of L integers, namely its residues with respect to a set of moduli. Because the moduli must be pairwise coprime, they necessarily differ from one another, so the conventional RNS decomposes the MAC unit into circuits of different sizes. As a result, the RNS cannot fully utilize the resources of an FPGA, which are of uniform size. In this paper, we propose the nested RNS (NRNS), which applies the RNS decomposition recursively. It can decompose the MAC unit into circuits of small size. In the DCNN using the NRNS, a 48-bit MAC unit is decomposed into 4-bit ones realized by the look-up tables of the FPGA. The system also uses binary-to-NRNS converters and NRNS-to-binary converters. The binary-to-NRNS converter is realized by on-chip BRAMs, while the NRNS-to-binary converter is realized by DSP blocks and BRAMs. This balanced usage of FPGA resources leads to a high clock frequency with less hardware. The ImageNet DCNN using the NRNS is implemented on a Xilinx Virtex VC707 evaluation board. In terms of performance per area, measured in GOPS (giga operations per second) per slice, the proposed design is 5.86 times better than the best existing realization.
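The decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the outer moduli (7, 11, 13), the inner moduli (5, 6, 7), and the names `to_rns`, `from_rns`, `nested_mul`, and `nrns_mac` are assumptions chosen only to keep the arithmetic small, and they do not correspond to the 48-bit configuration mentioned above. The sketch handles unsigned integers only and reduces each product back to the outer modulus before accumulating, which is one possible arrangement among several.

```python
from math import prod

def to_rns(x, moduli):
    """Decompose a non-negative integer into its residues (binary -> RNS)."""
    return tuple(x % m for m in moduli)

def from_rns(residues, moduli):
    """Recover the integer from its residues via the Chinese remainder theorem."""
    M = prod(moduli)
    total = 0
    for r, m in zip(residues, moduli):
        Mi = M // m
        # pow(Mi, -1, m) is the modular inverse of Mi mod m (Python 3.8+).
        total += r * Mi * pow(Mi, -1, m)
    return total % M

def nested_mul(a, b, outer_m, inner_moduli):
    """Multiply two residues mod outer_m using a smaller, inner RNS."""
    # The inner dynamic range must cover the largest possible product.
    assert (outer_m - 1) ** 2 < prod(inner_moduli)
    ra = to_rns(a, inner_moduli)
    rb = to_rns(b, inner_moduli)
    rp = tuple((x * y) % m for x, y, m in zip(ra, rb, inner_moduli))
    return from_rns(rp, inner_moduli) % outer_m

# Illustrative (assumed) moduli: the larger outer channels are nested.
OUTER = (7, 11, 13)                      # dynamic range 7 * 11 * 13 = 1001
INNER = {11: (5, 6, 7), 13: (5, 6, 7)}   # inner range 5 * 6 * 7 = 210

def nrns_mac(xs, ws):
    """Multiply-accumulate sum(x * w) computed channel by channel."""
    acc = [0] * len(OUTER)
    for x, w in zip(xs, ws):
        rx, rw = to_rns(x, OUTER), to_rns(w, OUTER)
        for i, m in enumerate(OUTER):
            if m in INNER:
                p = nested_mul(rx[i], rw[i], m, INNER[m])
            else:
                p = (rx[i] * rw[i]) % m
            acc[i] = (acc[i] + p) % m
    return from_rns(acc, OUTER)

if __name__ == "__main__":
    print(nrns_mac([3, 5, 7], [2, 4, 6]))  # 3*2 + 5*4 + 7*6
```

The result is exact only while the true sum stays below the outer dynamic range (1001 here). In this sketch every channel works on values smaller than 13, which mirrors the idea of replacing one wide MAC unit with several narrow ones of similar size.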
Pages: 6