A Deep Convolutional Neural Network Based on Nested Residue Number System

Cited by: 0

Authors:

Nakahara, Hiroki [1]

Sasao, Tsutomu [2]

Affiliations:

[1] Ehime Univ, Matsuyama, Ehime 790, Japan

[2] Meiji Univ, Tokyo 101, Japan

Source:

2015 25TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS | 2015

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP3 [Computing technology, computer technology];

Subject classification code:

0812;

Abstract:

A pre-trained deep convolutional neural network (DCNN), viewed as a feed-forward computation, is widely used in embedded vision systems. In the DCNN, the 2D convolutional operation occupies more than 90% of the computation time. Since the 2D convolutional operation performs a massive number of multiply-accumulate (MAC) operations, conventional realizations could not implement a fully parallel DCNN. The residue number system (RNS) decomposes an integer into a tuple of L integers, namely its residues with respect to a set of moduli. Since no two moduli share a common factor, the conventional RNS decomposes the MAC unit into circuits of different sizes. This means that the RNS cannot make good use of an FPGA's uniformly sized resources. In this paper, we propose the nested RNS (NRNS), which recursively decomposes the RNS. It can decompose the MAC unit into small circuits. In the DCNN using the NRNS, a 48-bit MAC unit is decomposed into 4-bit ones realized by look-up tables of the FPGA. The system also uses binary-to-NRNS converters and NRNS-to-binary converters. The binary-to-NRNS converter is realized by on-chip BRAMs, while the NRNS-to-binary converter is realized by DSP blocks and BRAMs. Thus, a balanced usage of FPGA resources leads to a high clock frequency with less hardware. The ImageNet DCNN using the NRNS is implemented on a Xilinx Virtex VC707 evaluation board. In terms of performance per area, measured in GOPS (giga-operations per second) per slice, the proposed design is 5.86 times better than the best existing realization.
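
The decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' hardware design: the moduli (7, 11, 13), the inner moduli (5, 6, 7), and all function names are assumptions chosen only for illustration. The sketch shows a MAC carried out independently per residue channel, reconstruction by the Chinese remainder theorem, and one "nested" channel in which arithmetic modulo a larger modulus is itself performed in a smaller inner RNS.

```python
from math import gcd, prod

def to_rns(x, moduli):
    """Return the residues of x with respect to each modulus."""
    return tuple(x % m for m in moduli)

def rns_mac(acc, a, b, moduli):
    """Multiply-accumulate per channel; no carries cross between channels."""
    return tuple((r + p * q) % m for r, p, q, m in zip(acc, a, b, moduli))

def from_rns(residues, moduli):
    """Reconstruct the integer via the Chinese remainder theorem."""
    # The moduli must be pairwise coprime for the result to be unique.
    assert all(gcd(p, q) == 1
               for i, p in enumerate(moduli) for q in moduli[i + 1:])
    M = prod(moduli)
    total = 0
    for r, m in zip(residues, moduli):
        Mi = M // m
        total += r * Mi * pow(Mi, -1, m)  # modular inverse (Python 3.8+)
    return total % M

def nested_mac(acc, a, b, m, inner):
    """One MAC modulo m, computed inside a smaller inner RNS (illustrative).

    Assumes acc, a, b are in [0, m) and that (m - 1)**2 + (m - 1) is less
    than prod(inner), so the inner RNS holds acc + a*b without overflow.
    """
    assert (m - 1) ** 2 + (m - 1) < prod(inner)
    inner_result = rns_mac(to_rns(acc, inner), to_rns(a, inner),
                           to_rns(b, inner), inner)
    return from_rns(inner_result, inner) % m

# Hypothetical example values, not taken from the paper.
MODULI = (7, 11, 13)   # dynamic range 7 * 11 * 13 = 1001
INNER = (5, 6, 7)      # dynamic range 210, enough for one MAC modulo 11 or 13
weights = [3, 5, 2]
pixels = [10, 20, 30]

acc = to_rns(0, MODULI)
for w, x in zip(weights, pixels):
    acc = rns_mac(acc, to_rns(w, MODULI), to_rns(x, MODULI), MODULI)
result = from_rns(acc, MODULI)  # equals 3*10 + 5*20 + 2*30 as long as it is < 1001
```

In this sketch the channels modulo 11 and 13 need wider arithmetic than the channel modulo 7, which mirrors the non-uniform circuit sizes mentioned in the abstract; `nested_mac` replaces such a channel with operations on residues no larger than 6.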

Pages: 6
