RISC-V Barrel Processor for Deep Neural Network Acceleration

被引:14
作者
AskariHemmat, MohammadHossein [1 ]
Bilaniuk, Olexa [2 ]
Wagner, Sean [3 ]
Savaria, Yvon [1 ]
David, Jean-Pierre [1 ]
机构
[1] Ecole Polytech Montreal, Elect Engn Dept, Montreal, PQ, Canada
[2] Univ Montreal, Mila, Montreal, PQ, Canada
[3] IBM Canada, Markham, ON, Canada
来源
2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2021年
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1109/ISCAS51556.2021.9401617
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a barrel RISC-V processor designed to control a deep neural network accelerator. Our design has a 5-stage pipeline data path with 8 hardware threads (harts). Each thread is executed under a strict round robin scheduler and is responsible for providing data and control signals to a neural network processing element (PE). Each PE is capable of arbitrary precision GEneral Matrix Vector (GEMV) operations. The execution of each thread is independent of other threads and any communication between threads are sent through shared memory via software. To reduce the area required for implementation, our processor is an implementation of the RV32I plus a set of custom CSRs for controlling the PEs. Our design passes all riscv test written in assembly and compiled with RISC-V gcc. Our 8-hart barrel processor runs at 250 MHz with CPI of 1 and consumes 0.372W. To demonstrate the capabilities of our design, we computed a GEMV operation with an input matrix size of 8 by 128 and a weight matrix size of 128 by 128 with two-bit precision in only 16 clock cycles.
引用
收藏
页数:5
相关论文
共 23 条
[1]   Tera computer system [J].
Alverson, Robert ;
Callahan, David ;
Cummings, Daniel ;
Koblenz, Brian ;
Porterfield, Allan ;
Smith, Burton .
Conference Proceedings - International Conference on Supercomputing, 1990,
[2]  
[Anonymous], 2020, VIRTEX 6 FPGA CONFIG
[3]  
[Anonymous], 2020, THIS REPOSITORY HOST
[4]  
Bilaniuk O., 2019, 2019 IEEE International Symposium on Circuits and Systems (ISCAS), P1
[5]  
Choi J., 2019, P MACHINE LEARNING S, P348
[6]  
Fotland D., 2003, HOT CHIPS 15
[7]  
Fraser N. J., 2017, P 8 WORKSH 6 WORKSH, P25
[8]   GRVI Phalanx: A Massively Parallel RISC-V FPGA Accelerator Accelerator [J].
Gray, Jan .
2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, :17-20
[9]  
Hacene G. B., 2018, ARXIV181211337
[10]  
Hennessey John., 2011, Computer architecture a quantitative approach, V5th