Deep Learning Accelerators' Configuration Space Exploration Effect on Performance and Resource Utilization: A Gemmini Case Study

被引:3
作者
Gookyi, Dennis Agyemanh Nana [1 ]
Lee, Eunchong [2 ]
Kim, Kyungho [2 ]
Jang, Sung-Joon [2 ]
Lee, Sang-Seol [2 ]
机构
[1] CSIR, Inst Sci & Technol Informat, Elect Div, Accra, Ghana
[2] Korea Elect Technol Inst, Intelligent Image Proc Res Ctr, Seongnam Si 13488, South Korea
关键词
deep learning; hardware accelerators; open-source; Gemmini; systolic array; GEMM; output; weight stationary dataflow; FPGA; image-to-column;
D O I
10.3390/s23052380
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Though custom deep learning (DL) hardware accelerators are attractive for making inferences in edge computing devices, their design and implementation remain a challenge. Open-source frameworks exist for exploring DL hardware accelerators. Gemmini is an open-source systolic array generator for agile DL accelerator exploration. This paper details the hardware/software components generated using Gemmini. The general matrix-to-matrix multiplication (GEMM) of different dataflow options, including output/weight stationary (OS/WS), was explored in Gemmini to estimate the performance relative to a CPU implementation. The Gemmini hardware was implemented on an FPGA device to explore the effect of several accelerator parameters, including array size, memory capacity, and the CPU/hardware image-to-column (im2col) module, on metrics such as the area, frequency, and power. This work revealed that regarding the performance, the WS dataflow offered a speedup of 3x relative to the OS dataflow, and the hardware im2col operation offered a speedup of 1.1x relative to the operation on the CPU. For hardware resources, an increase in the array size by a factor of 2 led to an increase in both the area and power by a factor of 3.3, and the im2col module led to an increase in area and power by factors of 1.01 and 1.06, respectively.
引用
收藏
页数:26
相关论文
共 38 条
  • [1] Anderson A, 2018, Arxiv, DOI arXiv:1710.01079
  • [2] [Anonymous], SIFIVE ELF2HEX
  • [3] [Anonymous], RISCV BOOM
  • [4] [Anonymous], GEMM
  • [5] [Anonymous], EXPL GEMM
  • [6] [Anonymous], ROCK COR
  • [7] [Anonymous], FSD CHIP TESL
  • [8] [Anonymous], NVDLA OP SOURC PROJ
  • [9] [Anonymous], AM EC2 INF1 INST
  • [10] [Anonymous], ?About us"