An Energy-Efficient FPGA-based Matrix Multiplier

被引：0

作者：

Tan, Yiyu ^{[1
]}

Imamura, Toshiyuki ^{[1
]}

机构：

[1] RIKEN Adv Inst Computat Sci, Chuo Ku, 7-1-26 Minatojima Minami Machi, Kobe, Hyogo, Japan

来源：

2017 24TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS) | 2017年

关键词：

Matrix multiplication; FPGA; OpenCL; HIGH-PERFORMANCE; ACCELERATOR; CODESIGN;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Matrix multiplication is a fundamental operation of numerical linear algebra, and applied widely in high performance computing to solve scientific and engineering problems. It requires computer systems have huge computing capacity and data throughputs as problem size is increased, and consumes much more power. In this research, an OpenCL-based matrix multiplier is presented to improve energy efficiency. When data are single precision floating-point, and matrix dimension is 16384x16384, the matrix multiplier implemented by the FPGA board DE5a-NET achieves 240.34 GFLOPs in data throughput and 19.64 GFLOPs/W in energy efficiency, which are 296 times and 1964 times over the software simulation carried out on a PC with 32 GB DDR4 RAMs and an AMD processor Ryzen 7 1700 running at 3.0 GHz, respectively.

引用

页码：514 / 517

页数：4

共 14 条

[11] Codesign Tradeoffs for High-Performance, Low-Power Linear Algebra Architectures [J].

Pedram, Ardavan ;

van de Geijn, Robert A. ;

Gerstlauer, Andreas .

IEEE TRANSACTIONS ON COMPUTERS, 2012, 61 (12) :1724-1736

[12] Processor arrays generation for matrix algorithms used in embedded platforms implemented on FPGAs [J].

Perez-Andrade, Roberto ;

Torres-Huitzil, Cesar ;

Cumplido, Rene .

MICROPROCESSORS AND MICROSYSTEMS, 2015, 39 (07) :576-588

[13]

Putnam A, 2014, CONF PROC INT SYMP C, P13, DOI 10.1109/ISCA.2014.6853195

[14] High-performance designs for linear algebra operations on reconfigurable hardware [J].

Zhuo, Ling ;

Prasanna, Viktor K. .

IEEE TRANSACTIONS ON COMPUTERS, 2008, 57 (08) :1057-1071

← 1 2 →