An Energy-Efficient FPGA-based Matrix Multiplier

被引：0

作者：

Tan, Yiyu ^{[1
]}

Imamura, Toshiyuki ^{[1
]}

机构：

[1] RIKEN Adv Inst Computat Sci, Chuo Ku, 7-1-26 Minatojima Minami Machi, Kobe, Hyogo, Japan

来源：

2017 24TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS) | 2017年

关键词：

Matrix multiplication; FPGA; OpenCL; HIGH-PERFORMANCE; ACCELERATOR; CODESIGN;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Matrix multiplication is a fundamental operation of numerical linear algebra, and applied widely in high performance computing to solve scientific and engineering problems. It requires computer systems have huge computing capacity and data throughputs as problem size is increased, and consumes much more power. In this research, an OpenCL-based matrix multiplier is presented to improve energy efficiency. When data are single precision floating-point, and matrix dimension is 16384x16384, the matrix multiplier implemented by the FPGA board DE5a-NET achieves 240.34 GFLOPs in data throughput and 19.64 GFLOPs/W in energy efficiency, which are 296 times and 1964 times over the software simulation carried out on a PC with 32 GB DDR4 RAMs and an AMD processor Ryzen 7 1700 running at 3.0 GHz, respectively.

引用

页码：514 / 517

页数：4

共 14 条

[1]

Caulfield A., 2016, 49 ANN IEEE ACM INT

[2]

D'Hollander Erik H., 2016, ACM SIGARCH Computer Architecture News, V44, P74, DOI 10.1145/3039902.3039916

[3]

Dou S.Yong., 2005, Proceedings of the 2005 ACM/SIGDA 13th International Symposium on Field-Programmable Gate Arrays, FPGA'05, P86, DOI [DOI 10.1145/1046192.1046204, 10.1145/1046192.1046204]

[4]

Guiming Wu, 2010, Proceedings 2010 International Conference on Field-Programmable Technology (FPT 2010), P134, DOI 10.1109/FPT.2010.5681769

[5] Energy- and time-efficient matrix multiplication on FPGAs [J].

Jang, JW ;

Choi, SB ;

Prasanna, VK .

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2005, 13 (11) :1305-1319

[6]

Jian Ouyang, 2014 IEEE HOT CHIPS

[7] FPGA accelerator for floating-point matrix multiplication [J].

Jovanovic, Z. ;

Milutinovic, V. .

IET COMPUTERS AND DIGITAL TECHNIQUES, 2012, 6 (04) :249-256

[8] FPGA Based High Performance Double-Precision Matrix Multiplication [J].

Kumar, Vinay B. Y. ;

Joshi, Siddharth ;

Patkar, Sachin B. ;

Narayanan, H. .

INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2010, 38 (3-4) :322-338

[9]

Matam Kiran Kumar, 2013, IEEE INT C FIELD PRO, P1

[10] Algorithm, Architecture, and Floating-Point Unit Codesign of a Matrix Factorization Accelerator [J].

Pedram, Ardavan ;

Gerstlauer, Andreas ;

van de Geijn, Robert A. .

IEEE TRANSACTIONS ON COMPUTERS, 2014, 63 (08) :1854-1867

← 1 2 →