An optimized floating-point matrix multiplication on FPGA

被引:7
作者
机构
[1] Embedded System and Network Laboratory, College of Information Science and Engineering, Hunan University, Changsha
[2] College of Information Science and Engineering, Hunan University, Changsha
来源
Zhang, T. | 1832年 / Asian Network for Scientific Information卷 / 12期
关键词
Floating-point; Matrix multiplication; Multiplier accumulator; Pipeline;
D O I
10.3923/itj.2013.1832.1838
中图分类号
学科分类号
摘要
Matrix multiplication is a kernel and fundamental operation in many applications including image, robotic and digital signal processing. The key component of matrix multiplication is Multiplier Accumulator (MAC) which is a decisive component for the performance of matrix multiplication. This study proposes a pipelined floating-point MAC architecture on Field Programmable Gate Array (FPGA) using a novel accumulating method. By adding the last N-stage results of the pipelined adder, the accumulation of the multiplier products can be obtained. Then, a matrix multiplication is implemented by employing parallel systolic structure based on the proposed MAC. Experimental results demonstrate that the proposed MAC architecture achieves higher clock speed and consumes less hardware resources than previous designs and the matrix multiplier has a good performance and scalability. It also can be concluded that the efficiency of the matrix multiplier is even higher when the matrices are larger. © 2031 Asian Network for Scientific Information.
引用
收藏
页码:1832 / 1838
页数:6
相关论文
共 50 条
[21]   High-Radix Formats for Enhancing Floating-Point FPGA Implementations [J].
Julio Villalba ;
Javier Hormigo .
Circuits, Systems, and Signal Processing, 2022, 41 :1683-1703
[22]   CLA based Floating-point adder suitable for chaotic generators on FPGA [J].
Hassan, Hossam S. ;
Ismail, Samar M. .
2018 30TH INTERNATIONAL CONFERENCE ON MICROELECTRONICS (ICM), 2018, :299-302
[23]   FPGA-based floating-point datapath design for geometry processing [J].
Xing, SZ ;
Yu, WWH .
CONFIGURABLE COMPUTING: TECHNOLOGY AND APPLICATIONS, 1998, 3526 :212-217
[24]   FPGA Implementation of CORDIC Algorithms for Sine and Cosine Floating-Point Calculations [J].
Sergiyenko, Anatoliy ;
Moroz, Leonid ;
Mychuda, Lesya ;
Samotyj, Volodymir .
PROCEEDINGS OF THE THE 11TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS (IDAACS'2021), VOL 1, 2021, :383-386
[25]   On-line IEEE floating-point multiplication and division for reduced power dissipation [J].
Seidel, PM .
CONFERENCE RECORD OF THE THIRTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2004, :498-502
[26]   An algorithm for converting floating-point computations to fixed-point in MATLAB based FPGA design [J].
Roy, S ;
Banerjee, P .
41ST DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2004, 2004, :484-487
[27]   Floating-point discrete wavelet transform-based image compression on FPGA [J].
Farghaly, Sarah H. ;
Ismail, Samar M. .
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2020, 124
[28]   Acceleration of Multiple Precision Matrix Multiplication Based on Multi-component Floating-Point Arithmetic Using AVX2 [J].
Kouya, Tomonori .
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT V, 2021, 12953 :202-217
[29]   Systolic Architecture for Integer Point Matrix Multiplication using FPGA [J].
Sonawane, D. N. ;
Sutaone, M. S. ;
Malek, Inayat .
ICIEA: 2009 4TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, VOLS 1-6, 2009, :3813-+
[30]   Termination of Floating-Point Computations [J].
Alexander Serebrenik ;
Danny De Schreye .
Journal of Automated Reasoning, 2005, 34 :141-177