A Fused Continuous Floating-Point MAC on FPGA

被引：0

作者：

Yuan, Min ^{[1
]}

Xing, Qianjian ^{[1
]}

Ma, Zhenguo ^{[1
]}

Yu, Feng ^{[1
]}

Xu, Yingke ^{[1
]}

机构：

[1] Zhejiang Univ, Dept Instrument Engn, Hangzhou 310027, Zhejiang, Peoples R China

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2018年 / E101A卷 / 09期

关键词：

floating-point multiply-accumulator; fused algorithm; normalization and alignment; MULTIPLY-ADD; REDUCED LATENCY; UNIT; DESIGN;

D O I：

10.1587/transfun.E101.A.1594

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this letter, we present a novel single-precision floating-point multiply-accumulator (FNA-MAC) to achieve lower hardware resource, reduced computing latency and improved computing accuracy for continuous dot product operations. By further fusing the normalization and alignment in the traditional FMA algorithm, the proposed architecture eliminates the first N - 1 normalization and rounding operations for an N point dot product, and preserves the precision of interim results in a significant bit size that is twice of that in the traditional methods. The normalization and rounding of the final result is processed at the cost of consuming an additional multiply-add operation. The simulation results show that the improvement in computational accuracy is significant. Meanwhile, when comparing to a recently published FMA design, the proposed FNA-MAC can reduce the slice look-up table/ flip-flop resource and computing latency by a fact of 18%, 33.3%, respectively.

引用

页码：1594 / 1598

页数：5

共 50 条

[41] A Fused Floating-Point Four-Term Dot Product Unit
Sohn, Jongwook
Swartzlander, Earl E., Jr.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2016, 63 (03) : 370 - 378
[42] A microprocessor with a 128-bit CPU, ten floating-point MAC's, four floating-point dividers, and an MPEG-2 decoder
Suzuoki, M
Kutaragi, K
Hiroi, T
Magoshi, K
Okamoto, S
Oka, M
Ohba, A
Yanamoto, Y
Furuhashi, M
Tanaka, M
Yutaka, T
Okada, T
Nagamatsu, M
Urakawa, Y
Funyu, M
Kunimatsu, A
Goto, H
Hashimoto, K
Ide, N
Murakami, H
Ohtaguro, Y
Aono, A
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1999, 34 (11) : 1608 - 1618
[43] UNIVERSAL FUSED FLOATING-POINT DOT-PRODUCT UNIT (UFDP)
Saleh, Hani H.
Mohammad, Baker S.
2013 8TH INTERNATIONAL DESIGN AND TEST SYMPOSIUM (IDT), 2013,
[44] Multiple path IEEE floating-point fused multiply-add
Seidel, PM
PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 1359 - 1362
[45] Improved Architectures for a Fused Floating-Point Add-Subtract Unit
Sohn, Jongwook
Swartzlander, Earl E., Jr.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2012, 59 (10) : 2285 - 2291
[46] A novel architecture for floating-point multiply-add-fused operation
Sun, HP
Gao, ML
ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1675 - 1679
[47] Floating-Point Fused Multiply-Add under HUB Format
Hormigo, Javier
Villalba-Moreno, Julio
Gonzalez-Navarro, Sonia
2020 IEEE 27TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2020, : 1 - 8
[48] Design Issues and Implementations for Floating-Point Divide-Add Fused
Amaricai, Alexandru
Vladutiu, Mircea
Boncalo, Oana
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2010, 57 (04) : 295 - 299
[49] A Fused Hybrid Floating-Point and Fixed-Point Dot-Product for FPGAs
Lopes, Antonio Roldao
Constantinides, George A.
RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 157 - 168
[50] High-Radix Formats for Enhancing Floating-Point FPGA Implementations
Julio Villalba
Javier Hormigo
Circuits, Systems, and Signal Processing, 2022, 41 : 1683 - 1703

← 1 2 3 4 5 →