A Fused Continuous Floating-Point MAC on FPGA

被引:0
|
作者
Yuan, Min [1 ]
Xing, Qianjian [1 ]
Ma, Zhenguo [1 ]
Yu, Feng [1 ]
Xu, Yingke [1 ]
机构
[1] Zhejiang Univ, Dept Instrument Engn, Hangzhou 310027, Zhejiang, Peoples R China
关键词
floating-point multiply-accumulator; fused algorithm; normalization and alignment; MULTIPLY-ADD; REDUCED LATENCY; UNIT; DESIGN;
D O I
10.1587/transfun.E101.A.1594
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this letter, we present a novel single-precision floating-point multiply-accumulator (FNA-MAC) to achieve lower hardware resource, reduced computing latency and improved computing accuracy for continuous dot product operations. By further fusing the normalization and alignment in the traditional FMA algorithm, the proposed architecture eliminates the first N - 1 normalization and rounding operations for an N point dot product, and preserves the precision of interim results in a significant bit size that is twice of that in the traditional methods. The normalization and rounding of the final result is processed at the cost of consuming an additional multiply-add operation. The simulation results show that the improvement in computational accuracy is significant. Meanwhile, when comparing to a recently published FMA design, the proposed FNA-MAC can reduce the slice look-up table/ flip-flop resource and computing latency by a fact of 18%, 33.3%, respectively.
引用
收藏
页码:1594 / 1598
页数:5
相关论文
共 50 条
  • [41] A Fused Floating-Point Four-Term Dot Product Unit
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2016, 63 (03) : 370 - 378
  • [42] A microprocessor with a 128-bit CPU, ten floating-point MAC's, four floating-point dividers, and an MPEG-2 decoder
    Suzuoki, M
    Kutaragi, K
    Hiroi, T
    Magoshi, K
    Okamoto, S
    Oka, M
    Ohba, A
    Yanamoto, Y
    Furuhashi, M
    Tanaka, M
    Yutaka, T
    Okada, T
    Nagamatsu, M
    Urakawa, Y
    Funyu, M
    Kunimatsu, A
    Goto, H
    Hashimoto, K
    Ide, N
    Murakami, H
    Ohtaguro, Y
    Aono, A
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1999, 34 (11) : 1608 - 1618
  • [43] UNIVERSAL FUSED FLOATING-POINT DOT-PRODUCT UNIT (UFDP)
    Saleh, Hani H.
    Mohammad, Baker S.
    2013 8TH INTERNATIONAL DESIGN AND TEST SYMPOSIUM (IDT), 2013,
  • [44] Multiple path IEEE floating-point fused multiply-add
    Seidel, PM
    PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 1359 - 1362
  • [45] Improved Architectures for a Fused Floating-Point Add-Subtract Unit
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2012, 59 (10) : 2285 - 2291
  • [46] A novel architecture for floating-point multiply-add-fused operation
    Sun, HP
    Gao, ML
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1675 - 1679
  • [47] Floating-Point Fused Multiply-Add under HUB Format
    Hormigo, Javier
    Villalba-Moreno, Julio
    Gonzalez-Navarro, Sonia
    2020 IEEE 27TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2020, : 1 - 8
  • [48] Design Issues and Implementations for Floating-Point Divide-Add Fused
    Amaricai, Alexandru
    Vladutiu, Mircea
    Boncalo, Oana
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2010, 57 (04) : 295 - 299
  • [49] A Fused Hybrid Floating-Point and Fixed-Point Dot-Product for FPGAs
    Lopes, Antonio Roldao
    Constantinides, George A.
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 157 - 168
  • [50] High-Radix Formats for Enhancing Floating-Point FPGA Implementations
    Julio Villalba
    Javier Hormigo
    Circuits, Systems, and Signal Processing, 2022, 41 : 1683 - 1703