Floating-Point Fused Multiply-Add under HUB Format

被引:0
|
作者
Hormigo, Javier [1 ]
Villalba-Moreno, Julio [1 ]
Gonzalez-Navarro, Sonia [1 ]
机构
[1] Univ Malaga, Dept Comp Architecture, Andalucia Tech, Malaga, Spain
来源
2020 IEEE 27TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH) | 2020年
关键词
Fused multiplication-addition; HUB format; DSP applications; Deep-learning;
D O I
10.1109/ARITH48897.2020.00010
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Half-Unit-Biased (HUB) format has interesting advantages for implementing floating-point arithmetic which has been proved for the four basic arithmetic operations as well as square root. Nevertheless, although Floating-point Fused Multiply-add (FMA) operation (AxB + C) is one of the most important and complex arithmetic instructions in modern processors, FMA operation for HUB numbers has not been confronted yet. In this paper, we present a design to deal with this operation under HUB format. The key points to turn the conventional FMA architecture into a HUB unit are explained. Comparing the ASIC implementation of a HUB FMA unit with the conventional one, the former reduces the required area and power up to 38% and 35%, respectively, for single-precision. For BFloat16, the HUB FMA increases the speed a 15%, and even then, reduces the area and power by 26% and 12%, respectively.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [1] Floating-point fused multiply-add architectures
    Quinnell, Eric
    Swartzlander, Earl E., Jr.
    Lemonds, Carl
    CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 331 - +
  • [2] Fused Multiply-Add for Variable Precision Floating-Point
    Nannarelli, Alberto
    32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 342 - 347
  • [3] Floating-point fused multiply-add with reduced latency
    Lang, T
    Bruguera, JD
    ICCD'2002: IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 2002, : 145 - 150
  • [4] Bridge Floating-Point Fused Multiply-Add Design
    Quinnell, Eric
    Swartzlander, Earl E., Jr.
    Lemonds, Carl
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2008, 16 (12) : 1726 - 1730
  • [5] Floating-point fused multiply-add: Reduced latency for floating-point addition
    Bruguera, JD
    Lang, T
    17TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2005, : 42 - 51
  • [6] Multiple path IEEE floating-point fused multiply-add
    Seidel, PM
    PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 1359 - 1362
  • [7] Decimal floating-point fused multiply-add with redundant internal encodings
    Han, Liu
    Zhang, Hao
    Ko, Seok-Bum
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2016, 10 (04): : 147 - 156
  • [8] An efficient multiple precision floating-point Multiply-Add Fused unit
    Manolopoulos, K.
    Reisis, D.
    Chouliaras, V. A.
    MICROELECTRONICS JOURNAL, 2016, 49 : 10 - 18
  • [9] Proxy Bits for Low Cost Floating-Point Fused Multiply-Add Unit
    Kim, Hyunpil
    Moon, Sangook
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2016, 25 (10)
  • [10] Development of a RISC-V-conform fused multiply-add floating-point unit
    Kaiser F.
    Kosnac S.
    Brüning U.
    Supercomputing Frontiers and Innovations, 2019, 6 (02) : 64 - 74