Floating-Point Fused Multiply-Add under HUB Format

被引：0

作者：

Hormigo, Javier ^{[1
]}

Villalba-Moreno, Julio ^{[1
]}

Gonzalez-Navarro, Sonia ^{[1
]}

机构：

[1] Univ Malaga, Dept Comp Architecture, Andalucia Tech, Malaga, Spain

来源：

2020 IEEE 27TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH) | 2020年

关键词：

Fused multiplication-addition; HUB format; DSP applications; Deep-learning;

D O I：

10.1109/ARITH48897.2020.00010

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The Half-Unit-Biased (HUB) format has interesting advantages for implementing floating-point arithmetic which has been proved for the four basic arithmetic operations as well as square root. Nevertheless, although Floating-point Fused Multiply-add (FMA) operation (AxB + C) is one of the most important and complex arithmetic instructions in modern processors, FMA operation for HUB numbers has not been confronted yet. In this paper, we present a design to deal with this operation under HUB format. The key points to turn the conventional FMA architecture into a HUB unit are explained. Comparing the ASIC implementation of a HUB FMA unit with the conventional one, the former reduces the required area and power up to 38% and 35%, respectively, for single-precision. For BFloat16, the HUB FMA increases the speed a 15%, and even then, reduces the area and power by 26% and 12%, respectively.

引用

页码：1 / 8

页数：8

共 50 条

[1] Floating-point fused multiply-add architectures
Quinnell, Eric
Swartzlander, Earl E., Jr.
Lemonds, Carl
CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 331 - +
[2] Fused Multiply-Add for Variable Precision Floating-Point
Nannarelli, Alberto
32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 342 - 347
[3] Floating-point fused multiply-add with reduced latency
Lang, T
Bruguera, JD
ICCD'2002: IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN: VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 2002, : 145 - 150
[4] Bridge Floating-Point Fused Multiply-Add Design
Quinnell, Eric
Swartzlander, Earl E., Jr.
Lemonds, Carl
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2008, 16 (12) : 1726 - 1730
[5] Floating-point fused multiply-add: Reduced latency for floating-point addition
Bruguera, JD
Lang, T
17TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2005, : 42 - 51
[6] Multiple path IEEE floating-point fused multiply-add
Seidel, PM
PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 1359 - 1362
[7] Decimal floating-point fused multiply-add with redundant internal encodings
Han, Liu
Zhang, Hao
Ko, Seok-Bum
IET COMPUTERS AND DIGITAL TECHNIQUES, 2016, 10 (04): : 147 - 156
[8] An efficient multiple precision floating-point Multiply-Add Fused unit
Manolopoulos, K.
Reisis, D.
Chouliaras, V. A.
MICROELECTRONICS JOURNAL, 2016, 49 : 10 - 18
[9] Proxy Bits for Low Cost Floating-Point Fused Multiply-Add Unit
Kim, Hyunpil
Moon, Sangook
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2016, 25 (10)
[10] Development of a RISC-V-conform fused multiply-add floating-point unit
Kaiser F.
Kosnac S.
Brüning U.
Supercomputing Frontiers and Innovations, 2019, 6 (02) : 64 - 74

← 1 2 3 4 5 →