Decimal floating-point fused multiply-add with redundant internal encodings

被引：2

作者：

Han, Liu ^{[1
]}

Zhang, Hao ^{[1
]}

Ko, Seok-Bum ^{[1
]}

机构：

[1] Univ Saskatchewan, Dept Elect & Comp Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada

来源：

IET COMPUTERS AND DIGITAL TECHNIQUES | 2016年 / 10卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

floating point arithmetic; encoding; decimal floating-point fused multiply-add; redundant internal encodings; DFP arithmetic; FMA function; decimal redundant encoding system; rounding operation; critical path reduction; UNIT;

D O I：

10.1049/iet-cdt.2015.0058

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Decimal floating-point (DFP) arithmetic has attracted attention in the applications of financial and commercial computing. However, the processing efficiency of DFP is still far away from that of binary designs. On the other hand, a floating-point fused multiply-add (FMA) function is widely used in many processors within functional iterations to implement division, square root, and many other functions due to the better accuracy achieved by a single rounding of continuous multiplication and addition. In this work, a new architecture of FMA is proposed to speed up the DFP processing. Compared with previous architectures, first, the proposed design applies a specific decimal redundant encoding system. The circuits to decide and shift the rounding position on a redundant result are therefore simplified. Second, the only digit-set conversion in the entire design is combined with the rounding operation to further reduce the critical path. Third, the techniques applied in different previous FMAs are merged in the proposed design. In addition the multiplier and adder referred to the previous designs are further optimised. Consequently, compared with the fastest previous design, the synthesis results show about 33.7% speed advantage and about 16.6% area advantage.

引用

页码：147 / 156

页数：10

共 50 条

[41] Decimal Floating-Point Multiplication
Erle, Mark A.
Hickmann, Brian J.
Schulte, Michael J.
IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (07) : 902 - 916
[42] A decimal floating-point specification
Cowlishaw, MF
Schwarz, EM
Smith, RM
Webb, CF
ARITH-15 2001: 15TH SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2001, : 147 - 154
[43] Formally Verified Argument Reduction with a Fused Multiply-Add
Boldo, Sylvie
Daumas, Marc
Li, Ren-Cang
IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (08) : 1139 - 1145
[44] Design of a double-precision floating-point multiply-add-fused unit with consideration of data dependence
Li, Zhaolin
Li, Gongqiong
IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: EMERGING VLSI TECHNOLOGIES AND ARCHITECTURES, 2007, : 492 - 497
[45] Design of an extended floating-point multiply-add-fused unit for exploiting instruction-level parallelism
Li, Zhaolin
Li, Gongqiong
2007 INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS, VOLS 1 AND 2, 2007, : 17 - 20
[46] Optimized Design of a Double-Precision Floating-Point Multiply-Add-Fused Unit for Data Dependence
Li, Gongqiong
Li, Zhaolin
2007 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, VOLS, 1 AND 2, 2007, : 311 - 316
[47] Fast decimal floating-point division
Nikmehr, Hooman
Phillips, Braden
Lim, Cheng-Chew
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2006, 14 (09) : 951 - 961
[48] Decimal floating-point: Algorism for computers
Cowlishaw, MF
16TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2003, : 104 - 111
[49] Fused Multiply-Add Microarchitecture Comprising Separate Early-Normalizing Multiply and Add Pipelines
Lutz, David R.
2011 20TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH-20), 2011, : 123 - 128
[50] Mechanical derivation of fused multiply-add algorithms for linear transforms
Voronenko, Yevgen
Pueschel, Markus
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (09) : 4458 - 4473

← 1 2 3 4 5 →