Decimal floating-point fused multiply-add with redundant internal encodings

被引:2
|
作者
Han, Liu [1 ]
Zhang, Hao [1 ]
Ko, Seok-Bum [1 ]
机构
[1] Univ Saskatchewan, Dept Elect & Comp Engn, 57 Campus Dr, Saskatoon, SK S7N 5A9, Canada
来源
IET COMPUTERS AND DIGITAL TECHNIQUES | 2016年 / 10卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
floating point arithmetic; encoding; decimal floating-point fused multiply-add; redundant internal encodings; DFP arithmetic; FMA function; decimal redundant encoding system; rounding operation; critical path reduction; UNIT;
D O I
10.1049/iet-cdt.2015.0058
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Decimal floating-point (DFP) arithmetic has attracted attention in the applications of financial and commercial computing. However, the processing efficiency of DFP is still far away from that of binary designs. On the other hand, a floating-point fused multiply-add (FMA) function is widely used in many processors within functional iterations to implement division, square root, and many other functions due to the better accuracy achieved by a single rounding of continuous multiplication and addition. In this work, a new architecture of FMA is proposed to speed up the DFP processing. Compared with previous architectures, first, the proposed design applies a specific decimal redundant encoding system. The circuits to decide and shift the rounding position on a redundant result are therefore simplified. Second, the only digit-set conversion in the entire design is combined with the rounding operation to further reduce the critical path. Third, the techniques applied in different previous FMAs are merged in the proposed design. In addition the multiplier and adder referred to the previous designs are further optimised. Consequently, compared with the fastest previous design, the synthesis results show about 33.7% speed advantage and about 16.6% area advantage.
引用
收藏
页码:147 / 156
页数:10
相关论文
共 50 条
  • [41] Decimal Floating-Point Multiplication
    Erle, Mark A.
    Hickmann, Brian J.
    Schulte, Michael J.
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (07) : 902 - 916
  • [42] A decimal floating-point specification
    Cowlishaw, MF
    Schwarz, EM
    Smith, RM
    Webb, CF
    ARITH-15 2001: 15TH SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2001, : 147 - 154
  • [43] Formally Verified Argument Reduction with a Fused Multiply-Add
    Boldo, Sylvie
    Daumas, Marc
    Li, Ren-Cang
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (08) : 1139 - 1145
  • [44] Design of a double-precision floating-point multiply-add-fused unit with consideration of data dependence
    Li, Zhaolin
    Li, Gongqiong
    IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: EMERGING VLSI TECHNOLOGIES AND ARCHITECTURES, 2007, : 492 - 497
  • [45] Design of an extended floating-point multiply-add-fused unit for exploiting instruction-level parallelism
    Li, Zhaolin
    Li, Gongqiong
    2007 INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS, VOLS 1 AND 2, 2007, : 17 - 20
  • [46] Optimized Design of a Double-Precision Floating-Point Multiply-Add-Fused Unit for Data Dependence
    Li, Gongqiong
    Li, Zhaolin
    2007 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, VOLS, 1 AND 2, 2007, : 311 - 316
  • [47] Fast decimal floating-point division
    Nikmehr, Hooman
    Phillips, Braden
    Lim, Cheng-Chew
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2006, 14 (09) : 951 - 961
  • [48] Decimal floating-point: Algorism for computers
    Cowlishaw, MF
    16TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2003, : 104 - 111
  • [49] Fused Multiply-Add Microarchitecture Comprising Separate Early-Normalizing Multiply and Add Pipelines
    Lutz, David R.
    2011 20TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH-20), 2011, : 123 - 128
  • [50] Mechanical derivation of fused multiply-add algorithms for linear transforms
    Voronenko, Yevgen
    Pueschel, Markus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (09) : 4458 - 4473