Floating-point multiply-add-fused with reduced latency

被引:45
作者
Lang, T [1 ]
Bruguera, JD
机构
[1] Univ Calif Irvine, Dept Elect Engn & Comp Sci, Irvine, CA 92697 USA
[2] Univ Santiago de Compostela, Dept Elect & Comp Engn, Santiago De Compostela 15782, Spain
基金
美国国家科学基金会;
关键词
computer arithmetic; floating-point functional units; multiply-add-fused (MAF) operation; VLSI design;
D O I
10.1109/TC.2004.44
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose an architecture for the computation of the double-precision floating-point multiply-add-fused (MAF) operation A + (B x C). This architecture is based on the combined addition and rounding (using a dual adder) and in the anticipation of the normalization step before the addition. Because the normalization is performed before the addition, it is not possible to overlap the leading-zero-anticipator with the adder. Consequently, to avoid the increase in delay, we modify the design of the LZA so that the leading bits of its output are produced first and can be used to begin the normalization. Moreover, parts of the addition are also anticipated. We have estimated the delay of the resulting architecture considering the load introduced by long connections, and we estimate a delay reduction of between 15 percent and 20 percent, with respect to previous implementations.
引用
收藏
页码:988 / 1003
页数:16
相关论文
共 16 条
  • [1] A novel design of a two operand normalization circuit
    Antelo, E
    Boo, M
    Bruguera, JD
    Zapata, EL
    [J]. IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1998, 6 (01) : 173 - 176
  • [2] BURGESS N, 1998, P SPIE
  • [3] Architectural design of a fast floating-point multiplication-add fused unit using signed-digit addition
    Chen, CY
    Chen, LA
    Cheng, JR
    [J]. EUROMICRO SYMPOSIUM ON DIGITAL SYSTEMS DESIGN, PROCEEDINGS, 2001, : 346 - 353
  • [4] EVALUATION OF A+B=K CONDITIONS WITHOUT CARRY PROPAGATION
    CORTADELLA, J
    LLABERIA, JM
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1992, 41 (11) : 1484 - 1488
  • [5] A comparison of three rounding algorithms for IEEE floating-point multiplication
    Even, G
    Seidel, PM
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 2000, 49 (07) : 638 - 650
  • [6] A dual floating point coprocessor with an FMAC architecture
    Heikes, C
    ColonBonet, G
    [J]. 1996 IEEE INTERNATIONAL SOLID-STATE CIRCUITS CONFERENCE, DIGEST OF TECHNICAL PAPERS, 1996, 39 : 354 - 355
  • [7] 2ND-GENERATION RISC FLOATING POINT WITH MULTIPLY-ADD FUSED
    HOKENEK, E
    MONTOYE, RK
    COOK, PW
    [J]. IEEE JOURNAL OF SOLID-STATE CIRCUITS, 1990, 25 (05) : 1207 - 1213
  • [8] Comparison of single- and dual-pass multiply-add fused floating-point units
    Jessani, RM
    Putrino, M
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1998, 47 (09) : 927 - 937
  • [9] A family of adders
    Knowles, S
    [J]. 14TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 1999, : 30 - 34
  • [10] POWER3: The next generation of PowerPC processors
    O'Connell, FP
    White, SW
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2000, 44 (06) : 873 - 884