Improved Architectures for a Fused Floating-Point Add-Subtract Unit

Cited by: 23
Authors
Sohn, Jongwook [1, 2]
Swartzlander, Earl E., Jr. [1]
Affiliations
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] Intel Corp, Austin, TX 78746 USA
Keywords
Digital signal processing (DSP); floating-point arithmetic; fused floating-point operation; high-speed computer arithmetic; reduced latency; execution unit
DOI
10.1109/TCSI.2012.2188955
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification codes
0808; 0809
Abstract
This paper presents improved architectures for a fused floating-point add-subtract unit. The fused floating-point add-subtract unit is useful for digital signal processing (DSP) applications such as fast Fourier transform (FFT) and discrete cosine transform (DCT) butterfly operations. To improve the performance of the fused floating-point add-subtract unit, a dual-path algorithm and pipelining are employed. The proposed designs are implemented for both single and double precision and synthesized with a 45-nm standard-cell library. The fused floating-point add-subtract unit saves 40% of the area and power consumption compared to a discrete floating-point add-subtract unit. The proposed dual-path design reduces the latency by 30% compared to the discrete design with area and power consumption between those of the discrete and fused designs. Based on a data flow analysis, the proposed fused dual-path floating-point add-subtract unit can be split into two pipeline stages. Since the latencies of two pipeline stages are fairly well balanced, the throughput is increased by 80% compared to the nonpipelined dual-path design.
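To illustrate why an FFT butterfly benefits from a fused add-subtract operation, the following minimal Python sketch computes a radix-2 butterfly in which every sum and its matching difference come from the same operand pair. The names `add_sub` and `radix2_butterfly` are hypothetical and chosen only for this illustration. The code models the arithmetic pattern alone, not the hardware sharing or the dual-path and pipelined architectures described in the abstract.

```python
from typing import Tuple


def add_sub(a: float, b: float) -> Tuple[float, float]:
    """Software stand-in for a fused add-subtract (hypothetical helper).

    Returns (a + b, a - b) from a single operand pair. In hardware, a fused
    unit would share logic between the two results; here they are simply
    computed side by side.
    """
    return a + b, a - b


def radix2_butterfly(x0: complex, x1: complex, w: complex) -> Tuple[complex, complex]:
    """Radix-2 butterfly: returns (x0 + w*x1, x0 - w*x1).

    The real and imaginary parts each need one sum and one difference of the
    same two operands, which is the pattern a fused add-subtract serves.
    """
    t = x1 * w  # twiddle-factor multiplication
    re_sum, re_diff = add_sub(x0.real, t.real)
    im_sum, im_diff = add_sub(x0.imag, t.imag)
    return complex(re_sum, im_sum), complex(re_diff, im_diff)


if __name__ == "__main__":
    upper, lower = radix2_butterfly(1 + 2j, 3 + 4j, 1 + 0j)
    print(upper, lower)
```

Each butterfly in this sketch calls `add_sub` twice, so a discrete implementation would need four separate floating-point additions or subtractions for the same work.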
Pages: 2285-2291
Page count: 7
Related papers
50 in total
  • [31] Design of a double-precision floating-point multiply-add-fused unit with consideration of data dependence
    Li, Zhaolin
    Li, Gongqiong
    IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: EMERGING VLSI TECHNOLOGIES AND ARCHITECTURES, 2007: 492-497
  • [32] Multiple-Mode Floating-Point Multiply-Add Fused Unit for Trading Accuracy with Power Consumption
    Wu, Kun-Yi
    Liang, Chih-Yuan
    Yu, Kee-Khuan
    Kuang, Shiann-Rong
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013: 429-435
  • [33] Optimized Design of a Double-Precision Floating-Point Multiply-Add-Fused Unit for Data Dependence
    Li, Gongqiong
    Li, Zhaolin
    2007 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, VOLS 1 AND 2, 2007: 311-316
  • [34] Design of an extended floating-point multiply-add-fused unit for exploiting instruction-level parallelism
    Li, Zhaolin
    Li, Gongqiong
    2007 INTERNATIONAL SYMPOSIUM ON INTEGRATED CIRCUITS, VOLS 1 AND 2, 2007: 17-20
  • [35] Decimal floating-point fused multiply-add with redundant internal encodings
    Han, Liu
    Zhang, Hao
    Ko, Seok-Bum
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2016, 10 (04): 147-156
  • [36] Speed-Independent Fused Multiply Add and Subtract Unit
    Stepchenkov, Yuri
    Zakharov, Victor
    Rogdestvenski, Yuri
    Diachenko, Yuri
    Morozov, Nickolaj
    Stepchenkov, Dmitri
    PROCEEDINGS OF 2016 IEEE EAST-WEST DESIGN & TEST SYMPOSIUM (EWDTS), 2016
  • [37] A Fused Floating-Point Four-Term Dot Product Unit
    Sohn, Jongwook
    Swartzlander, Earl E., Jr.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2016, 63 (03): 370-378
  • [38] Efficient-Fused Architectures for FFT Processor Using Floating-Point Arithmetic
    Eniyan, D. Tamil
    Harirajkumar, J.
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2015, 2016, 394: 1029-1038
  • [39] UNIVERSAL FUSED FLOATING-POINT DOT-PRODUCT UNIT (UFDP)
    Saleh, Hani H.
    Mohammad, Baker S.
    2013 8TH INTERNATIONAL DESIGN AND TEST SYMPOSIUM (IDT), 2013
  • [40] Architectural design of a fast floating-point multiplication-add fused unit using signed-digit addition
    Chen, CY
    Chen, LA
    Cheng, JR
    EUROMICRO SYMPOSIUM ON DIGITAL SYSTEMS DESIGN, PROCEEDINGS, 2001: 346-353