Improved Architectures for a Fused Floating-Point Add-Subtract Unit

Cited by: 23
Authors
Sohn, Jongwook [1, 2]
Swartzlander, Earl E., Jr. [1]
Affiliations
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] Intel Corp, Austin, TX 78746 USA
Keywords
Digital signal processing (DSP); floating-point arithmetic; fused floating-point operation; high-speed computer arithmetic; reduced latency; execution unit
DOI
10.1109/TCSI.2012.2188955
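Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract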
This paper presents improved architectures for a fused floating-point add-subtract unit. The fused floating-point add-subtract unit is useful for digital signal processing (DSP) applications such as fast Fourier transform (FFT) and discrete cosine transform (DCT) butterfly operations. To improve the performance of the fused floating-point add-subtract unit, a dual-path algorithm and pipelining are employed. The proposed designs are implemented for both single and double precision and synthesized with a 45-nm standard-cell library. The fused floating-point add-subtract unit saves 40% of the area and power consumption compared to a discrete floating-point add-subtract unit. The proposed dual-path design reduces the latency by 30% compared to the discrete design, with area and power consumption between those of the discrete and fused designs. Based on a data flow analysis, the proposed fused dual-path floating-point add-subtract unit can be split into two pipeline stages. Since the latencies of the two pipeline stages are fairly well balanced, the throughput is increased by 80% compared to the nonpipelined dual-path design.
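As an illustration of why a combined add-subtract operation suits butterfly computations, the following is a minimal behavioural sketch in Python. It is not the authors' hardware design: it models only the input/output behaviour (one operand pair in, sum and difference out) and says nothing about the dual-path algorithm, the pipelining, or hardware rounding. The names `fused_add_sub` and `butterfly` are hypothetical and chosen for this example.

```python
from typing import Tuple


def fused_add_sub(a: float, b: float) -> Tuple[float, float]:
    """Return (a + b, a - b) for one operand pair.

    Behavioural model only (assumption): a hardware unit would produce
    both results from a single pass over the operands; here they are
    simply two ordinary floating-point operations.
    """
    return a + b, a - b


def butterfly(x0: complex, x1: complex, w: complex) -> Tuple[complex, complex]:
    """Radix-2 decimation-in-time butterfly: (x0 + w*x1, x0 - w*x1).

    Each of the real and imaginary parts needs a sum and a difference
    of the same two operands, which is the add-subtract pattern.
    """
    t = x1 * w  # twiddle-factor multiplication
    re_sum, re_diff = fused_add_sub(x0.real, t.real)
    im_sum, im_diff = fused_add_sub(x0.imag, t.imag)
    return complex(re_sum, im_sum), complex(re_diff, im_diff)


if __name__ == "__main__":
    # Two-point DFT of (1+2j, 3-1j) with twiddle factor 1.
    print(butterfly(1 + 2j, 3 - 1j, 1 + 0j))
```

In this sketch each butterfly issues two add-subtract calls where a discrete implementation would issue two additions and two subtractions, which is the operation pairing the abstract refers to.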
Pages: 2285-2291
Number of pages: 7