Improved Architectures for a Fused Floating-Point Add-Subtract Unit

被引:23
|
作者
Sohn, Jongwook [1 ,2 ]
Swartzlander, Earl E., Jr. [1 ]
机构
[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
[2] Intel Corp, Austin, TX 78746 USA
关键词
Digital signal processing (DSP); floating-point arithmetic; fused floating-point operation; high-speed computer arithmetic; REDUCED LATENCY; EXECUTION UNIT;
D O I
10.1109/TCSI.2012.2188955
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents improved architectures for a fused floating-point add-subtract unit. The fused floating-point add-subtract unit is useful for digital signal processing (DSP) applications such as fast Fourier transform (FFT) and discrete cosine transform (DCT) butterfly operations. To improve the performance of the fused floating-point add-subtract unit, a dual-path algorithm and pipelining are employed. The proposed designs are implemented for both single and double precision and synthesized with a 45-nm standard-cell library. The fused floating-point add-subtract unit saves 40% of the area and power consumption compared to a discrete floating-point add-subtract unit. The proposed dual-path design reduces the latency by 30% compared to the discrete design with area and power consumption between that of the discrete and fused designs. Based on a data flow analysis, the proposed fused dual-path floating-point add-subtract unit can be split into two pipeline stages. Since the latencies of two pipeline stages are fairly well balanced, the throughput is increased by 80% compared to the nonpipelined dual-path design.
引用
收藏
页码:2285 / 2291
页数:7
相关论文
共 50 条
  • [21] Reconfigurable half-precision floating-point real/complex fused multiply and add unit
    Nesam, J. Jean Jenifer
    Sivanantham, S.
    INTERNATIONAL JOURNAL OF MATERIALS & PRODUCT TECHNOLOGY, 2020, 60 (01): : 58 - 72
  • [22] Implementation of Low Power and Area Efficient Floating-Point Fused Multiply-Add Unit
    Dhanabal, R.
    Sahoo, Sarat Kumar
    Bharathi, V.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING SYSTEMS, ICSCS 2015, VOL 1, 2016, 397 : 329 - 342
  • [23] Reconfigurable half-precision floating-point real/complex fused multiply and add unit
    Jean Jenifer Nesam J.
    Sivanantham S.
    International Journal of Materials and Product Technology, 2020, 60 (01) : 58 - 72
  • [24] A Floating-Point Fused Dot-Product Unit
    Saleh, Hani H.
    Swartzlander, Earl E., Jr.
    2008 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2008, : 427 - +
  • [25] A new architecture for multiple-precision floating-point multiply-add fused unit design
    Huang, Libo
    Shen, Li
    Dai, Kui
    Wang, Zhiying
    18TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2007, : 69 - +
  • [26] Multiple path IEEE floating-point fused multiply-add
    Seidel, PM
    PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 1359 - 1362
  • [27] Floating-Point Fused Multiply-Add under HUB Format
    Hormigo, Javier
    Villalba-Moreno, Julio
    Gonzalez-Navarro, Sonia
    2020 IEEE 27TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2020, : 1 - 8
  • [28] Design Issues and Implementations for Floating-Point Divide-Add Fused
    Amaricai, Alexandru
    Vladutiu, Mircea
    Boncalo, Oana
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2010, 57 (04) : 295 - 299
  • [29] A novel architecture for floating-point multiply-add-fused operation
    Sun, HP
    Gao, ML
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1675 - 1679
  • [30] A Decimal Floating-point Fused Multiply-Add Unit with a Novel Decimal Leading-zero Anticipator
    Akkas, Ahmet
    Schulte, Michael J.
    ASAP 2011 - 22ND IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2011), 2011, : 43 - 50