Improved Architectures for a Fused Floating-Point Add-Subtract Unit

被引：23

作者：

Sohn, Jongwook ^{[1
,2
]}

Swartzlander, Earl E., Jr. ^{[1
]}

机构：

[1] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA

[2] Intel Corp, Austin, TX 78746 USA

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2012年 / 59卷 / 10期

关键词：

Digital signal processing (DSP); floating-point arithmetic; fused floating-point operation; high-speed computer arithmetic; REDUCED LATENCY; EXECUTION UNIT;

D O I：

10.1109/TCSI.2012.2188955

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents improved architectures for a fused floating-point add-subtract unit. The fused floating-point add-subtract unit is useful for digital signal processing (DSP) applications such as fast Fourier transform (FFT) and discrete cosine transform (DCT) butterfly operations. To improve the performance of the fused floating-point add-subtract unit, a dual-path algorithm and pipelining are employed. The proposed designs are implemented for both single and double precision and synthesized with a 45-nm standard-cell library. The fused floating-point add-subtract unit saves 40% of the area and power consumption compared to a discrete floating-point add-subtract unit. The proposed dual-path design reduces the latency by 30% compared to the discrete design with area and power consumption between that of the discrete and fused designs. Based on a data flow analysis, the proposed fused dual-path floating-point add-subtract unit can be split into two pipeline stages. Since the latencies of two pipeline stages are fairly well balanced, the throughput is increased by 80% compared to the nonpipelined dual-path design.

引用

页码：2285 / 2291

页数：7

共 50 条

[21] Reconfigurable half-precision floating-point real/complex fused multiply and add unit
Nesam, J. Jean Jenifer
Sivanantham, S.
INTERNATIONAL JOURNAL OF MATERIALS & PRODUCT TECHNOLOGY, 2020, 60 (01): : 58 - 72
[22] Implementation of Low Power and Area Efficient Floating-Point Fused Multiply-Add Unit
Dhanabal, R.
Sahoo, Sarat Kumar
Bharathi, V.
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING SYSTEMS, ICSCS 2015, VOL 1, 2016, 397 : 329 - 342
[23] Reconfigurable half-precision floating-point real/complex fused multiply and add unit
Jean Jenifer Nesam J.
Sivanantham S.
International Journal of Materials and Product Technology, 2020, 60 (01) : 58 - 72
[24] A Floating-Point Fused Dot-Product Unit
Saleh, Hani H.
Swartzlander, Earl E., Jr.
2008 IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN, 2008, : 427 - +
[25] A new architecture for multiple-precision floating-point multiply-add fused unit design
Huang, Libo
Shen, Li
Dai, Kui
Wang, Zhiying
18TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2007, : 69 - +
[26] Multiple path IEEE floating-point fused multiply-add
Seidel, PM
PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 1359 - 1362
[27] Floating-Point Fused Multiply-Add under HUB Format
Hormigo, Javier
Villalba-Moreno, Julio
Gonzalez-Navarro, Sonia
2020 IEEE 27TH SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2020, : 1 - 8
[28] Design Issues and Implementations for Floating-Point Divide-Add Fused
Amaricai, Alexandru
Vladutiu, Mircea
Boncalo, Oana
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2010, 57 (04) : 295 - 299
[29] A novel architecture for floating-point multiply-add-fused operation
Sun, HP
Gao, ML
ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1675 - 1679
[30] A Decimal Floating-point Fused Multiply-Add Unit with a Novel Decimal Leading-zero Anticipator
Akkas, Ahmet
Schulte, Michael J.
ASAP 2011 - 22ND IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP 2011), 2011, : 43 - 50

← 1 2 3 4 5 →