The Unified Accumulator Architecture: A Configurable, Portable, and Extensible Floating-Point Accumulator

被引：3

作者：

Wilson, David ^{[1
]}

Stitt, Greg ^{[1
]}

机构：

[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA

来源：

ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS | 2016年 / 9卷 / 03期

基金：

美国国家科学基金会;

关键词：

Design; Algorithms; Performance; FPGA; floating-point accumulation; reduction circuits; FPGA; ACCURATE; SINGLE; CPU;

D O I：

10.1145/2809432

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Applications accelerated by field-programmable gate arrays (FPGAs) often require pipelined floating-point accumulators with a variety of different trade-offs. Although previous work has introduced numerous floating-point accumulation architectures, few cores are available for public use, which forces designers to use fixed-point implementations or vendor-provided cores that are not portable and are often not optimized for the desired set of trade-offs. In this article, we combine and extend previous floating-point accumulator architectures into a configurable, open-source core, referred to as the unified accumulator architecture (UAA), which enables designers to choose between different trade-offs for different applications. UAA is portable across FPGAs and allows designers to specialize the underlying adder core to take advantage of device-specific optimizations. By providing an extensible, open-source implementation, we hope for the research community to extend the provided core with new architectures and optimizations.

引用

页数：23

共 26 条

[21] Single Precision Natural Logarithm Architecture for Hard Floating-Point and DSP-Enabled FPGAs [J].

Langhammer, Martin ;

Pasca, Bogdan .

2016 IEEE 23ND SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2016, :164-171

[22] An Area-Efficient Iterative Single-Precision Floating-Point Multiplier Architecture for FPGA [J].

Kim, Sunwoong ;

Rutenbar, Rob A. .

GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, :87-92

[23] An Architecture of Area-Effective High Radix Floating-Point Divider With Low-Power Consumption [J].

Yang, Yuheng ;

Yuan, Qing ;

Liu, Jian .

IEEE ACCESS, 2021, 9 :40039-40048

[24] GH CORDIC-Based Architecture for Computing Nth Root of Single-Precision Floating-Point Number [J].

Wang, Yuxuan ;

Luo, Yuanyong ;

Wang, Zhongfeng ;

Shen, Qinghong ;

Pan, Hongbing .

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (04) :864-875

[25] A Low-Cost Floating-Point Dot-Product-Dual-Accumulate Architecture for HPC-Enabled AI [J].

Tan, Hongbing ;

Huang, Libo ;

Zheng, Zhong ;

Guo, Hui ;

Yang, Qianmin ;

Shen, Li ;

Chen, Gang ;

Xiao, Liquan ;

Xiao, Nong .

IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (02) :681-693

[26] A pipelined-loop-compatible architecture and algorithm to reduce variable-length sets of floating-point data on a reconfigurable computer [J].

Morris, Gerald R. ;

Prasanna, Viktor K. .

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (07) :913-921

← 1 2 3 →