The Unified Accumulator Architecture: A Configurable, Portable, and Extensible Floating-Point Accumulator

被引:3
作者
Wilson, David [1 ]
Stitt, Greg [1 ]
机构
[1] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611 USA
基金
美国国家科学基金会;
关键词
Design; Algorithms; Performance; FPGA; floating-point accumulation; reduction circuits; FPGA; ACCURATE; SINGLE; CPU;
D O I
10.1145/2809432
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Applications accelerated by field-programmable gate arrays (FPGAs) often require pipelined floating-point accumulators with a variety of different trade-offs. Although previous work has introduced numerous floating-point accumulation architectures, few cores are available for public use, which forces designers to use fixed-point implementations or vendor-provided cores that are not portable and are often not optimized for the desired set of trade-offs. In this article, we combine and extend previous floating-point accumulator architectures into a configurable, open-source core, referred to as the unified accumulator architecture (UAA), which enables designers to choose between different trade-offs for different applications. UAA is portable across FPGAs and allows designers to specialize the underlying adder core to take advantage of device-specific optimizations. By providing an extensible, open-source implementation, we hope for the research community to extend the provided core with new architectures and optimizations.
引用
收藏
页数:23
相关论文
共 26 条
[21]   Single Precision Natural Logarithm Architecture for Hard Floating-Point and DSP-Enabled FPGAs [J].
Langhammer, Martin ;
Pasca, Bogdan .
2016 IEEE 23ND SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2016, :164-171
[22]   An Area-Efficient Iterative Single-Precision Floating-Point Multiplier Architecture for FPGA [J].
Kim, Sunwoong ;
Rutenbar, Rob A. .
GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, :87-92
[23]   An Architecture of Area-Effective High Radix Floating-Point Divider With Low-Power Consumption [J].
Yang, Yuheng ;
Yuan, Qing ;
Liu, Jian .
IEEE ACCESS, 2021, 9 :40039-40048
[24]   GH CORDIC-Based Architecture for Computing Nth Root of Single-Precision Floating-Point Number [J].
Wang, Yuxuan ;
Luo, Yuanyong ;
Wang, Zhongfeng ;
Shen, Qinghong ;
Pan, Hongbing .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2020, 28 (04) :864-875
[25]   A Low-Cost Floating-Point Dot-Product-Dual-Accumulate Architecture for HPC-Enabled AI [J].
Tan, Hongbing ;
Huang, Libo ;
Zheng, Zhong ;
Guo, Hui ;
Yang, Qianmin ;
Shen, Li ;
Chen, Gang ;
Xiao, Liquan ;
Xiao, Nong .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (02) :681-693
[26]   A pipelined-loop-compatible architecture and algorithm to reduce variable-length sets of floating-point data on a reconfigurable computer [J].
Morris, Gerald R. ;
Prasanna, Viktor K. .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (07) :913-921