Design tradeoff analysis of floating-point adders in FPGAs

被引:7
作者
Malik, Ali [1 ]
Chen, Dongdong [1 ]
Choi, Younhee [1 ]
Lee, Moon Ho [2 ]
Ko, Seok-Bum [1 ]
机构
[1] Univ Saskatchewan, Dept Elect & Comp Engn, Saskatoon, SK S7N 5A9, Canada
[2] Chonbuk Natl Univ, Elect & Informat Engn Dept, Jeonju 561756, Jeonbuk, South Korea
来源
CANADIAN JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING-REVUE CANADIENNE DE GENIE ELECTRIQUE ET INFORMATIQUE | 2008年 / 33卷 / 3-4期
关键词
floating-point adder; FPGA;
D O I
10.1109/CJECE.2008.4721634
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With gate counts of ten million, field-programmable gate arrays (FPGAs) are becoming suitable for floating-point computations. Addition is the most complex operation in a floating-point unit and can cause major delay while requiring a significant area. Over the years, the VLSI community has developed many floating-point adder algorithms aimed primarily at reducing the overall latency. An efficient design of the floating-point adder offers major area and performance improvements for FPGAs. Given recent advances in FPGA architecture and area density, latency has become the main focus in attempts to improve performance. This paper studies the implementation of standard; leading-one predictor (LOP); and far and close datapath (2-path) floating-point addition algorithms in FPGAs. Each algorithm has complex sub-operations which contribute significantly to the overall latency of the design. Each of the sub-operations is researched for different implementations and is then synthesized onto a Xilinx Vertex-II Pro FPGA device. Standard and LOP algorithms are also pipelined into five stages and compared with the Xilinx IP. According to the results, the standard algorithm is the best implementation with respect to area, but has a large overall latency of 27.059 ns while occupying 541 slices. The LOP algorithm reduces latency by 6.5% at the cost of a 38% increase in area compared to the standard algorithm. The 2-path implementation shows a 19% reduction in latency with an added expense of 88% in area compared to the standard algorithm. The five-stage standard pipeline implementation shows a 6.4% improvement in clock speed compared to the Xilinx IP with a 23% smaller area requirement. The five-stage pipelined LOP implementation shows a 22% improvement in clock speed compared to the Xilinx IP at a cost of 15% more area.
引用
收藏
页码:169 / 175
页数:7
相关论文
共 50 条
  • [21] Design of Floating-point Operand Memory Controller based on FPGA
    Li, Kejian
    Li, Yang
    Ke, Baozhong
    Lei, Lin
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 792 - 796
  • [22] Design and Implementation of Adaptive Binary Divider for Fixed-Point and Floating-Point Numbers
    Satyajit Bora
    Roy Paily
    Circuits, Systems, and Signal Processing, 2022, 41 : 1131 - 1145
  • [23] FPGA-based floating-point datapath design for geometry processing
    Xing, SZ
    Yu, WWH
    CONFIGURABLE COMPUTING: TECHNOLOGY AND APPLICATIONS, 1998, 3526 : 212 - 217
  • [24] Accurate Parallel Floating-Point Accumulation
    Kadric, Edin
    Gurniak, Paul
    DeHon, Andre
    2013 21ST IEEE SYMPOSIUM ON COMPUTER ARITHMETIC (ARITH), 2013, : 153 - 162
  • [25] Accurate Parallel Floating-Point Accumulation
    Kadric, Edin
    Gurniak, Paul
    DeHon, Andre
    IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (11) : 3224 - 3238
  • [26] Improving Floating-Point Performance in Less Area: Fractured Floating Point Units (FFPUs)
    Neil Hockert
    Katherine Compton
    Journal of Signal Processing Systems, 2012, 67 : 31 - 46
  • [27] Improving Floating-Point Performance in Less Area: Fractured Floating Point Units (FFPUs)
    Hockert, Neil
    Compton, Katherine
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2012, 67 (01): : 31 - 46
  • [28] Return of the hardware floating-point elementary function
    Detrey, Jeremie
    de Dinechin, Florent
    Pujol, Xavier
    18TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 2007, : 161 - +
  • [29] The implementation of the enhanced quantum floating-point adder
    Zhao, Shantao
    Li, Haisheng
    Li, Guiqiong
    Tang, Xiaohu
    MODERN PHYSICS LETTERS A, 2022, 37 (26)
  • [30] Evaluation of a Floating-Point Intensive Kernel on FPGA
    Jin, Zheming
    Finkel, Hal
    Yoshii, Kazutomo
    Cappello, Franck
    EURO-PAR 2017: PARALLEL PROCESSING WORKSHOPS, 2018, 10659 : 664 - 675