A floating point conversion algorithm for mixed precision computations

被引:0
作者
Hoo, Choon Lih [1 ]
Haris, Sallehuddin Mohamed [1 ]
Mohamed, Nik Abdullah Nik [1 ]
机构
[1] Univ Kebangsaan Malaysia, Dept Mech & Mat Engn, Ukm Bangi 43600, Malaysia
来源
JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE C-COMPUTERS & ELECTRONICS | 2012年 / 13卷 / 09期
关键词
Double precision; Single precision; FPGA; Verilog; HooHar algorithm;
D O I
10.1631/jzus.C1200043
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The floating point number is the most commonly used real number representation for digital computations due to its high precision characteristics. It is used on computers and on single chip applications such as DSP chips. Double precision (64-bit) representations allow for a wider range of real numbers to be denoted. However, single precision (32-bit) operations are more efficient. Recently, there has been an increasing interest in mixed precision computations which take advantage of single precision efficiency on 64-bit numbers. This calls for the ability to interchange between the two formats. In this paper, an algorithm that converts floating point numbers from 64-to 32-bit representations is presented. The algorithm was implemented as a Verilog code and tested on field programmable gate array (FPGA) using the Quartus II DE2 board and Agilent 16821A portable logic analyzer. Results indicate that the algorithm can perform the conversion reliably and accurately within a constant execution time of 25 ns with a 20 MHz clock frequency regardless of the number being converted.
引用
收藏
页码:711 / 718
页数:8
相关论文
共 16 条
  • [1] Anzt H., 2011, 2011 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, P829, DOI 10.1109/IPDPS.2011.226
  • [2] Anzt H, 2011, LECT NOTES COMPUT SC, V6449, P58, DOI 10.1007/978-3-642-19328-6_8
  • [3] Accelerating scientific computations with mixed precision algorithms
    Baboulin, Marc
    Buttari, Alfredo
    Dongarra, Jack
    Kurzak, Jakub
    Langou, Julie
    Langou, Julien
    Luszczek, Piotr
    Tomov, Stanimire
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2009, 180 (12) : 2526 - 2533
  • [4] Boldo S, 2001, CONF REC ASILOMAR C, P1299, DOI 10.1109/ACSSC.2001.987700
  • [5] Christopher V., 2012, IEEE754
  • [6] Goddeke D., 2005, S SIMULATION TECHNIQ, P139
  • [7] Performance and accuracy of hardware-oriented native-, emulated- and mixed-precision solvers in FEM simulations
    Goeddeke, Dominik
    Strzodka, Robert
    Turek, Stefan
    [J]. INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2007, 22 (04) : 221 - 256
  • [8] Differential predictive floating-point analog-to-digital converter
    Groza, V
    Dzerdz, B
    [J]. MEASUREMENT, 2004, 35 (02) : 139 - 151
  • [9] FPGA implementation of adaptive ANN controller for speed regulation of permanent magnet stepper motor drives
    Hasanien, Hany M.
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2011, 52 (02) : 1252 - 1257
  • [10] Hollasch S., 2005, 754 IEEE