3D FFTs on a Single FPGA

Cited by: 13
Authors
Humphries, Benjamin [1]
Zhang, Hansen [1]
Sheng, Jiayi [1]
Landaverde, Raphael [1]
Herbordt, Martin C. [1]
Affiliations
[1] Boston Univ, Dept Elect & Comp Engn, Boston, MA 02215 USA
Source
2014 IEEE 22ND ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2014) | 2014
Keywords
High Performance Reconfigurable Computing; FFT
DOI
10.1109/FCCM.2014.28
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The 3D FFT is critical in many physical simulations and image processing applications. On FPGAs, however, the 3D FFT was thought to be inefficient relative to other methods such as convolution-based implementations of multigrid. We find the opposite: a simple design, operating at a conservative frequency, takes 4 μs for 16³, 21 μs for 32³, and 215 μs for 64³ single precision data points. The first two of these compare favorably with the 25 μs and 29 μs obtained running on a current Nvidia GPU. Some broader significance is that this is a critical piece in implementing a large scale FPGA-based MD engine: even a single FPGA is capable of keeping the FFT off of the critical path for a large fraction of possible MD simulations.
Pages: 68-71
Page count: 4